Operations Engineering Manager, Fleet Reliability
CoreWeave · Livingston, NJ / Richmond, VA / Las Vegas, NV / Bellevue, WA · $143k – $210k · Posted 23 February 2026
Job Description
<div class="content-intro"><div>
<div>
<div class="gmail_quote">
<div>
<div><span id="m_1770241969069985273m_-2746164444908759431gmail-docs-internal-guid-131e4fb0-7fff-b4e9-ff50-e8cf32449b1b">CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at <a href="http://www.coreweave.com/" target="_blank" data-saferedirecturl="https://www.google.com/url?q=http://www.coreweave.comsource=gmailust=1762613132717000usg=AOvVaw3D-UOhNaqEvF5BEWxjYyAU">www.coreweave.com</a>.</span></div>
</div>
</div>
</div>
</div></div><h3><strong>What You'll Do</strong></h3>
<p>The Fleet Reliability Operations Team is the heart of CoreWeave’s capacity delivery and maintenance effort. This team is responsible for provisioning, updating, and triaging server nodes, and for executing the processes and tooling that configure and validate our server fleet. This team is the first in line to respond to hardware issues in production, and is empowered to drive the design and prioritization of automation and observability across our server fleet lifecycle.</p>
<p>We are seeking an Operations Manager for the Fleet Reliability Operations team who can help us maintain and improve our high volume of delivery as we scale the fleet to 10x its size. This individual will develop a strong pipeline of talent, manage onboarding and training, provide process and thought leadership across the team’s domain, and champion reliability and customer satisfaction. As the manager of this team, you would have the opportunity to:</p>
<ul>
<li>Build and lead a 24/7 team of process-oriented, reliability- and observability-focused engineers.</li>
<li>Lead the socialization and documentation of clear and consistent processes for provisioning, validating and troubleshooting nodes in our server fleet.</li>
<li>Think critically about and advocate for process and automation improvements, prioritizing event-driven automated remediation as the end goal.</li>
<li>Provide a 24/7 engineering support function for high-criticality, time-sensitive node delivery and maintenance.</li>
<li>Drive and improve our program of onboarding, documentation, enablement, and performance management to help your team members achieve new heights of personal growth and capability.</li>
<li>Set the culture and tone for how your team keeps score, both in how they communicate with and support each other and in how they enable the rest of CoreWeave.</li>
</ul>
<h3><strong>Who You Are</strong></h3>
<ul>
<li>You have seven or more years of experience in the software or infrastructure engineering industry, of which at least two years were in a leadership capacity.</li>
<li>You have a background that includes the knowledge and practice of SRE fundamentals, incident management, blameless culture, observability, and change management.</li>
<li>You believe in the value of automation and will champion practices that improve reliability and drive the adoption of cross-team processes and tooling.</li>
<li>You love helping people on their journeys to become their best selves and are comfortable extending the range of your influence to partners, peers, and senior leadership.</li>
</ul>
<p><span data-sheets-root="1"><strong>The base salary range for this role is $143,000 to $210,000. </strong>The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits p ... (truncated, view full listing at source)