Job Description
<div class="content-intro"><div>
<div>
<div class="gmail_quote">
<div>
<div><span id="m_1770241969069985273m_-2746164444908759431gmail-docs-internal-guid-131e4fb0-7fff-b4e9-ff50-e8cf32449b1b">CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at <a href="http://www.coreweave.com/" target="_blank" data-saferedirecturl="https://www.google.com/url?q=http://www.coreweave.comsource=gmailust=1762613132717000usg=AOvVaw3D-UOhNaqEvF5BEWxjYyAU">www.coreweave.com</a>.</span></div>
</div>
</div>
</div>
</div></div><h2><span style="font-size: 14pt;"><strong>What You’ll Do</strong></span></h2>
<p>As a Senior Engineer in Compute Services, you will be responsible for building fault-tolerant and reliable infrastructure to support both our internal processes and our customer platform. If you’re passionate about GitOps, KubeOps, DevOps—really, all the ops—this role could be a great fit for you!</p>
<ul>
<li>Design, develop, and maintain automated tooling to provision Kubernetes control planes on bare-metal</li>
<li>Use Python, Golang, and Bash to create tooling and go operators</li>
<li>Perform day 2 lifecycle tasks and maintenance on running clusters</li>
<li>Identify gaps and implement fault-tolerant architectures</li>
<li>Optimize reliability using the Grafana ecosystem</li>
<li>Design automated testing to validate build quality and stability</li>
<li>Participate in an on-call rotation every two months serving as point of contact</li>
</ul>
<h2><span style="font-size: 14pt;"><strong>Who You Are</strong></span></h2>
<p>Investing in our people is one of our top priorities, and we value candidates who can bring their diversified experiences to our teams. Here are some qualities we’ve found compatible with our team. We'd love to talk about whether this aligns with your experience and interests and what you’re excited to work on next.</p>
<ul>
<li>Proven experience provisioning Kubernetes using tools such as kubeadm, Cluster API, Kubeception, Kubespray, or similar</li>
<li>Demonstrated ability debugging complex kubernetes cluster issues and carrying out upgrades</li>
<li>Proficiency in Golang, Bash, and Python</li>
<li>Advanced Linux OS troubleshooting skills</li>
<li>Extensive experience with Ansible</li>
<li>Advanced DevOps experience (e.g., GitLab CI, GitHub Actions)</li>
<li>Demonstrated ability to collaborate effectively on shared codebases</li>
<li>Excellent documentation skills and high attention to detail</li>
<li>Strong analytical and problem-solving abilities</li>
<li>Experience participating in an on-call rotation to support production services</li>
</ul>
<p><strong>Preferred Qualifications</strong></p>
<ul>
<li>Bare-metal OS provisioning experience</li>
<li>Kubernetes operator coding experience</li>
<li>Advanced Linux networking expertise</li>
<li>AWX/Ansible tower knowledge</li>
</ul>
<p><span data-sheets-root="1">The base salary range for this role is $182,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).</span></p><div class="content-conclusion"><p><strong>What We Offer</strong></p>
<p>The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can includ ... (truncated, view full listing at source)