Senior Site Reliability Engineer
BetterUp
Austin, TX; New York, NY; San Francisco, CA; Chicago, IL; Arlington, VA
Posted 21 January 2026
Job Description
Let’s face it, a company whose mission is human transformation better have some fresh thinking about the employer/employee relationship.

We do. We can’t cram it all in here, but you’ll start noticing it from the first interview.

Even our candidate experience is different. And when you get an offer from us (and accept it), you get way more than a paycheck. You get a personal BetterUp Coach, a development plan, a trained and coached manager, the most amazing team you’ve ever met (yes, each with their own personal BetterUp Coach), and most importantly, work that matters.

This makes for a remarkably focused and fulfilling work experience. Frankly, it’s not for everyone. But for people with fire in their belly, it’s a game-changing, career-defining, soul-lifting move.

Join us and we promise you the most intense and fulfilling years of your career, doing life-changing work in a fun, inventive, soulful culture.

If that sounds exciting, and the job description below feels like a fit, we really should start talking.

We are a hybrid company with a focus on in-person collaboration when necessary. Employees are expected to be available to work from one of our office hubs at least two days per week, or eight days per month. Our US hub locations include: Austin, TX; Chicago, IL; New York City, NY; San Francisco, CA; and the Washington, DC metro area. If this is a role based in Europe, our Europe hub locations are London, UK and Amsterdam, NL.
Please ensure you can realistically commit to this structure before applying.

What you’ll do:
- Leverage AI-powered tools and automation to transform how we monitor, troubleshoot, and maintain production systems
- Build and operate cloud infrastructure on AWS, using Terraform to codify and version-control our entire environment
- Manage and scale Kubernetes clusters that power BetterUp's platform, ensuring high availability and performance
- Design intelligent alerting and observability systems
- Collaborate with engineering teams to embed reliability into the development lifecycle, shifting left on operational concerns
- Automate incident response workflows and build self-healing infrastructure
- Experiment with and adopt emerging AI tools for log analysis, anomaly detection, and predictive maintenance
- Drive continuous improvement through data-driven retrospectives and reliability metrics

If you have some or all of the following, please apply:
- 4+ years of experience in SRE or infrastructure roles
- Genuine excitement about AI tooling: you're already using copilots, AI assistants, or LLM-based tools in your workflow and are excited to push your skillset further in this area
- Deep experience with AWS
- Hands-on Kubernetes experience: deploying, scaling, debugging, and securing clusters
- Strong Terraform skills with experience managing complex, multi-environment infrastructure
- Familiarity with modern observability stacks (Datadog, Prometheus, OpenTelemetry)
- Strong debugging instincts and comfort navigating distributed systems
- Clear communication skills - you can explain a production incident to engineers and executives alike
- A builder's mindset: you see manual processes as opportunities for automation

AI at BetterUp
Our team thrives at the intersection of human expertise and AI capability. As an AI-forward company, adaptation and continuous learning are part of our daily work.
We're looking for teammates who are excited to evolve alongside technology – people who experiment boldly, share their discoveries openly, and help define best practices for AI-augmented work. These professionals thoughtfully integrate AI into their work to deliver exceptional results while maintaining the human judgment and creativity that drive real innovation. During our interview process, you’ll have opportunities to showcase how you harness AI to learn, iterate, and amplify your impact.

Benefits:
At BetterUp, we are committed to living out our mission every day and that starts with providing benefits that allow our employees to care for themselves, support their families, and ... (truncated, view full listing at source)