Senior SRE Manager

ClickUp
United States
Posted 14 August 2025

Job Description

<div class="content-intro"><div class="ql-block" data-block-id="block-ac776bb3-c45a-49bd-9ccd-6eee2eb78c97">ClickUp is revolutionizing the way the world works. As the only all-in-one productivity platform built from day one for true convergence, ClickUp unifies tasks, docs, chat, calendar, enterprise search, and more—supercharged by context-driven AI. While others scramble to bundle fragmented tools or bolt on AI, we anticipated this future and made it our foundation from the start. Headquartered in San Diego with a rapidly expanding global footprint, we empower over three million teams to break free from silos and reclaim their time—saving at least one day every week. Join ClickUp, one of the fastest-growing SaaS companies on the planet, and help millions of users transform the way they work. We’re not just building software. We’re shaping the future of work. Come join us in building the future—together. 🦄</div> <div class="ql-block" data-block-id="block-ac776bb3-c45a-49bd-9ccd-6eee2eb78c97"> </div></div><div class="ql-block" data-block-id="block-3e99015c-05e8-4b54-93ca-2f621694b069">We're seeking a Senior Manager of SRE to lead our U.S.-based Site Reliability and Release Engineering teams. This role is focused on strengthening our infrastructure cost efficiency, driving vendor optimization, and building operational excellence through high-impact incident management and team development.</div> <div class="ql-block" data-block-id="block-3e99015c-05e8-4b54-93ca-2f621694b069"> </div> <div class="ql-block" data-block-id="block-bb89787f-6b8d-45a2-b1f4-18c17ae851c3">You won’t be expected to be hands-on day-to-day, but you should be ready to step in during high-severity issues, lead incident response efforts, and coach others to do the same. 
This is a strategic leadership role with technical depth, well suited to someone with strong systems expertise who’s now focused on scaling operational maturity, optimizing spend, and leveling up their teams.</div> <div class="ql-block" data-block-id="block-bb89787f-6b8d-45a2-b1f4-18c17ae851c3"> </div> <div class="ql-block" data-block-id="block-5163ea32-5110-4a2b-8a01-50b2be0633a4"><strong><u>The Role</u></strong></div> <ul> <li>Lead SRE & Release Engineering teams focused on system reliability, release velocity, and incident response across critical services.</li> <li>Drive infrastructure cost optimization initiatives to ensure efficient AWS and tooling usage, reduce waste, and improve observability ROI (e.g., Datadog).</li> <li>Own vendor management for key platforms and tools; negotiate contracts, monitor value, and guide adoption best practices.</li> <li>Serve as incident commander during major production events, partnering with engineering leads to troubleshoot and mitigate effectively.</li> <li>Develop and grow a bench of capable incident commanders, instilling calm, rigor, and accountability across on-call and response workflows.</li> <li>Partner closely with Finance and Engineering to manage infrastructure budgets, forecast spend, and justify tooling investments.</li> <li>Establish performance and reliability metrics to monitor service health and team effectiveness.</li> <li>Identify team skill gaps, mentor SREs, and create pathways for continued growth and technical leadership.</li> <li>Collaborate cross-functionally with DevOps, Platform, and Product teams to align on reliability priorities, without needing to drive developer productivity directly.</li> </ul> <div class="ql-block" data-block-id="block-ea62be38-68dc-4ac7-849c-ec8184ca103f"><strong>& ... (truncated, view full listing at source)