SRE, Site Reliability Engineering

Klaviyo
Dublin, IEPosted 24 February 2026

Job Description

<div class="content-intro"><p><em>At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If you’re a close but not exact match with the description, we hope you’ll still consider applying. Want to learn more about life at Klaviyo? Visit <a class="_ymio1r31 _ypr0glyw _zcxs1o36 _mizu194a _1ah3dkaa _ra3xnqa1 _128mdkaa _1cvmnqa1 _4davt94y _4bfu18uv _1hms8stv _ajmmnqa1 _vchhusvi _kqswh2mm _ect4ttxp _syaz13af _1a3b18uv _4fpr8stv _5goinqa1 _f8pj13af _9oik18uv _1bnxglyw _jf4cnqa1 _30l313af _1nrm18uv _c2waglyw _1iohnqa1 _9h8h12zz _10531ra0 _1ien1ra0 _n0fx1ra0 _1vhv17z1" href="http://klaviyo.com/careers" data-renderer-mark="true">klaviyo.com/careers</a> to see how we empower creators to own their own destiny.</em></p></div><h2><strong>Site Reliability Engineer II – Site Reliability Engineering (Dublin)</strong></h2> <h3><strong>Team Overview</strong></h3> <p>As a Site Reliability Engineer II, you will help ensure Klaviyo’s critical platforms are reliable, scalable, and sustainable while enabling rapid product development.</p> <p>We treat reliability as a core product feature and use software engineering to solve complex systems and operational challenges. Our work spans infrastructure, security, and software engineering, and focuses on building and operating systems that are reliable, secure, and performant at scale.</p> <p>The SRE team’s charter is to build and operate foundational services and infrastructure, reduce operational toil through automation, and continuously improve systems based on real production learnings. Your work will directly impact how Klaviyo engineers build software and how customers experience our platform every day.</p> <h3><strong>How you’ll make an impact</strong></h3> <p>As a Site Reliability Engineer II, you will contribute to the reliability and operational excellence of Klaviyo’s platforms by working on well-scoped projects and owning services with support from senior engineers. You will:</p> <ul> <li>Build, operate, and improve production systems with a focus on reliability, scalability, and performance</li> <li>Apply software engineering principles to automate operational tasks and reduce manual toil</li> <li>Contribute to the design and implementation of systems using established SRE best practices</li> <li>Help define and measure SLIs and SLOs for services you support</li> <li>Improve observability through metrics, dashboards, logging, and tracing</li> <li>Participate in on-call rotations and respond to production incidents with guidance and support</li> <li>Assist with incident investigation and contribute to post-incident reviews and follow-up actions</li> <li>Perform basic analysis around system behavior, capacity usage, and scaling characteristics</li> <li>Identify reliability issues or operational pain points and work with teammates to address them</li> <li>Collaborate with product, platform, and security engineers to ship reliable systems</li> <li>Write and maintain clear operational runbooks and system documentation</li> </ul> <h3><strong>Who you are</strong></h3> <p>You are an early-to-mid career SRE who is comfortable operating production systems and eager to deepen your expertise in reliability engineering.</p> <p>You:</p> <ul> <li>Have experience operating cloud-native production systems and services</li> <li>Write production-quality code (e.g. Python, Go, or similar) to automate operations and improve reliability</li> <li>Understand common failure modes in distributed systems, such as dependency failures, resource exhaustion, and partial outages</li> <li>Have experience working with containerized workloads and platforms (e.g. Kubernetes) in production environments</li> <li>Are comfortable participating in on-call rotations and diagnosing straightf ... (truncated, view full listing at source)