Senior Site Reliability Engineer (Observability)

Iterable
Hybrid - Lisbon, Portugal
Posted 21 January 2026

Job Description

<div class="content-intro"><p>Iterable is the leading AI-powered customer engagement platform, helping brands like Redfin, SeatGeek, Priceline, Calm, and Box create dynamic, individualized experiences at scale. Our platform empowers organizations to activate customer data, design seamless cross-channel interactions, and optimize engagement, all with enterprise-grade security and compliance. Today, nearly 1,200 brands across 50+ countries rely on Iterable to drive growth, deepen customer relationships, and deliver joyful customer experiences.</p> <p>Our success is powered by extraordinary people who bring our core values (Trust, Growth Mindset, Balance, and Humility) to life. We foster a culture of innovation, collaboration, and inclusion, where ideas are valued and individuals are empowered to do their best work. That’s why we’ve been recognized as one of <a href="https://www.inc.com/best-workplaces/2022">Inc’s Best Workplaces</a> and <a href="https://iterable.com/blog/inc-names-iterable-one-of-americas-fastest-growing-companies/">Fastest Growing Companies</a>, and were named to Forbes’ list of America’s Best Startup Employers in 2022. Iterable has also been listed on <a href="https://blog.wealthfront.com/announcing-2021-career-launching-companies/">Wealthfront’s Career Launching Companies List</a> and has held a top 10 ranking on the <a href="https://wearegirlsclub.com/top-25-companies-where-women-want-to-work/">Top 25 Companies Where Women Want to Work</a>.</p> <p>With a global presence, including offices in San Francisco, New York, Denver, London, and Lisbon, plus remote employees worldwide, we are committed to building a diverse and inclusive workplace. We welcome candidates from all backgrounds and encourage you to apply. Learn more about our story and mission on our <a href="https://iterable.com/culture/">Culture</a> and <a href="https://iterable.com/company/">About Us</a> pages.
Let’s shape the future of customer engagement together!</p></div><p><strong>How you will make an impact:</strong></p> <p>As a Senior Engineer on the Observability Team, your impact is measured by the clarity and reliability with which our engineers can see into their systems. You don't just provide a suite of tools; you serve as a <strong>strategic observability partner</strong> for the entire engineering organization.</p> <ul> <li><strong>Strategic Observability Partnership:</strong> You will collaborate deeply with product teams to ensure the frameworks we provide actually solve their problems. Your success is measured by how well teams can diagnose their own services, not just by the uptime of our clusters. You will act as a consultant to help teams define meaningful observability signals that reflect the true customer experience.</li> <li><strong>Set the observability vision:</strong> Own the long-term roadmap for Datadog, Grafana, Prometheus, Elasticsearch, Quickwit, and emerging OpenTelemetry tooling. Define SLIs/SLOs that align platform health with customer experience.</li> <li><strong>Lead large-scale implementations:</strong> Design and automate scalable pipelines (metrics, traces, logs, events) so every engineer has sub-second, queryable visibility into production.</li> <li><strong>Harden our platform:</strong> Drive upgrades, capacity modeling, and policy enforcement for our dedicated observability-focused clusters; introduce best-in-class patterns for multi-tenant isolation and cost optimization.</li> <li><strong>Ship platform enhancements:</strong> Contribute production-quality Go or Python services, operators, and Terraform modules that elevate reliability, perf ... (truncated, view full listing at source)