Senior Site Reliability Engineer
Zuora · Chennai, Tamil Nadu, India · Posted 19 March 2026
Job Description
Company Overview
At Zuora, we do Modern Business. We’re helping people subscribe to new ways of doing business that are better for people, companies and ultimately the planet. It’s an approach resulting from the shift to the Subscription Economy that puts customers first by building recurring relationships instead of one-time product sales and focuses on sustainable growth. Through our leading expertise and multi-product suite, we are transforming all industries and working with the world’s most innovative companies to monetize new business models, nurture subscriber relationships and optimize their digital experiences.
The Team & Role
Zuora’s Cloud Engineering organization owns the reliability, scalability, and operational excellence of our global, customer-facing SaaS platforms. Operating across the US, India, Beijing, Costa Rica, and remote locations, we use a follow-the-sun model to deliver 24x7x365 reliability for mission-critical systems. The team partners closely with Engineering, Security, Customer Support, Global Services, and Product to ensure customer trust, platform resilience, and operational efficiency.
We are seeking a Senior Site Reliability Engineer to play a technical leadership role in advancing Zuora’s reliability strategy with a strong focus on AI-driven automation and intelligent operations. This role goes beyond execution and requires ownership of complex systems, definition of new approaches, and influence across teams. The ideal candidate brings deep SRE expertise combined with an AI-centric mindset to design, build, and operationalize intelligent automation at scale.
Our Tech Stack: AWS, Microservices, Kafka, Kubernetes, Terraform, Jenkins, Puppet, Python, Linux
AI Automation Focus: AI-assisted operations, intelligent alerting, auto-remediation, predictive reliability, workflow automation
What you’ll do
Reliability Architecture & Platform Strategy: Own and evolve the reliability architecture of large-scale, distributed SaaS systems by defining SLOs, SLIs, error budgets, and resilience patterns aligned with business objectives. Drive system-level improvements across services and regions, proactively identifying architectural risks, capacity constraints, and failure modes, while influencing platform and application design to improve long-term reliability and operability.
AI-Driven Automation & Intelligent Operations: Design, build, and operationalize AI-powered automation to reduce operational toil and improve system stability. Apply AI and machine learning techniques to incident detection, anomaly identification, root cause analysis, auto-remediation, and capacity forecasting, enabling proactive and predictive reliability management at scale.
Advanced Cloud Infrastructure Engineering: Lead the design and operation of complex AWS-based infrastructure and Kubernetes platforms, optimizing for availability, security, and cost efficiency. Define advanced Infrastructure-as-Code patterns using Terraform and configuration management tools to support scalable, repeatable, and policy-driven environments across multiple stages and regions.
Incident Leadership & Operational Excellence: Act as a technical leader during high-severity production incidents, driving structured response, decision-making, and recovery. Establish intelligent incident response mechanisms using automation, AI-assisted diagnostics, and enriched runbooks, while leading deep post-incident analysis focused on systemic improvements rather than short-term fixes.
Technical Leadership & Cross-Functional Influence: Influence reliability outcomes beyond the SRE team by partnering closely with Engineering, Product, and Security stakeholders. Provide technical mentorship to senior and mid-level engineers, guide adoption of best practices, and contribute to the development of new reliability standards, tooling strategies, and operational policies across the organization.
Your experience
Required Qualifications
8+ years of hands-on ... (truncated, view full listing at source)