Lead Site Reliability Engineer
MastercardPune, IndiaPosted 7 April 2026
Tech Stack
Job Description
Our Purpose
Mastercard powers economies and empowers people in 200 countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Lead Site Reliability Engineer
Role Overview
We are seeking a highly technical Lead Site Reliability Engineer (SRE) to architect, engineer, and operate highly reliable, scalable, and secure platforms across multi-cloud (AWS, Azure) and hybrid (on-prem + cloud) environments.
This is a deeply hands-on engineering role requiring expertise in distributed systems, Kubernetes, hybrid networking, automation, CI/CD, observability, and production incident leadership. The Lead SRE will serve as the technical authority for reliability across interconnected cloud and datacenter ecosystems.
Core Responsibilities
1. Reliability Engineering Across Hybrid & Multi-Cloud
• Define and implement SLIs, SLOs, and error budgets across cloud-native and on-prem workloads.
• Architect high-availability designs spanning:
o AWS and Azure regions
o On-prem datacenters
o Cross-cloud failover patterns
• Design DR strategies (RTO/RPO driven) across hybrid environments.
• Eliminate single points of failure across network, compute, storage, and DNS layers.
• Conduct resilience validation, chaos testing, and failure scenario modeling.
2. Multi-Cloud Architecture & Engineering
• Engineer and operate workloads across:
o Amazon Web Services
o Microsoft Azure
• Design cross-cloud networking (VPN, ExpressRoute, Direct Connect, Transit Gateway).
• Implement workload portability and cloud-agnostic deployment strategies.
• Optimize cost, performance, and reliability across providers.
• Design cloud-native autoscaling, load balancing, and traffic routing strategies.
3. Hybrid Infrastructure (On-Prem + Cloud Integration)
• Integrate on-prem infrastructure with cloud platforms using:
o Active Directory / IAM federation
o Hybrid DNS architecture
o Secure certificate lifecycle management
• Troubleshoot hybrid connectivity issues (BGP routing, firewall policies, NAT, MTU mismatches).
• Manage hybrid Kubernetes deployments and private registry integrations.
• Support legacy-to-cloud modernization initiatives.
4. Kubernetes & Container Platform Engineering
• Architect and operate:
o Amazon EKS
o Azure Kubernetes Service
o Self-managed Kubernetes clusters (on-prem)
• Optimize cluster autoscaling, resource allocation, and performance.
• Implement cluster security hardening and RBAC governance.
• Troubleshoot CNI, ingress controllers, service mesh, and pod networking issues.
• Implement GitOps-driven deployments.
5. Observability Engineering Across Distributed Systems
• Build unified observability across hybrid environments using:
o Splunk
o Dynatrace
o Prometheus
o Grafana
o OpenTelemetry
• Implement centralized logging across cloud and on-prem workloads.
• Design distributed tracing across multi-cloud microservices.
• Engineer proactive alerting to reduce MTTR and improve signal quality.
6. CI/CD & Infrastructure Automation
• Engineer resilient CI/CD pipelines (Jenkins, GitHub Actions, Azure DevOps).
• Implement cross-cloud infrastructure as code using:
o Terraform
o CloudFormation
• Automate:
o Certificate rotation
o Auto-scaling policies
o Patch orchestration
o Drift detection
• Improve deployment reliability via blue-green and canary strategies.
7. Advanced Production Troubleshooting
• Lead technical investigation of:
o DNS resolution failures (private/public zones, hybrid forwarding)
o TLS/PKI certificate failures
o Network latency across hybrid circuits
o Memory leaks & kernel-level issues
o Thread contention & CPU throttling
• Perform packet-lev ... (truncated, view full listing at source)
Apply Now
Direct link to company career page
AI Resume Fit Check
See exactly which skills you match and which are missing before you apply. Free, instant, no spam.
Check my resume fitFree · No credit card