Software Development Engineer - (SRE) - II
AdobeNoidaPosted 1 March 2026
Job Description
Job Description – Site Reliability Engineer (P20) Our Company Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences. We’re passionate about empowering people to craft beautiful and powerful images, videos, and apps, and to transform how companies interact with customers across every screen. We’re on a mission to hire the very best and are committed to building exceptional employee experiences where everyone is respected and has equal opportunity. We know that new ideas can come from everywhere in the organization, and the next big idea could be yours! Platform Reliability & System Engineering Design, build, and operate reliable, scalable, and highly available distributed systems for the Adobe Pass platform. Implement reliability best practices to ensure high availability, fault tolerance, and performance across multi-region cloud environments. Identify reliability risks, troubleshoot system bottlenecks, and implement mitigation and remediation strategies. Partner with software engineering and infrastructure teams to improve system resilience, scalability, and operational efficiency. Automation & Observability Develop and maintain automation for deployment, monitoring, recovery, and operational workflows to reduce manual intervention. Build and enhance observability solutions using metrics, logs, and distributed tracing to ensure production visibility. Improve monitoring coverage, alert quality, and operational readiness across services. Contribute to automation frameworks that improve operational efficiency and reliability. Incident Management & Operational Excellence Participate in on-call rotations and actively troubleshoot and resolve production incidents. Perform root cause analysis (RCA) and implement corrective and preventive actions. Contribute to incident response, service restoration, and continuous reliability improvements. Support production readiness reviews and ensure operational best practices are followed. Performance & Scalability Analyse system performance, identify bottlenecks, and implement tuning and optimisation improvements. Assist in capacity planning and scaling efforts to support growing platform demand. Optimise infrastructure utilisation while maintaining service reliability and performance. Collaboration & Engineering Excellence Collaborate with engineering, infrastructure, and operations teams to improve platform reliability and operational maturity. Contribute to reliability best practices, automation standards, and operational processes. Participate in system design discussions with a focus on scalability, reliability, and operational supportability. Share knowledge and contribute to improving overall team capabilities. Qualifications Bachelor's in computer science, Engineering, or related field. 4–7 years of experience in Site Reliability Engineering, Production Engineering, or distributed systems operations. Strong experience operating cloud-native systems in AWS, Azure, or GCP environments. Proficiency in at least one programming or scripting language such as Python, Go, Java, or Bash. Experience with Kubernetes, containers, and microservices-based architectures. Experience with Infrastructure as Code tools such as Terraform or CloudFormation. Familiarity with monitoring and observability tools such as Prometheus, Grafana, Datadog, or OpenTelemetry. Understanding of distributed systems, networking, storage, and databases. Strong troubleshooting, debugging, and problem-solving skills. Preferred Qualifications Experience supporting high-availability, large-scale production systems. Familiarity with CI/CD pipelines and automation tooling. Experience working with distributed data systems such as Kafka or similar technologies. Knowledge of reliability engineering principles such as SLOs, SLIs, and incident management. Cloud or Kubernetes certificatio ... (truncated, view full listing at source)
Apply Now
Direct link to company career page