Senior Site Reliability Developer (SRE)
AutodeskToronto, ON, CANPosted 7 April 2026
Tech Stack
Job Description
Job Requisition ID #
26WD94664
Position Overview
We are seeking a highly motivated and experienced Senior Site Reliability Developer (SRE) to manage critical cloud infrastructure and site reliability operations for the Autodesk Platform Services and Emerging Technologies organization . The team delivers high-value, exabyte-scale and cloud data platform components powering desktop, mobile, and web products. This enables our product teams to build cohesive in-product data experiences, our partners to integrate and expand our data, and our end-users to work with their data across all Autodesk products.
This pivotal role focuses on ensuring the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering Manager, you will be leading design and development of resilient and scalable architecture and innovative solutions for the platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners.
Responsibilities
Lead architecture, solution design, development and maintenance of cloud infrastructure for microservices architecture
Independently manage requirement analysis, solution design, implementation, and release planning
Ensure strict adherence to security, trust, compliance guidelines, and standards
Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security
Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices
Implement and maintain configuration management and infrastructure as code (IaC) using Terraform
Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and periodic maintenance activities
Contribute to remediation of critical vulnerabilities (CVEs)
Promote and document security and best practices across all pillars of DevOps/SRE throughout system design
Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues
Participate in on-call rotations, providing critical 24x7 support for production systems
Minimum Qualifications
Bachelor’s degree or higher in Computer Science, Engineering, or a related field
5 years of progressive experience in Site Reliability Engineering, DevOps, or a similar field
Proficiency with managing AWS resources and understanding of networking and security protocols
Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation
Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory
Experience with container-based technologies like Docker, Kubernetes and AWS ECS
Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch
Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment
Strong experience with UNIX/Linux systems and programming languages such as Python, Go, Bash, Groovy, and Node.js
Technology Stack: Java/SpringBoot, AWS (ECS Fargate, Elastic Cache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, Open Search), Kafka, Flink, Jenkins, GitHub, Jira, Google Apigee, ServiceNow, and Splunk
Preferred Qualifications
Knowledge of applying AI and ML solutions for engineering processes and/or DevOps automation
Knowledge of standardized observability frameworks such as OpenTelemetry
Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer)
Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures
Broad knowledge of data streaming pipelines like Kinesis, Firehose, and Kafka
Knowledge on core Java and SpringBoot concepts in JVM optimization
Knowledge on build tools, e.g. Gradle
Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment
Self ... (truncated, view full listing at source)
Apply Now
Direct link to company career page
AI Resume Fit Check
See exactly which skills you match and which are missing before you apply. Free, instant, no spam.
Check my resume fitFree · No credit card
More jobs at Autodesk
See all →More Node jobs
See all →Developer Relations Engineer, Tools
Tenstorrent · Austin, Texas, United States; Fort Collins, Colorado, United States; Portland, Oregon, United States; Santa Clara, California, United States; Toronto, Ontario, Canada; United States
Performance Architect, AI HW
Tenstorrent · Toronto, Ontario, Canada
Power Architect, AI Data Center Chiplets
Tenstorrent · United States
SOC Emulation Engineer - Hardware Emulation Infrastructure
Tenstorrent · Austin, Texas, United States; Santa Clara, California, United States