Software Engineer, Infrastructure Platform

Docker
Canada
Posted 25 March 2026

Job Description

At Docker, we make app development easier so developers can focus on what matters. Our remote-first team spans the globe, united by a passion for innovation and great developer experiences. With over 20 million monthly users and 20 billion image pulls, Docker is the #1 tool for building, sharing, and running apps, trusted by startups and Fortune 100s alike. We’re growing fast and just getting started. Come join us for a whale of a ride!

Our Infrastructure Engineering team builds and operates the cloud-native platform that powers Docker’s suite of products. We design resilient services, automate where it helps most, and measure what matters so hundreds of engineers can ship safely to millions of users every day.

A core focus is self-service. We build paved-road platform capabilities that let internal teams provision, deploy, observe, and operate services with minimal friction and strong guardrails. We treat the platform as a product with clear contracts, well-defined defaults, and great documentation. Success is measured by adoption and fewer support requests.

HOW WE WORK
- Write it down, ship it, iterate: RFCs and design docs, code review, and small safe releases.
- Sustainable reliability: we prioritize root-cause fixes, good alerts, and automation over heroics.
- Cross-functional by default: we partner closely with product and security teams.
- AI-accelerated execution: we build agentic workflows to reduce toil and improve incident response, with guardrails, auditability, and human review.

WHAT YOU’LL WORK ON
- Reducing toil through automation, including AI-assisted and agentic operational workflows.
- Building self-service onboarding and deployment workflows that reduce tickets and speed delivery.
- Scaling Kubernetes foundations and evolving our traffic and ingress stack.

RESPONSIBILITIES

1) Self-Service Platform Services
- Build and operate internal platform services and APIs in Go, including provisioning, quotas and policies, cost insights, and platform workflows.
- Deliver golden paths for self-serve onboarding and day-2 operations, including access, deployment setup, observability defaults, and governance guardrails.
- Partner with teams to drive adoption through clear docs, examples, and measurable outcomes.

2) Infrastructure as Code and Reliability
- Codify infrastructure with Terraform and GitOps practices, and contribute to platform tooling in Go.
- Define and improve SLOs, alerting, and operational readiness. Participate in incident response and preventive follow-ups.
- Help standardize safe delivery patterns, including testing gates, canaries, and rollback triggers, so deployments are routine and low-risk.

3) Kubernetes and Networking Foundations
- Operate and scale multi-tenant EKS clusters and traffic and ingress systems to deliver secure, reliable routing.
- Evaluate and adopt improvements with a bias toward incremental rollout and measurable impact.

4) AI and Agentic Workflows for Reliability
- Build and iterate on agentic workflows that reduce operational toil, including triage support, context gathering, safe runbook execution, and remediation suggestions.
- Integrate automation into delivery and operations in a way that is safe, observable, and auditable.

5) On-Call and Incident Response
Operational ownership is part of this role.
- You’ll join an on-call rotation after onboarding and shadowing, and participate in incident response during your shifts.
- We aim for sustainable on-call through good alerting, automation, and blameless postmortems focused on prevention.

QUALIFICATIONS

Core Engineering Skills (must-have)
- 4+ years of backend software engineering experience building large-scale cloud or distributed systems.
- Strong software development skills in Go or a similar language, including design, testing, debugging, and code review.
- Experience shipping and operating cloud services in production, often 3+ years.

We hire for ... (truncated, view full listing at source)