Site Reliability Engineer II

Coalition
Any location, CanadaUp to $160kPosted 27 February 2026

Job Description

<div class="content-intro"><h4><span style="color: rgb(0, 0, 0);">About us</span></h4> <p>Coalition is the world's first Active Insurance provider designed to help prevent digital risk before it strikes. Founded in 2017, Coalition combines comprehensive insurance coverage and innovative cybersecurity tools to help businesses manage and mitigate potential cyberattacks. </p> <p>Opportunities to make an impact with bold thinking are real—and happening daily at Coalition.</p></div><h4><span style="color: rgb(0, 0, 0);">About the role</span></h4> <div> <p>We are looking for a <strong>Site Reliability Engineer</strong> to join our Platform SRE team. In this role, you will build and operate the infrastructure, tools, and "paved roads" that empower our developers to deliver scalable, secure, and reliable software with speed and confidence.</p> You’ll work across the entire stack—from infrastructure automation and observability to developer enablement and system reliability. You will be a key collaborator with software engineering and security teams, helping to evolve our Infrastructure as Code (IaC), enhance CI/CD pipelines, and scale our internal developer platform. We value pragmatism and engineering excellence, primarily using Python, Go, and AWS to reduce toil and build self-service capabilities.</div> <h4><span style="color: rgb(0, 0, 0);">Responsibilities</span></h4> <ul> <li><strong>Infrastructure Automation:</strong> Design, build, and scale production environments using AWS and Terraform.</li> <li><strong>System Reliability:</strong> Improve the resilience and operability of our platform through failure-based testing and automated recovery strategies.</li> <li><strong>Developer Enablement:</strong> Design and implement reusable platform components and self-service tools to streamline the developer experience.</li> <li><strong>Observability:</strong> Implement and maintain robust observability practices, including system metrics, distributed tracing, and SLO management.</li> <li><strong>Mentorship Standards:</strong> Guide junior engineers, uphold high infrastructure quality, and contribute to the team’s evolving best practices.</li> <li><strong>Collaboration:</strong> Participate in technical design discussions, sharing feedback and adapting strategies based on team input and evolving requirements.</li> </ul> <h4><span style="color: rgb(0, 0, 0);">Skills and Qualifications</span></h4> <ul> <li><strong>Experience:</strong> 4+ years in SRE, DevOps, Cloud Engineering, or Software Development roles.</li> <li><strong>Cloud Proficiency:</strong> Hands-on experience operating and scaling production environments within <strong>AWS</strong>.</li> <li><strong>Infrastructure as Code:</strong> Strong expertise with <strong>Terraform</strong> for managing complex cloud infrastructure.</li> <li><strong>Programming:</strong> Proficiency in <strong>Go or Python</strong>, with experience building production-grade automation, tooling, or libraries.</li> <li><strong>Containers Orchestration:</strong> Experience with <strong>ECS or Kubernetes</strong>.</li> <li><strong>CI/CD:</strong> Familiarity with modern deployment tools, specifically <strong>GitHub Actions</strong>.</li> <li><strong>Communication:</strong> Strong written and verbal skills with a knack for evangelizing reliability best practices across the organization.</li> </ul> <h4><span style="color: rgb(0, 0, 0);">Bonus Points </span></h4> <ul> <li>Experience troubleshooting complex <strong>distributed systems</strong> in a high-traffic production environment.</li> <li>Exposure to event streaming systems such as <strong>Kafka or Kinesis</strong>.</li> <li>Experience contributing to <strong>Internal Developer Platforms (IDP)</strong> or automating self-service infrastructure workflows.</li> <li>Familiarity with <strong>systems security</strong>, compliance requirements, or infrastructure hardening.</li> </ul> <h4>Compensation</h4> <p>As a remote-first organization, our compensation refle ... (truncated, view full listing at source)