Job Description
Senior Production Engineer
Hybrid, Bucharest, Romania
JOB OVERVIEW
Are you passionate about automation, cloud infrastructure, Kubernetes, and reliability engineering? As a Senior Production Engineer (SRE) at Legion, you will build and operate a secure, highly scalable, and cost-effective AWS/Kubernetes-based cloud platform. You will work across infrastructure automation, CI/CD pipelines, observability, and production reliability. Simply put, the SRE team ensures Legion’s platform is reliable, scalable, and continuously improving for our customers.
This role includes participation in an on-call rotation.
RESPONSIBILITIES AND DUTIES
Support and operate Legion’s AWS-based cloud platform and Kubernetes (EKS) environments.
Leverage GenAI tools (e.g., Claude Code, Codex, or similar) to accelerate infrastructure development, automation, and auto-remediation of common production issues.
Build and maintain infrastructure-as-code using Terraform.
Develop automation and internal tooling using Go or Python.
Improve CI/CD pipelines to increase deployment safety and velocity.
Define and improve monitoring, alerting, and observability systems.
Respond to production incidents, conduct root cause analysis, and implement systemic improvements.
Develop and automate operational runbooks and remediation workflows.
Support production deployments, including during off-hours as needed.
REQUIRED SKILLS AND QUALIFICATIONS
5-8+ years of experience in SRE, DevOps, or SaaS production operations.
3+ years of hands-on experience operating production workloads in AWS.
5+ years of experience with observability tools such as Datadog, CloudWatch, ELK stack, Prometheus, or similar.
3+ years experience with Terraform and infrastructure-as-code practices, including managing complex multi-region deployments with module based configurations.
3+ years of experience with containerized environments using Docker and Kubernetes (EKS preferred); familiarity with Helm.
Proficiency in Go or Python (or similar programming language).
Experience building and maintaining CI/CD systems (Git-based workflows, Argo, Jenkins or similar).
Strong Linux/Unix systems experience.
Bachelor’s degree in Computer Science or equivalent practical experience.
PREFERRED QUALIFICATIONS AND ATTRIBUTES
Experience managing AWS RDS and/or Aurora MySQL including slow query analysis, replication, and upgrade operations.
Experience implementing SLIs/SLOs and reliability best practices.
Experience working effectively with remote, distributed teams.
Experience with supporting SOC 2 / ISO 27001 audits.
AWS certification preferred.
ABOUT LEGION
Join Legion's mission to turn hourly jobs into good jobs. We're a mission-driven team seeking exceptional talent to propel this vision. Embrace a culture that's collaborative, fast-paced, and entrepreneurial. With us, you'll grow your skills, work closely with experienced executives, and contribute significantly to our mission. Our award-winning AI-native workforce management platform is intelligent, automated, and employee-centric and proven to deliver 13x ROI. We help labor-intensive organizations maximize labor efficiency and employee engagement simultaneously. Legion has earned recognition for its innovation, including spots on the Inc. 5000 list, Forbes’ Next Billion Dollar Startups, and awards for our AI technology. Backed by leading investors such as Norwest Venture Partners, Stripes, First Round Capital, XYZ Ventures, Webb Investment Network, Workday Ventures, and NTT DOCOMO Ventures, we're making real change. If you're ready to make an impact and grow your career, Legion is where you belong. Join us in making hourly work rewarding and fulfilling.
BACKGROUND AND OPPORTUNITY
There are almost 75 million hourly workers in the United States, representing more than half of the entire workforce. Historically, managing hourly employees has been difficult due to high attrition (average of 60%) and high replacement costs (average of $ ... (truncated, view full listing at source)