Production Operations Manager (Maternity Leave Replacement)
Apiiro · Tel Aviv-Yafo, Tel Aviv District, Israel · Posted 23 March 2026
Job Description
What We’re About
Apiiro is a fast-growing startup at the forefront of application security. Our cutting-edge platform is designed to help development teams build secure software quickly, enabling them to move faster while managing risks. As pioneers in the AppSec space, we’re shaping the future of secure software development in the AI era. If you're looking for an exciting opportunity to make a significant impact and grow with a passionate team, Apiiro is the place to be.
What you’re about
Prior experience in this or a closely related role (Production Manager, Release Manager, or Production Operations Manager).
3+ years of hands-on experience with release management, CI/CD and monitoring tooling, plus practical experience with GCP and AWS.
Experience with GitOps deployment tools.
Experience with CI systems (GitHub Actions, Jenkins, GitLab CI) and release orchestration tooling.
Strong Linux, Docker and Kubernetes experience; familiarity with Infrastructure-as-Code (Terraform/CloudFormation) and Helm/EKS/GKE-style deployment patterns.
Hands-on experience operating Grafana dashboards and alerting, and responding to GCP monitoring alerts.
Proven ability to run releases, create hotfixes and own release calendars and schedules.
Excellent interpersonal and stakeholder management skills - able to run meetings, influence engineers and product managers and drive cross-team decisions.
English - written and verbal: native level.
Comfortable working in a fast-paced startup environment and taking ownership of end-to-end processes.
Experience with ArgoCD or Flux.
Familiarity with Prometheus, Loki, Tempo or other components of the Grafana stack.
Experience with Alertmanager, PagerDuty/incident.io integrations, and status page tooling.
Experience writing automation tooling in Python, Go or Bash to reduce manual steps in release and incident workflows.
Background in SRE or production engineering, including SLA/SLI/SLO design and MTTR reduction.
What you will do
Release & hotfix management - Create and coordinate manual releases and hotfixes; build and publish release versions; maintain a centralized release calendar and manage release risk, quality and schedule.
Daily operations syncs - Lead daily production sync meetings and act as the cross-team release/incident liaison across R&D, QA and DevOps.
Incident ownership - Own the incident.io platform (day-to-day configuration, runbooks, training for R&D, and status pages), run incident lifecycle activities, and drive post-mortems and RCAs for production bugs and outages.
Monitoring & alerting - Manage and maintain Grafana dashboards and alerting rules; respond to and triage alerts from Grafana and GCP (Cloud) monitoring; work with teams to reduce false positives and improve signal-to-noise in alerts.
On-call & incident triage - Participate in on-call rotations and act as escalation point for production issues; perform structured troubleshooting to mitigate issues and reduce MTTR.
Post-incident process - Lead post-mortems, own follow-up actions, and ensure lessons learned are integrated into process and tooling (e.g., smoke tests, cache clearing, alerting improvements).
Cross-functional reporting - Continuously monitor project health, produce operational reports on availability, performance and security, and communicate status to stakeholders (Slack, dashboards, support tooling).
Process automation - Drive improvements to CI/CD, release automation, and production runbooks; help orchestrate IaC and deployment tooling to reduce manual risk.
Training & enablement - Train new R&D and support members on incident platform usage, on-call processes and runbooks.
Technologies & tools you’ll use
Grafana (dashboards & alerts), incident.io (incident management & training), GCP (Cloud Monitoring / Logging), AWS, Kubernetes, Docker, Terraform / IaC, Helm, GitHub, CI/CD systems (GitHub Actions/ArgoCD/Jenkins), Slack, Jira, and common scripting languages (Python/Bash). (Apiiro’s ProdOps team drives these systems in production and ...