Staff Site Reliability Engineer
GetYourGuideBerlinPosted 28 April 2026
Job Description
Change the way the world travels
Be part of the GetYourGuide journey and connect people with unforgettable travel experiences worldwide. Since 2009, millions of travelers have booked unique activities with us in over 12,000 cities. Our headquarters in Berlin is supported by local offices across the globe, from New York to Bangkok.
Ready to join a diverse community of over 850 fellow explorers dedicated to revolutionizing the travel experience industry? Check out getyourguide.careers to learn more.
Team mission
Incidents interrupt operations, drain team productivity, and erode user trust. As a member of the Operational Excellence team, you will help GetYourGuide move toward a world of fewer interruptions and higher user trust — by preventing incidents before they happen and enabling teams to resolve them faster when they do.
As we push boldly into AI-powered experiences, we don't ignore the risks that increased output velocity creates. You will be a key part of ensuring our engineering organization moves fast with confidence, so our customers continue to have great experiences every time.
Beyond reliability, you will drive observability and cost efficiency — building the tooling, culture, and practices that make operational excellence a shared standard across all product teams.
Your mission
You will act as an "engineer for the engineers" — partnering with product teams to raise the bar on reliability, speed, and confidence in their systems.
Incident management reliability
Drive down incident frequency, MTTD and MTTR
Lead post-incident reviews and translate learnings into systemic improvements
Build tooling and runbooks that enable teams to diagnose and resolve production issues faster
Champion a culture of blameless incident handling and continuous improvement
Participate in the infrastructure on-call rotation
Observability production confidence
Advance our Datadog-based observability practice — metrics, logs, traces, dashboards, and alerting
Ensure teams have meaningful SLOs and actionable alerts — not alert fatigue
Enable production debugging capabilities so engineers can triage issues without needing a specialist
Change confidence release quality
Improve change failure rate by helping teams invest in the right automated test coverage and pre-production validation
Reduce the cost and risk of deployments through better tooling, feature flagging, and progressive rollout practices
Platform enablement
Design and maintain paved paths - well - documented golden paths for development, observability, testing, and incident response so product teams can do the right things by default
Work hands-on with product teams using Java and React to help them improve system design, testability, and operational hygiene
Leverage Kubernetes, AWS, and Istio expertise to guide teams on infrastructure best practices
Identify cost optimization opportunities and drive efficiency improvements across services
Leverage AI tooling to accelerate incident response, improve developer workflows, and scale operational practices
Your toolkit
Deep understanding of observability tooling — we use Datadog (metrics, APM, logs, dashboards)
Proven experience reducing MTTD, MTTR, and change failure rate; DORA metrics are not just acronyms to you
Strong coding skills in Java; comfortable reading and contributing in Go across infrastructure contexts; enough frontend context to collaborate with React / Vue teams
Experience with Kubernetes, AWS, and service mesh technologies (Istio/Envoy)
Solid understanding of distributed systems, networking, and container technology
Hands-on experience with CI/CD, automated testing strategies, and build systems
Ability to influence engineers and teams without direct authority — you raise standards by coaching, not dictating
Excellent written and verbal communication skills in English
Positive, proactive team player who is passionate about operational excellence and helps others deliver
Extras that gi ... (truncated, view full listing at source)
Apply Now
Direct link to company career page
AI Resume Fit Check
See exactly which skills you match and which are missing before you apply. Free, instant, no spam.
Check my resume fitFree · No credit card