Team Lead, Site Reliability Engineering
Veeam Software | Bucharest, Romania | Posted 21 March 2026
Job Description
Veeam is the Data and AI Trust Company, specializing in helping organizations ensure their data and AI are fully understood, secured, and resilient to enable the acceleration of safe AI at scale. As the market leader in both data resilience and data security posture management, Veeam is built for the convergence of identity, data, security, and AI risk. Headquartered in Seattle with offices in more than 30 countries, Veeam protects over 550,000 customers worldwide, who trust Veeam to keep their businesses running. Join us as we go fearlessly forward together, growing, learning, and making a real impact for some of the world’s biggest brands.
About the Role
Veeam is expanding its Site Reliability Engineering (SRE) organization to support Veeam services. As an SRE Team Leader, you will build and lead a high-performing team that partners with product, platform, and security engineering to make our systems reliable, scalable, and observable from the ground up. You’ll collaborate with peer engineering leaders to embed reliability into service roadmaps.
You’ll drive adoption of SRE principles (SLIs/SLOs/error budgets) and operate a healthy, daytime follow-the-sun on-call model in partnership with other regions. You will lead your team to make improvements in the overall operability, reliability, resilience, and security of the services we support.
What You’ll Do
People & Team Leadership
Hire, onboard, and develop your SRE team
Encourage a culture that prioritizes learning and engineering over fault-finding and firefighting
Ensure sustainable operational coverage; monitor on-call health and workload
Reliability Strategy & Governance
Establish and operationalize SLIs/SLOs and error budgets with service owners
Run reliability reviews and hold teams accountable to outcomes
Define reliability standards, runbooks, readiness checklists, and alerting patterns (including SLO-based alerting)
Operations & Incident Excellence
Ensure incident response readiness
Lead and coordinate major incidents
Measure MTTR, change failure rate, SLO posture, and repeat-incident reduction
Engineering & Automation
Lead software-first reliability investments: observability, resilience testing/chaos, and self-service guardrails
Drive platform improvements and internal tools
What You’ll Bring
3+ years managing Software, Platform, and/or Reliability Engineering teams
Experience in IT Platform Engineering or Software Development
Demonstrable experience leading engineering teams to predictably deliver outcomes
Demonstrated success leading SLO/error-budget adoption and reliability programs for services
Experience leading cross-functional initiatives collaboratively with peers through influence
Experience with public clouds, Kubernetes, IaC, CI/CD, and observability
Hands-on incident management and postmortem practice
Readiness to participate in an on-call rotation (typically during daytime hours, including weekends/holidays)
Bonus Skills
Experience operating a multi-region follow-the-sun on-call model
Background in chaos/resilience/performance testing
Experience in building or scaling SRE teams and influencing org-wide standards
Coding background with experience improving service reliability
What You’ll Get
21 annual vacation days, additional days based on tenure, plus 4 extra global VeeaMe Days for self-care and 24 paid volunteer hours annually through Veeam Cares
Private health, dental, and vision insurance for employees and dependents, including outpatient care, hospitalization, pregnancy monitoring, and psychology support
Monthly lifestyle and daily meal benefits: 40 RON/day via Edenred and 600 RON/month through a flexible cafeteria platform
Life insurance (2× annual gross salary), critical illness, and disability coverage, plus vision reimbursement
Free access to the Bookster library platform for borrowing your favorite books
Opportunities to learn and grow through on-demand libraries (LinkedIn Learning, O’Reilly), mentoring, w ... (truncated, view full listing at source)