Senior Software Engineer - Red Hat AI Inference Server (EMEA) - Q2 role
Red Hat · 8 Locations · Posted 27 March 2026
Job Description
At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model compression, our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments.
We are seeking an experienced Senior Software Engineer to work closely with our technical and research teams on vLLM, llm-compressor, speculators, and llm-d, create DevOps and CI/CD infrastructure, and scale our current technology stack. If you want to contribute to solving challenging technical problems at the forefront of AI inference, this is the role for you! You would be joining the core team behind 2025's most popular open source project on GitHub.
In this role, your primary responsibility will be to build and release the Red Hat AI Inference Server, continuously improve the processes and tooling used by the DevOps team, and find opportunities to automate procedures and tasks.
Join us in shaping the future of AI!
What you will do
Collaborate with research and product development teams to scale machine learning products for internal and external applications
Actively contribute to managing and releasing upstream and midstream product builds
Test to ensure correctness, responsiveness, and efficiency
Troubleshoot, debug and upgrade Dev & Test pipelines
Identify and deploy cybersecurity measures by continuously performing vulnerability assessments and risk management
Collaborate with a cross-functional team on market requirements and best practices
Keep abreast of the latest technologies and standards in the field
What you will bring
2 years of experience in MLOps, DevOps, Automation and/or modern Software Deployment practices
Experience with Release Engineering
Experience evaluating LLMs for performance and accuracy (think HellaSwag, MMLU, Chatbot Arena, TruthfulQA, etc.)
Strong proficiency with Python and pytest is a must
Strong experience with Git, GitHub Actions (including self-hosted runners), Buildkite, Terraform, Jenkins, Ansible, and/or other common technologies for automation and monitoring
Experience administering Kubernetes/OpenShift and/or Docker/Podman
Experience with Cloud Computing using at least one of the following Cloud infrastructures: AWS, GCP, Azure, or IBM Cloud
Familiar with Agile development methodology
The following is considered a plus
Familiarity with contributing to the vLLM CI community
Experience maintaining an infrastructure and ensuring stability
About Red Hat
Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40 countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.
Inclusion at Red Hat
Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welc ... (truncated, view full listing at source)