Staff Software Engineer - ML Michelangelo

Uber
Sunnyvale, United StatesPosted 5 March 2026

Job Description

Staff Software Engineer - ML Michelangelo Department: Engineering Team: Machine Learning Location: Sunnyvale, United States Type: Full-Time **About the Role** Partners with stakeholders and leads team efforts to build and maintain Machine Learning backend services and solutions to support user-facing products, downstream services, or infrastructure tools and platforms used across Uber. What the Candidate Will Do ---- 1. Design and build tools to empower production teams to innovate and productionize state-of-the-art deep learning models at Uber. 2. Develop and maintain scalable, end-to-end deep learning training systems and frameworks. 3. Ensure distributed training tools are reliable, efficient, flexible to use for new production use cases. 4. Collaborate with cross-functional teams including machine learning engineers, backend engineers, data scientists, and data engineers to deliver robust ML solutions for Uber. \-\-\-\- Basic Qualifications ---- 1. Master in relevant fields (CS, EE, Math, Stats, etc.) AND 8+years full-time Software Engineering work experience in deep learning 2. Proficiency in Python and PyTorch 3. Expertise in designing, debugging, and optimizing distributed deep learning systems. 4. Working experience of distributed training in PyTorch at Scale (e.g., data parallelism, model parallelism). 5. Strong ability to translate complex DL requirements and problems into scalable solutions. \-\-\-\- Preferred Qualifications ---- 1. Expertise in distributed training frameworks such as DDP, DeepSpeed, FSDP, or TorchRec. 2. Familiarity with C++, Go or CUDA programming. 3. Expertise in optimizing GPU/TPU training performance and data loading efficiency. 4. Familiarity with large-scale distributed infrastructure tools like Ray, OpenAI Triton, PyTorch Lightning. 5. Built and deployed end-to-end machine learning systems in production. 6. Experience training large models (10B+ parameters), such as large recommendation systems or large language models (LLMs) 7. PhD in relevant fields (CS, EE, Math, Stats, etc.) For Sunnyvale, CA-based roles: The base salary range for this role is USD$232,000
Apply Now

Direct link to company career page

Share this job