Senior/Staff Deep Reinforcement Learning Engineer
DoorDashSan Francisco, CAPosted 26 March 2026
Job Description
About the Team
Our DD Labs team builds real-time autonomous delivery systems. The Planning Decision-Making group is investing heavily in deep reinforcement learning to move beyond classical planning, learning policies that generalize across novel driving scenarios, handle long-tail edge cases, and improve continuously from large-scale fleet data. Our models jointly handle prediction and planning in a single unified architecture. Our stack is pure JAX end-to-end: the same code you train with is the code that runs on the robot. No C++ rewrites, no TensorRT export. A new policy goes from training to on-vehicle deployment in minutes.
About the Role
As a Senior/Staff Deep RL Engineer, you will design, train, and deploy deep reinforcement learning policies that make real-time driving decisions for our autonomous vehicles. You will own the full lifecycle, from problem formulation and reward design through large-scale distributed training to on-vehicle inference. You'll help define how learned components compose with the rest of the autonomy stack to produce robust, shippable behavior.
You’re excited about this opportunity because you will…
Formulate complex driving tasks as RL problems with well-shaped reward functions and expressive state/action representations.
Design and train model-based deep RL agents using GPU-accelerated simulation at massive scale, including improving the simulator itself.
Build and maintain distributed training infrastructure in JAX across large compute clusters.
Build agentic optimization systems that automatically improve code, run experiments, analyze metrics, and iterate on RL policies with minimal human intervention.
We’re excited about you because…
BS/MS/PhD in CS, EE, Robotics, or a related field, with a strong foundation in reinforcement learning and deep learning.
Hands-on experience training RL agents at scale, ideally in robotics, autonomous driving, or other real-time decision-making domains.
Proficiency in JAX or a similar functional ML framework; comfort with JIT compilation, vectorized environments, and GPU-accelerated simulation.
Deep grasp of core RL concepts: policy gradients, value functions, exploration-exploitation, model-based RL, reward shaping, and sim-to-real transfer.
Data-driven mindset: comfortable building experiment pipelines, analyzing training runs, and letting metrics guide architectural decisions.
Nice to Have
Publications at top venues (NeurIPS, ICML, ICLR, CoRL, RSS, ICRA) on RL or learned planning.
Experience building or working with GPU-accelerated simulators for RL training.
Track record of shipping a learned component in a production robotics or autonomous vehicle stack.
Compensation
The successful candidate’s starting pay will fall within the pay range listed below and is determined based on job-related factors including, but not limited to, skills, experience, qualifications, work location, and market conditions. Base salary is localized according to an employee’s work location. Ranges are market-dependent and may be modified in the future.
In addition to base salary, the compensation for this role includes opportunities for equity grants. Talk to your recruiter for more information.
DoorDash cares about you and your overall well-being. That’s why we offer a comprehensive benefits package to all regular employees, which includes a 401(k) plan with employer matching, 16 weeks of paid parental leave, wellness benefits, commuter benefits match, paid time off and paid sick leave in compliance with applicable laws (e.g. Colorado Healthy Families and Workplaces Act). DoorDash also offers medical, dental, and vision benefits, 11 paid holidays, disability and basic life insurance, family-forming assistance, and a mental health program, among others.
To learn more about our benefits, visit our careers page here .
See below for paid time off details:
For salaried roles: flexible paid time off/vacation, plus 80 hours of paid sick time per year.
For h ... (truncated, view full listing at source)
Apply Now
Direct link to company career page
AI Resume Fit Check
See exactly which skills you match and which are missing before you apply. Free, instant, no spam.
Check my resume fitFree · No credit card
More jobs at DoorDash
See all →Senior Staff Software Engineer - Delivery Experience
San Francisco, CA; Sunnyvale, CA; Los Angeles, CA; New York, NY · 26 March 2026
Growth Merchant Lead - Emerging Markets (Oregon)
Salem, OR; Portland, OR · 26 March 2026
Account Manager, Restaurants
Chicago, IL · 26 March 2026
Associate Manager, Retailer Account Management
New York, NY · 26 March 2026