AI Inference Engineer - Model Optimization & Deployment
ZooxFoster City, CAPosted 11 April 2026
Tech Stack
Job Description
The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.
As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.
Apply Now
Direct link to company career page
AI Resume Fit Check
See exactly which skills you match and which are missing before you apply. Free, instant, no spam.
Check my resume fitFree · No credit card
More jobs at Zoox
See all →PhD Research Intern, Offline Driving Intelligence
Foster City, CA · 19 April 2026
Engineering Manager, Compute Optimization
Foster City, CA · 18 April 2026
Part Time Student Worker – Performance Optimization & Agentic Systems
Foster City, CA · 18 April 2026
PhD Research Intern, Physical AI in Perception
Foster City, CA · 18 April 2026