Engineering Manager, AI Cloud Platform
Lambda · San Francisco Office · Posted 11 September 2025
Job Description
We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be.

If you'd like to build the world's best deep learning cloud, join us.

Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems, as well as internal tooling for system deployment, management, and maintenance.

What you’ll do
Lead the AI Cloud Core Platform team of ~6 engineers, with end-to-end ownership of Cloud Platform and governance capabilities.
Drive execution of roadmap features including cluster lifecycle automation.
Partner closely with Product and Design to ensure the user experience matches the needs of enterprise customers.
Balance rapid feature delivery with longer-term investments in scalability, observability, and platform design.
Hire, mentor, and grow a team of engineers, providing career development and feedback.
Collaborate with other Lambda teams (Control Plane, Billing, Platform) to ensure smooth, integrated delivery across the stack.
Contribute to a culture of high performance, documentation, humility, and curiosity.
Be product-focused in your leadership and execution, always placing the needs of the customer first, with a particular focus on feature velocity, reliability, and security.
Shape a culture of sustainable, empathetic, and high-velocity engineering, with a deep focus on cross-team collaboration, documentation, and data-driven decision-making.

You
5+ years in a full-time management role at a high-growth technology company.
10+ years of industry experience in software engineering, with a focus on large-scale distributed systems and backend systems.
Proven record of leading and building engineering teams that work on mission-critical, high-performance systems.
Proven track record leading teams that deliver enterprise features or governance platforms.
Exceptional leadership skills that encompass leading by trust, building empathy with your reports and other teams, and maintaining a sustainable but rapid velocity.
Demonstrated expertise in managing long-term projects alongside urgent, short-term priorities and incident resolution.
Extensive experience collaborating with product, sales, and other engineering teams to build cohesive products with a focus on user experience and reliability.
Ability to understand, review, and structure Python and Go applications.

Nice to Have
Experience with IAM, authentication/authorization (SSO, RBAC, SCIM), governance tooling, or compliance features.
Background building cloud application platforms.
Experience managing a remote, distributed team.

Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda
Founded in 2012, ~400 employees (2025) and growing fast.
We offer generous cash & equity compensation.
Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability.
Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG.
Health, dental, and vision coverage for you and your dependents.
Wellness ... (truncated, view full listing at source)