Senior Software Engineer, Launch & Sandboxes - Weights & Biases

Weights & Biases
Livingston, NJ / New York, NY / San Francisco, CA / Sunnyvale, CA / Bellevue, WA / Remote, US
$165k – $242k
Posted 26 March 2026

Job Description

CoreWeave, the AI Hyperscaler™, acquired Weights & Biases to create the most powerful end-to-end platform to develop, deploy, and iterate AI faster. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe, and was ranked as one of the TIME100 most influential companies of 2024. By bringing together CoreWeave's industry-leading cloud infrastructure with the best-in-class tools AI practitioners know and love from Weights & Biases, we're setting a new standard for how AI is built, trained, and scaled.

The integration of our teams and technologies is accelerating our shared mission: to empower developers with the tools and infrastructure they need to push the boundaries of what AI can do. From experiment tracking and model optimization to high-performance training clusters, agent building, and inference at scale, we're combining forces to serve the full AI lifecycle, all in one seamless platform.

Weights & Biases has long been trusted by over 1,500 organizations, including AstraZeneca, Canva, Cohere, OpenAI, Meta, Snowflake, Square, Toyota, and Wayve, to build better models, AI agents, and applications. Now, as part of CoreWeave, that impact is amplified across a broader ecosystem of AI innovators, researchers, and enterprises.

As we unite under one vision, we're looking for bold thinkers and agile builders who are excited to shape the future of AI alongside us. If you're passionate about solving complex problems at the intersection of software, hardware, and AI, there's never been a more exciting time to join our team.

What You'll Do

You'll be joining the Launch & Sandboxes team within the ML Workflows organization. Our team owns two interconnected product areas: Sandboxes, W&B's cloud execution environments that let ML practitioners run code in isolated, GPU-enabled containers directly from the W&B platform, and Launch, which handles job launching, compute orchestration, and integration with HPC schedulers. We build the infrastructure that connects W&B's experiment tracking and model development tools to the actual compute where ML work happens, spanning Kubernetes clusters, HPC nodes, and cloud VMs.

About the role

As a Senior Engineer on the Launch & Sandboxes team, you'll work across the full stack of our execution infrastructure, from the backend services that manage sandbox orchestration and billing to the Python SDK that ML practitioners interact with daily. Your day-to-day will involve building and scaling our container orchestration layer, developing SDK integrations that make sandboxes a seamless part of the W&B workflow, and extending our compute integrations for HPC customers. You'll work on real distributed systems problems: container lifecycle management, federated authentication, usage-based billing, and secrets management. This is a backend-heavy role (70% Go, 20% Python, 10% TypeScript/React) where you'll operate across multiple services and occasionally debug infrastructure issues in production.

Like any role at a hyperscaler, this comes with high accountability and ownership. You'll own the systems you build end-to-end, from design through production reliability. We expect engineers to take full ownership of their domain, drive technical decisions with conviction, and hold themselves to a high bar on quality and reliability.

Who You Are

- 5+ years of professional software engineering experience
- Strong proficiency in Go for backend service development
- Experience with Python, ideally in the context of SDK or library development
- Hands-on experience with Kubernetes (deploying, debugging, writing operators or controllers)
- Demonstrated ability to design and operate production distributed systems
- Experience with container runtimes and isolation technologies (Docker, containerd, or similar)
- Familiarity with gRPC or similar RPC frameworks
- Experience with message queues or event streaming (Kafka, PubSub, or similar)
- Strong understanding of authentication ...
(truncated, view full listing at source)