Director of Product Management, W&B Weave (AI Agents & Evaluation Platform)- W&B

Weights and Biases
San Fransisco, CA $206k – $303kPosted 15 April 2026

Job Description

CoreWeave, the AI Hyperscaler™, acquired Weights Biases to create the most powerful end-to-end platform to develop, deploy, and iterate AI faster. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe, and was ranked as one of the TIME100 most influential companies of 2024. By bringing together CoreWeave’s industry-leading cloud infrastructure with the best-in-class tools AI practitioners know and love from Weights Biases, we’re setting a new standard for how AI is built, trained, and scaled. The integration of our teams and technologies is accelerating our shared mission: to empower developers with the tools and infrastructure they need to push the boundaries of what AI can do. From experiment tracking and model optimization to high-performance training clusters, agent building, and inference at scale, we’re combining forces to serve the full AI lifecycle — all in one seamless platform. Weights Biases has long been trusted by over 1,500 organizations — including AstraZeneca, Canva, Cohere, OpenAI, Meta, Snowflake, Square,Toyota, and Wayve — to build better models, AI agents and applications. Now, as part of CoreWeave, that impact is amplified across a broader ecosystem of AI innovators, researchers, and enterprises. As we unite under one vision, we’re looking for bold thinkers and agile builders who are excited to shape the future of AI alongside us. If you're passionate about solving complex problems at the intersection of software, hardware, and AI, there's never been a more exciting time to join our team. What You’ll Do: As Director of Product Management for Weights Biases Weave, you will define and scale the platform that AI developers rely on to build, evaluate, and operate AI agents in production. You will own the vision, roadmap, and execution for Weave—focusing on agent tracing, evaluation workflows, and production monitoring—ensuring developers can confidently ship reliable, high-performing AI systems. This role sits at the intersection of LLMs, developer tooling, and production infrastructure, requiring both deep technical fluency and strong product intuition. About the role Own the Weave product vision and roadmap, focused on enabling developers to build, evaluate, and monitor AI agents end-to-end Define how developers trace and debug agent behavior, including multi-step workflows, tool use, and reasoning chains Lead the development of evaluation systems (evals) that allow teams to measure agent quality, correctness, and performance over time Drive innovation in production monitoring and observability for AI systems, including logging, metrics, feedback loops, and drift detection Build workflows that enable rapid iteration—from experimentation to production—closing the loop between evaluation and deployment Partner closely with engineering to design systems for high-scale data ingestion, real-time analysis, and developer-facing APIs/SDKs Lead cross-functional initiatives across product, engineering, design, GTM, and customer teams to deliver cohesive, developer-first experiences Engage directly with customers building cutting-edge AI agents to deeply understand their workflows, pain points, and emerging needs Define success metrics and ensure Weave delivers measurable improvements in developer velocity, agent quality, and production reliability Location This role is based in San Francisco, CA and requires in-office presence at least 3 days per week to support close collaboration with engineering, design, and go-to-market teams. Who You Are: 7+ years of product management experience, with a strong focus on developer platforms, AI/ML tools, or data/observability systems Experience building products for AI developers, particularly around LLMs, agents, or ML workflows AI Agents Evaluation Expertise Deep understanding of LLM-powered applications and agent architectures (tool use, RAG, orchestration frameworks, etc.) Experience wi ... (truncated, view full listing at source)
Apply Now

Direct link to company career page

AI Resume Fit Check

See exactly which skills you match and which are missing before you apply. Free, instant, no spam.

Check my resume fit

Free · No credit card

Share