Job Description
CoreWeave, the AI Hyperscaler™, acquired Weights Biases to create the most powerful end-to-end platform to develop, deploy, and iterate AI faster. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe, and was ranked as one of the TIME100 most influential companies of 2024. By bringing together CoreWeave’s industry-leading cloud infrastructure with the best-in-class tools AI practitioners know and love from Weights Biases, we’re setting a new standard for how AI is built, trained, and scaled.
The integration of our teams and technologies is accelerating our shared mission: to empower developers with the tools and infrastructure they need to push the boundaries of what AI can do. From experiment tracking and model optimization to high-performance training clusters, agent building, and inference at scale, we’re combining forces to serve the full AI lifecycle — all in one seamless platform.
Weights Biases has long been trusted by over 1,500 organizations — including AstraZeneca, Canva, Cohere, OpenAI, Meta, Snowflake, Square,Toyota, and Wayve — to build better models, AI agents and applications. Now, as part of CoreWeave, that impact is amplified across a broader ecosystem of AI innovators, researchers, and enterprises.
As we unite under one vision, we’re looking for bold thinkers and agile builders who are excited to shape the future of AI alongside us. If you're passionate about solving complex problems at the intersection of software, hardware, and AI, there's never been a more exciting time to join our team.
What You’ll Do:
As Director of Product Management for Weights Biases Weave, you will define and scale the platform that AI developers rely on to build, evaluate, and operate AI agents in production.
You will own the vision, roadmap, and execution for Weave—focusing on agent tracing, evaluation workflows, and production monitoring—ensuring developers can confidently ship reliable, high-performing AI systems.
This role sits at the intersection of LLMs, developer tooling, and production infrastructure, requiring both deep technical fluency and strong product intuition.
About the role
Own the Weave product vision and roadmap, focused on enabling developers to build, evaluate, and monitor AI agents end-to-end
Define how developers trace and debug agent behavior, including multi-step workflows, tool use, and reasoning chains
Lead the development of evaluation systems (evals) that allow teams to measure agent quality, correctness, and performance over time
Drive innovation in production monitoring and observability for AI systems, including logging, metrics, feedback loops, and drift detection
Build workflows that enable rapid iteration—from experimentation to production—closing the loop between evaluation and deployment
Partner closely with engineering to design systems for high-scale data ingestion, real-time analysis, and developer-facing APIs/SDKs
Lead cross-functional initiatives across product, engineering, design, GTM, and customer teams to deliver cohesive, developer-first experiences
Engage directly with customers building cutting-edge AI agents to deeply understand their workflows, pain points, and emerging needs
Define success metrics and ensure Weave delivers measurable improvements in developer velocity, agent quality, and production reliability
Location
This role is based in San Francisco, CA and requires in-office presence at least 3 days per week to support close collaboration with engineering, design, and go-to-market teams.
Who You Are:
7+ years of product management experience, with a strong focus on developer platforms, AI/ML tools, or data/observability systems
Experience building products for AI developers, particularly around LLMs, agents, or ML workflows
AI Agents Evaluation Expertise
Deep understanding of LLM-powered applications and agent architectures (tool use, RAG, orchestration frameworks, etc.)
Experience wi ... (truncated, view full listing at source)