Job Description
<div class="content-intro"><h3><strong>About Us</strong></h3>
<div>Temporal is an open-source programming model that can simplify code, make applications more reliable, and help developers focus on the important things, like delivering features faster. We are on a mission to be the reliable foundation of every developer’s toolbox, and we are building the team that will make that happen.</div>
<div> </div>
<div>Our values guide us: they are present in how we show up, make decisions, and work together to make an impact. We’re curious, driven, collaborative, genuine, and humble.</div>
<div> </div>
<div>Temporal is growing and we are looking for those who share our values, challenge 'standard' thinking, and want to influence our future. If you have a passion for improving the developer experience, building world-class open-source software and communities, and want to be a part of our amazing team, we'd love to hear from you!</div></div><h2>Summary</h2>
<p>Join our team as a Senior Product Manager for Agentic Coding, where you'll lead the effort to make Temporal the best-supported technology for AI-assisted development. In this role, you'll define how LLM-based coding assistants understand and generate Temporal code, directly impacting developer onboarding, productivity, and time-to-production.</p>
<p>You'll build benchmark suites to measure LLM performance on Temporal tasks, create context files and skills that improve AI coding accuracy, and work cross-functionally with Engineering, DevRel, and Documentation to ensure developers succeed whether they're learning Temporal through Claude Code, Cursor, or any other AI assistant.</p>
<h2>What You'll Do</h2>
<ul>
<li>Define and track success metrics for AI-assisted Temporal development (benchmark scores, activation rates, time-to-production)</li>
<li>Build and iterate on a "SWE-Bench for Temporal": a benchmark that evaluates LLM performance on real Temporal development tasks</li>
<li>Create, test, and validate context files (<a href="http://agents.md/">agents.md</a>, Cursor rules, Claude skills) that improve how coding assistants write Temporal code</li>
<li>Research and prioritize which LLM performance gaps to address based on user impact</li>
<li>Drive fast hypothesis-test-iterate cycles to continuously improve AI coding assistant performance</li>
<li>Partner with DevRel to ensure documentation, samples, and content are optimized for LLM consumption</li>
<li>Collaborate with AI Engineering on benchmark infrastructure and evaluation pipelines</li>
<li>Work with external AI assistant vendors to improve Temporal support</li>
<li>Instrument developer journeys to understand where LLM-assisted users struggle and close feedback loops</li>
</ul>
<h2>What You'll Bring</h2>
<ul>
<li>5+ years of product management experience; developer tools or developer experience strongly preferred</li>
<li>Deep personal experience using AI coding assistants (Claude Code, Cursor, Copilot, etc.) in real development work</li>
<li>Strong prompt engineering skills, with opinions grounded in hands-on experience on how to evaluate and refine LLM workflows</li>
<li>Ability to write and evaluate code; comfort with multiple programming languages</li>
<li>Analytical mindset with experience defining and tracking metrics</li>
<li>Experience working cross-functionally with engineering, DevRel, and documentation teams</li>
<li>Excellent written and verbal communication skills</li>
<li>Self-directed, with the ability to work autonomously in a fast-moving space</li>
</ul>
<h2>Nice-to-Have</h2>
<ul>
<li>Experience with benchmark design and evaluation methodologies</li>
<li>Familiarity with Temporal or distributed systems concepts</li>
<li>Background in developer relations, technical writing, or developer education</li>
<li>Experience with open-source communities</li>
<li>Understanding of LLM architectures and how context affects model outputs</li>
</ul>
<h2>Compensation</h2>
<ul>
<li>Base Salary Range: $180,000 - $230,000, depending on quali ... (truncated, view full listing at source)