Senior Software Engineer, Research

Sully AI
US - Bay AreaPosted 9 March 2026

Job Description

Senior Software Engineer, Research About Us At Sully.ai http://Sully.ai, We’re Building the Most Impactful Healthcare Company on Earth We believe that access to a great doctor is a basic human right. Today, that’s not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system. Our Mission: One Human, One Doctor. We build AI teammates that augment clinicians — scribes, nurses, receptionists, translators — all powered by our own world-class models and deployed in real-world care. WHAT YOU’LL DO Build and optimize core research infrastructure: evaluation pipelines, agent workflows, hallucination detectors, coding benchmarks, and research→production integrations. Design, implement, and scale agentic systems across backend, frontend, and model integrations, collaborating closely with research and co-founders. Own reliability, observability, and performance across agents (logging, tracing, instrumentation, safety checks). Ship research-proven features into production within 7 days, end-to-end. Develop shared tools, SDKs, and internal products that accelerate iteration across Research, QA, and Engineering. Hard Requirements - Senior-level full-stack engineering experience in React, TypeScript, and Node.js. - Proven ability to design, ship, and scale LLM-powered applications. - Expertise in API design, streaming, and CI/CD pipelines. - Strong cloud infrastructure background (AWS, GCP, or Azure). - Track record of building reliable systems with measurable performance and error budgets. - First-Month Focus - Audit all cross-agent flows for UI/UX consistency, correctness, and performance gaps. - Implement shared components, typed schemas, and contract-driven interfaces for reliability. - Establish instrumentation for frontend performance, agent consistency, latency, and model round-trip tracing. - Improve or replace brittle evaluation or agent pipelines identified during onboarding. - Partner with Research to productionize at least one new capability. - 90 Day OKRs - Deliver production-grade agentic workflows with <5% error rates across evaluation benchmarks. - Launch a cross-agent design system + SDK adopted by at least 2 internal teams. - Establish a weekly deploy + measure cadence with performance dashboards, latency budgets, and error budgets. - Reduce agent latency and failure rates across at least two high-volume workflows. - Ship multiple research-to-production integrations with measurable CSAT or accuracy gains. KEY RESULTS (FIRST 90 DAYS) - Deliver production-grade agentic workflows with end-to-end testing. - Audit all cross-agent flows for UI/UX consistency, correctness, and performance gaps. - Implement shared components, typed schemas, and contract-driven interfaces for reliability. - Establish instrumentation for frontend performance, agent consistency, latency, and model round-trip tracing. - Partner with Research to productionize at least one new capability. Why Join Sully.ai http://Sully.ai? 🔥 Revolutionizing the antiquated $800B+ Healthcare market 🧠 50%+ of us are ex-founders. We hire A-players, not passengers ⚡️ Speed matters - we operate with urgency, autonomy, and ownership 🧪 You’ll work on real, first-of-their-kind problems at the edge of AI and medicine ❤️ Your work helps doctors reclaim their time - and patients get better, faster care Sully.ai is an equal opportunity employer. In addition to EEO being the law, it is a policy that is fully consistent with our principles. All qualified applicants will receive consideration for employment without regard to status as a protected veteran or a qualified individual with a disability, or other protected status such as race, religion, color, national origin, sex, sexual orientation, gender identity, genetic information, pregnancy or age. Sully.ai prohibits any form of workplace harassment.