Senior Software Engineer, Research
Sully AI, US - Bay Area. Posted 9 March 2026
Job Description
About Us
At Sully.ai (http://Sully.ai), We’re Building the Most Impactful Healthcare Company on Earth
We believe that access to a great doctor is a basic human right. Today, that’s not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system.
Our Mission: One Human, One Doctor. We build AI teammates that augment clinicians — scribes, nurses, receptionists, translators — all powered by our own world-class models and deployed in real-world care.
WHAT YOU’LL DO
- Build and optimize core research infrastructure: evaluation pipelines, agent workflows, hallucination detectors, coding benchmarks, and research-to-production integrations.
- Design, implement, and scale agentic systems across backend, frontend, and model integrations, collaborating closely with research and co-founders.
- Own reliability, observability, and performance across agents (logging, tracing, instrumentation, safety checks).
- Ship research-proven features into production within 7 days, end-to-end.
- Develop shared tools, SDKs, and internal products that accelerate iteration across Research, QA, and Engineering.
Hard Requirements
- Senior-level full-stack engineering experience in React, TypeScript, and Node.js.
- Proven ability to design, ship, and scale LLM-powered applications.
- Expertise in API design, streaming, and CI/CD pipelines.
- Strong cloud infrastructure background (AWS, GCP, or Azure).
- Track record of building reliable systems with measurable performance and error budgets.
First-Month Focus
- Audit all cross-agent flows for UI/UX consistency, correctness, and performance gaps.
- Implement shared components, typed schemas, and contract-driven interfaces for reliability.
- Establish instrumentation for frontend performance, agent consistency, latency, and model round-trip tracing.
- Improve or replace brittle evaluation or agent pipelines identified during onboarding.
- Partner with Research to productionize at least one new capability.
90-Day OKRs
- Deliver production-grade agentic workflows with <5% error rates across evaluation benchmarks.
- Launch a cross-agent design system + SDK adopted by at least 2 internal teams.
- Establish a weekly deploy + measure cadence with performance dashboards, latency budgets, and error budgets.
- Reduce agent latency and failure rates across at least two high-volume workflows.
- Ship multiple research-to-production integrations with measurable CSAT or accuracy gains.
KEY RESULTS (FIRST 90 DAYS)
- Deliver production-grade agentic workflows with end-to-end testing.
- Audit all cross-agent flows for UI/UX consistency, correctness, and performance gaps.
- Implement shared components, typed schemas, and contract-driven interfaces for reliability.
- Establish instrumentation for frontend performance, agent consistency, latency, and model round-trip tracing.
- Partner with Research to productionize at least one new capability.
Why Join Sully.ai?
🔥 Revolutionizing the antiquated $800B+ Healthcare market
🧠 50%+ of us are ex-founders. We hire A-players, not passengers
⚡️ Speed matters - we operate with urgency, autonomy, and ownership
🧪 You’ll work on real, first-of-their-kind problems at the edge of AI and medicine
❤️ Your work helps doctors reclaim their time - and patients get better, faster care
Sully.ai is an equal opportunity employer. In addition to EEO being the law, it is a policy that is fully consistent with our principles. All qualified applicants will receive consideration for employment without regard to status as a protected veteran or a qualified individual with a disability, or other protected status such as race, religion, color, national origin, sex, sexual orientation, gender identity, genetic information, pregnancy or age. Sully.ai prohibits any form of workplace harassment.