Machine Learning Engineer (NYC)

Output Biosciences
New York HQ 🗽Posted 21 February 2026

Job Description

The RoleJoin our team and help build the world's first biological reasoning model. Work with us to build generative foundational models that decode biological systems across scales - from molecules to organisms - enabling us to predict, understand, and program living systems in ways never before possible.Output is currently in stealth, operated by a team of repeat founders and biotech veterans with multiple exits in AI x Bio, and backed by top-tier VCs including Y Combinator.As a Machine Learning Engineer, you'll work alongside our founders and team members to develop and implement cutting-edge AI systems capable of complex biological reasoning across multiple scales.You will build foundational models for biology capable of reading and writing biology at scaleYou will develop deep generative models for biological applications, exploring innovative architectures to capture the complexities of multi-scale biological systemsYou will work on distributed training systems to scale our models to billions of parameters, optimizing for performance and efficiency across multi-GPU and multi-node setups while handling large-scale biological datasetsYou will engineer efficient data pipelines to manage and process massive biological datasets, addressing challenges in data loading, splitting, and memory optimizationYou will develop and implement robust evaluation frameworks for complex biological models, ensuring data integrity and preventing leakage across dataset splitsWho We're Looking For:You have a Bachelor's in Computer Science, Machine Learning, or a related technical fieldYou have 3+ years of experience in developing and implementing deep generative learning modelsYou have experience pre-training models and are proficient in distributed computing environmentsYou are proficient in Python and have expertise in at least one major deep learning framework (PyTorch, TensorFlow, or JAX)You have experience with deep learning and generative architectures such as transformers, diffusion models and autoencodersYou are skilled in working with terra-scale datasets and scaling models to billions of parametersYou have a strong understanding of machine learning fundamentals, including various model architectures, optimization techniques, and evaluation metricsYou have experience in designing and implementing efficient data pipelines for processing and managing large datasetsYou are experienced in developing robust evaluation frameworks and ensuring data integrity in machine learning projectsYou are experienced in code organization, version control, and collaborative software development practices.Who You Are:You have excellent problem-solving skills and the ability to quickly adapt to new challengesYou exhibit a proactive approach to problem-solving, thinking beyond the specific task, taking ownership of challenges, and pride in solving themYou have a mature mindset in ambiguous situations, helping to frame questions and seek clarity while making decisions in the face of uncertaintyYou have excellent communication skills and can clearly articulate complex technical conceptsYou are motivated by making a real impact and are committed to tackling problems of significant consequence with determination and creativityBonus pointsYou have experience applying machine learning to biology or chemistryYou have contributed to open-source machine learning projects or published research papers in the field of AI/MLYou have experience optimizing machine learning models for high-performance computing environmentsYou are familiar with ML-Ops practices and tools for managing ML experiments and deploymentsOur Values❤️ Heart: We foster a culture of ownership. We are assembling a team of individuals who are passionate and take pride in their contributions.🏆 Excellence: We have an unwavering commitment to excellence and continuously challenge ourselves to reach the highest standards.🚀 Practicality: We value practicality and results-oriented thinking. We are committed t ... (truncated, view full listing at source)