Research Engineer, Multimodal Generative AI (Image/Video)

Google Deepmind
Kirkland, Washington, US; Seattle, Washington, USPosted 12 March 2026

Job Description

Research Engineer, Multimodal Generative AI (Image/Video) Seattle, WA At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know. Snapshot The role of the Research Engineer will be to develop state-of-the-art methods for multimodal generative AI models, with a primary focus on image generation and editing . This role is for the team behind “Nano Banana”. At Google DeepMind, we've built a unique culture and work environment where long-term ambitious research can flourish. Our special interdisciplinary team combines the best techniques from deep learning, reinforcement learning, and systems neuroscience to build general-purpose learning algorithms. We have already made a number of high-profile breakthroughs towards building artificial general intelligence, and we have all the ingredients in place to make further significant progress over the coming year! About Us Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority. The Role Research Engineers at Google DeepMind lead our efforts in developing novel tools, infrastructure, and algorithms towards the end goal of solving and building Artificial General Intelligence. Having pioneered research in the world's leading academic and industrial labs, PhDs, post-docs, or professorships, Research Engineers join Google DeepMind to work collaboratively within and across Research fields. They are expected to independently build state of the art foundation models and research infrastructure, work with teams on large scale AI, and develop solutions to fundamental questions in machine learning and AI. Drawing on expertise from a variety of disciplines including deep learning, computer vision, language modeling, and advanced generative architectures, our Research Engineers are at the forefront of groundbreaking research. Key Responsibilities Design, rapidly implement, and rigorously evaluate cutting-edge deep learning algorithms, data curation, and evaluation infrastructure for multimodal generative AI, with a particular emphasis on image synthesis. Report and present research findings and developments clearly and efficiently both internally and externally, verbally and in writing. Suggest and engage in team collaborations to meet ambitious research goals, while also driving significant individual contributions. Work in collaboration with our Ethics and Governance teams to ensure our advances in intelligence are developed ethically and provide broad benefits to humanity. About You In order to set you up for success as a Research Engineer at Google DeepMind, we look for the following skills and experience: PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, or equivalent practical experience. Proven experience in deep learning research and development, particularly in generative AI and related to image synthesis. This includes diffusion models and autoregressive generative models. Experience with post-training is a plus. Exceptional engineering skills in Python and deep learning frameworks (e.g., Jax, TensorFlow, PyTorch), with a track ... (truncated, view full listing at source)