Senior Hardware Engineer - GPU & AI Infrastructure
RobloxSan Mateo, CA, United StatesPosted 24 February 2026
Job Description
<div class="content-intro"><p><span style="font-weight: 400;">Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators. </span></p>
<p><span style="font-weight: 400;">At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.</span><strong> </strong><span style="font-weight: 400;">We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. </span></p>
<p><span style="font-weight: 400;">A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.</span></p></div><p>As a member of the Infrastructure Foundation Hardware Engineering team, you will play a key role in enabling our mission to deliver a reliable, high-performing, and cost-efficient infrastructure that powers the world’s play. In this specialized role, you will be the technical lead for our GPU and AI accelerator ecosystem. You will be responsible for the full lifecycle of GPU hardware, from initial architectural evaluation and firmware qualification to large-scale fleet integration and performance tuning. You will ensure that Roblox’s massive-scale rendering and ML workloads run on the most optimized and stable hardware possible.</p>
<p><strong>You Will:</strong></p>
<ul>
<li><strong>Architect Prototype:</strong> Prototype next-generation GPU-accelerated hardware platforms, ensuring seamless integration between high-density compute nodes, high-speed interconnects (NVLink/PCIe Gen5/6), and system firmware.</li>
<li><strong>GPU Optimization:</strong> Drive the integration, performance testing, and debugging of GPUs in our fleet, focusing specifically on hardware-level optimizations, driver tuning, and thermal/power management.</li>
<li><strong>Validation Certification:</strong> Develop and execute rigorous evaluation and stress-testing strategies for GPU-heavy server platforms to ensure they meet Roblox’s unique demands for real-time rendering and low-latency AI inference.</li>
<li><strong>Firmware Systems:</strong> Lead firmware qualification (BIOS/BMC) and troubleshooting, implementing automation systems to manage GPU health, firmware updates.</li>
<li><strong>Vendor Collaboration:</strong> Provide technical guidance and deep-dive feedback to hardware vendors. Lead critical investigations into component-level failures, triaging issues across the hardware, driver, and kernel layers.</li>
<li><strong>Observability:</strong> Build and maintain advanced monitoring stacks (Grafana/Prometheus) to track GPU metrics like HBM utilization, thermal throttling events, and PCIe bandwidth saturation.</li>
</ul>
<p><strong>You Have:</strong></p>
<ul>
<li><strong>Education:</strong> BA/BS Degree in Electrical Engineering, Computer Engineering, or related field with equivalent practical experience.</li>
<li><strong>GPU Expertise:</strong> 5+ years of hardware engineering experience with a specific focus on <strong>GPU architecture</strong> (NVIDIA HGX/MGX platforms preferred), AI accelerators, or high-performance compute (HPC) systems.</li>
<li><strong>Deep Technical Knowledge:</strong> In-depth understanding of modern data center technologies, including PCIe fabric, NVLink, InfiniBand, and liquid cooling systems for high-TDP hardware.</li>
<li><strong>Testing Skills:</strong> Hands-on experience testing and validating CPU, Memory (HBM/DDR5), Storage (NVMe), and high-speed networking subsystems in a Linux environment.</li>
<li><strong>Programming:</strong> Proficiency in Python, Go, or C++ for developing hardware validation tools and automation scripts.< ... (truncated, view full listing at source)
Apply Now
Direct link to company career page
More jobs at Roblox
See all →[Summer 2026] People Science - PhD Intern
San Mateo, CA, United States · 28 February 2026
Senior Program Manager, Recruiting Programs
San Mateo, CA, United States · 28 February 2026
Executive Business Partner
San Mateo, CA, United States · 28 February 2026
Principal/Senior Machine Learning Scientist - Search and Discovery
San Mateo, CA, United States · 27 February 2026