Staff Engineer, HPC Infrastructure
TenstorrentAustin, Texas, United States; Santa Clara, California, United States; Toronto, Ontario, Canada$100k – $500kPosted 24 February 2026
Job Description
<div class="content-intro"><p>Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.</p></div><p>We're seeking a Staff HPC Engineer who thrives on turning hundreds of bare-metal compute nodes into consistent, production-ready clusters through automation and infrastructure-as-code. You'll design and maintain OS deployment pipelines that provision nodes in minutes, use Ansible to eliminate configuration drift across global sites, and ensure RHEL/Ubuntu systems stay performant and reliable as our compute demands scale exponentially. In semiconductor design, where millions of EDA jobs run daily, your automation work directly translates to faster design cycles and higher cluster utilization.</p>
<p class="whitespace-normal break-words">This role is hybrid, based out of Austin, TX, Santa Clara, CA, or Toronto, CA.</p>
<p class="whitespace-normal break-words">We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.</p>
<hr class="border-border-300 my-2">
<p><strong>Who You Are</strong></p>
<ul>
<li>Deep experience with IBM Spectrum LSF or similar workload managers.</li>
<li>Strong background in commercial HPC storage platforms such as Pure Storage FlashBlade, Weka, NetApp, etc.</li>
<li>Hands-on experience with container technologies (Docker, Singularity, Podman).</li>
<li>Solid Linux system administration skills.</li>
<li>Understanding of HPC networking, storage architectures, and job scheduling.</li>
<li>Ability to diagnose and resolve complex infrastructure issues independently.</li>
<li>Comfortable working in a startup environment with rapidly changing requirements.</li>
</ul>
<hr class="border-border-300 my-2">
<p><strong>What We Need</strong></p>
<ul>
<li>Design and maintain automated bare-metal provisioning pipelines that deploy hundreds of compute nodes globally with consistent configurations.</li>
<li>Implement infrastructure-as-code practices using Ansible to manage large-scale OS configuration across diverse hardware platforms.</li>
<li>Own the lifecycle management of RHEL and Ubuntu systems—from initial deployment through patching, upgrades, and performance tuning.</li>
<li>Build automation and tooling to streamline provisioning, patching, and system updates as the compute environment scales.</li>
<li>Troubleshoot OS-level issues, optimize kernel parameters, and resolve system performance bottlenecks that impact EDA workflows.</li>
<li>Work directly with hardware design teams to standardize system configurations, toolchains, and development environments.</li>
<li>Deploy and lifecycle manage systems across Tenstorrent's global engineering sites, ensuring consistency and reliability.</li>
</ul>
<hr class="border-border-300 my-2">
<p class="whitespace-normal break-words"><strong>Nice to Have</strong></p>
<ul class="[:not(:last-child)_ul]:pb-1 [:not(:last-child)_ol]:pb-1 list-disc space-y-2.5 pl-7">
<li>Experience supporting EDA tools and hardware design workflows in production HPC environments.</li>
<li>Hands-on expertise with commercial HPC storage platforms (Pure Storage, Weka, NetApp) and workload managers (LSF, Slurm).</li>
<li>Container technologies (Docker, Singularity, Podman) for reproducible compute environments at scale.</li>
<li>Advanced provisioning techniques (PXE boot, ... (truncated, view full listing at source)
Apply Now
Direct link to company career page
More jobs at Tenstorrent
See all →More AWS jobs
See all →Associate Manager, New Verticals - Consumer Financials Strategy & Operations
DoorDash · New York, NY; San Francisco, CA; Chicago, IL; Seattle, WA; Los Angeles, CA; Washington DC
Associate, Quality Strategy & Operations
DoorDash · United States - Remote
Creative Project Manager
DoorDash · Los Angeles,CA; San Francisco, CA; New York, NY
Manager, New Verticals - Gift Card Strategy & Operations
DoorDash · New York, NY; San Francisco, CA; Los Angeles, CA; Seattle, WA; Washington, DC