Machine Learning Engineer, AI Models

Cyprus
Posted 24 February 2026

Tech Stack

C++ Scala AWS TensorFlow PyTorch AI LLM SEM Compensation

Job Description

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists has developed a high performance RISC-V CPU from scratch and shares a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

Join Tenstorrent’s AI Models team and work at the layer most ML engineers never see: bringing advanced models to life on custom AI hardware. You’ll own real workloads end‑to‑end, including porting, tuning, and validating LLMs and vision models on our accelerator, and chasing down every last millisecond and percentage point of accuracy. This role is for people who love the craft of ML engineering and want their work to matter at silicon scale, not just behind another API.

This role is hybrid, based in Cyprus. We welcome candidates at various experience levels for this role. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.

Who You Are

- Bring up, run, and debug modern ML models (e.g., transformers) using PyTorch or TensorFlow.
- Analyze model behavior and performance, and identify bottlenecks across the stack.
- Improve efficiency, correctness, and scalability of model execution in real systems.
- Work closely with compiler, kernel, and hardware teams to drive performance and system-level improvements.
- Help translate state-of-the-art model architectures into production-grade, high-performance deployments.

What We Need

- Strong experience building and working with ML models in PyTorch or TensorFlow.
- Strong understanding of modern ML model architectures (e.g., transformers).
- Solid software engineering fundamentals with strong debugging and problem-solving skills.
- Comfort working in a fast-moving, research-meets-engineering environment.
- Bonus, not required: experience with profiling or performance tuning, or familiarity with quantization, flash attention, kernel fusion, memory hierarchies, C++, CUDA, or systems programming.

What You Will Learn

- How to bring state‑of‑the‑art LLMs and vision models to high performance on a custom AI accelerator.
- How to trace and fix performance bottlenecks from PyTorch code down to kernels and memory systems.
- How to turn research‑grade models into reliable production deployments on new hardware.
- The practical trade‑offs between techniques like quantization, FlashAttention, and kernel fusion when you’re optimizing real throughput, latency, and memory.
- How your findings can drive changes across compiler, kernel, and hardware teams in a full‑stack co‑design loop.

Tenstorrent offers ... (truncated, view full listing at source)
