Storage Engineer

TensorWave
Las Vegas, NevadaPosted 24 February 2026

Job Description

Our mission at TensorWave Cloud is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation.About the roleWe are looking for a Storage Engineer with deep expertise in NFS-based storage and modern high-performance file systems, specifically VAST Data and WEKA. This role exists to ensure our shared storage platforms are fast, reliable, scalable, and boring — even under extreme load.You will own the design, operation, and performance of our file storage layer, supporting workloads that depend on low latency, high throughput, and predictable behavior. This is a hands-on role for someone who understands storage at the protocol and system level, not just from a dashboard.If you think in terms of NFS semantics, metadata performance, failure domains, and throughput per node, this role is for you.ResponsibilitiesDesign, deploy, and operate NFS-based storage systems for production workloadsOwn and operate VAST Data and WEKA clusters in production environmentsArchitect storage for high-throughput, low-latency shared file accessTune and optimize NFS performance (mount options, client behavior, server-side tuning)Manage capacity planning, scaling, and rebalancing for VAST and WEKA systemsDiagnose and resolve storage performance issues (latency spikes, metadata bottlenecks, throughput drops)Design and test failure and recovery scenarios (node failures, network issues, disk loss)Lead upgrades, expansions, and maintenance with minimal or zero downtimePartner with infrastructure and application teams to ensure workloads are well-matched to storage behaviorDocument operational runbooks and establish best practices for shared file storageYou Are Obsessed With:NFS that behaves predictably under loadConsistent latency and throughput at scaleUnderstanding exactly how storage fails — before it doesFile systems that scale without becoming fragileMaking shared storage invisible to users because it just worksRequired ExperienceStrong hands-on experience with NFS in production environmentsDirect experience operating VAST Data and/or WEKA systemsDeep understanding of distributed file systems and shared storage architecturesStrong knowledge of storage performance fundamentals (latency, throughput, metadata operations)Experience troubleshooting complex storage and networking interactionsSolid Linux systems knowledge, especially around filesystem and I/O behaviorAbility to reason about failure domains, recovery paths, and data integrityPreferred ExperienceExperience supporting AI/ML, HPC, or data-intensive workloadsFamiliarity with RDMA, high-speed networking, or NVMe-based storageKubernetes workloads backed by shared file systemExperience with multi-rack or multi-site storage deploymentsInfrastructure-as-Code experience or automation experienceWhat We BringMission driven companyCompetitive SalaryStock Options100% paid Medical, Dental, and Vision insuranceLife and Voluntary Supplemental InsuranceShort Term Disability InsuranceFlexible Spending Account401(k)Flexible PTOPaid HolidaysParental LeaveMental Health Benefits through Spring HealthWe’re looking for resilient, adaptable people to join our team, people who believe in the mission and think at massive scale. The solutions that worked on a handful of devices will not work at Exascale. Be prepared to be pushed daily, to learn a lot, and literally build the future.TensorWave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, national origin, or veteran status.