Job Description
<div class="content-intro"><p><span style="font-weight: 400;">Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.</span></p>
<p><span style="font-weight: 400;">At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. </span><span style="font-weight: 400;">We’re on a mission to connect a billion people with optimism and civility, and we’re looking for amazing talent to help us get there.</span></p>
<p><span style="font-weight: 400;">A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.</span></p></div><p>As a <strong>Data Center Technician II</strong>, you'll independently manage and prioritize host repair efforts for multiple data centers, perform initial troubleshooting for ambiguous hardware and network issues, and own well-defined projects with guidance from senior engineers. You'll also help us scale our Core/Edge Data Centers and hardware infrastructure at a time of incredible growth for our business.</p>
<p><strong>You will:</strong></p>
<ul>
<li><strong>Manage and prioritize</strong> your ticket queue according to defined priorities, performing initial troubleshooting for server and network issues, and escalating clearly when problems fall outside standard procedures.</li>
<li><strong>Maintain the Core Data Center</strong> and hardware infrastructure to meet the large-scale, real-time requirements of our Imagination Platform™ so that our community has an awesome experience anywhere in the world. This includes all aspects of the server, network infrastructure, power, and environmental life cycles.</li>
<li><strong>Collaborate across regions</strong> to track and mitigate systemic issues preventing hosts from returning to service.</li>
<li><strong>Identify and solve</strong> recurring operational problems through root cause analysis, and propose improvements to runbooks, SOPs, and MOPs to prevent recurrence.</li>
<li><strong>Contribute data, feedback, and requirements</strong> to partners building automation, ensuring that automation reflects real-world operational workflows.</li>
<li><strong>Coordinate with peers</strong> to establish and uphold best practices related to break-fix, installation, decommissioning, and all other aspects of data center operations.</li>
<li><strong>Influence and improve</strong> the development platform, infrastructure, standards (runbooks, SOPs, MOPs), and methods to achieve scalability and high availability.</li>
<li><strong>Leverage partnerships across teams</strong> to ensure prompt expansion and recovery of hardware capacity.</li>
<li><strong>Actively participate</strong> in continuous improvement and ongoing learning within the engineering team.</li>
<li><strong>Assist </strong>in coordinating vendors and ensuring the quality of outsourced projects.</li>
<li><strong>Participate </strong>in the on-call rotation for our critical infrastructure.</li>
<li><strong>Travel: </strong>International and domestic travel may be required 25% of the time.</li>
</ul>
<p><strong>You have:</strong></p>
<ul>
<li>At least 3 years of experience working in large-scale data center infrastructure environments, as well as experience planning, executing, and documenting repairs in the server and networking domains.</li>
<li>Extensive experience installing, monitoring, and maintaining server and network equipment, including provisioning brand-new servers and network equipment.</li>
<li>In-depth knowledge of data center environments, servers, and network equipment.</li>
<li>Proven experience executing multiple tasks simultaneously.</li> ... (truncated, view full listing at source)