Infrastructure Operations Manager New US

Full-Time 42000 - 98000 £ / year (est.) Home office (partial)
Nscale Ltd.

At a Glance

  • Tasks: Lead a team to manage AI Data Centre operations and ensure optimal performance.
  • Company: Join Nscale, the innovative GPU cloud powering AI for startups and enterprises.
  • Benefits: Competitive salary, equity options, flexible work, and continuous learning opportunities.
  • Other info: Embrace a remote-first culture that values diversity and personal growth.
  • Why this job: Make a real impact in cutting-edge AI technology while growing your career.
  • Qualifications: 5+ years in data centre management with strong leadership and technical skills.

The predicted salary is between £42,000 and £98,000 per year.

Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.

At Nscale, our Operations team plays a critical role in maintaining service availability, driving service reliability, and responding rapidly to customer tickets. We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future.

About The Role

We are looking for an experienced Team Lead to take overall responsibility for the devices and infrastructure in our AI Data Center. The Data Centre Operations Manager will ensure operational excellence, oversee personnel, maintain critical infrastructure, and meet client-driven SLA and KPI requirements. Reporting to the Site Director, this role involves close collaboration with internal teams, clients, and external vendors. It also includes on-call duties and occasional travel to other sites as business needs arise.

What You’ll Be Doing

  • Site & Infrastructure Management
    • Overall accountability for all devices, systems, and infrastructure at the data center.
    • Ensure 24/7 operational reliability and optimal performance for high-performance AI workloads.
    • Monitor and manage power, cooling, and environmental conditions, addressing any operational risks proactively.
    • Oversee installation, configuration, and maintenance of HPC and GPU systems.
  • Team Leadership
    • Lead and mentor a team of engineers, technicians, and support staff.
    • Provide training and guidance to junior staff, ensuring the team is equipped to manage site operations effectively.
    • Plan and manage shift schedules to guarantee continuous coverage and on-call availability.
  • Client & Vendor Management
    • Serve as the primary point of contact for clients, providing regular reporting on site SLAs and KPIs.
    • Collaborate with vendors and contractors to manage procurement, repairs, and upgrades.
    • Build and maintain strong relationships to ensure smooth operation and project delivery.
  • Inventory & Resource Management
    • Manage spare parts inventory, ensuring sufficient stock levels to minimize downtime during repairs or upgrades.
    • Track usage and coordinate procurement to avoid supply shortages.
  • Technical Expertise
    • Maintain a strong working knowledge of HPC and GPU installations, including hardware configuration, network architecture, and performance optimization.
    • Troubleshoot and resolve technical issues, escalating complex problems when necessary.
    • Stay updated on emerging trends and technologies in AI and HPC infrastructure.
  • Operational Monitoring & Reporting
    • Implement and manage tools to monitor data center performance and resource utilization.
    • Prepare detailed reports for senior management and clients on performance metrics, downtime, and operational efficiency.
    • Propose and implement improvements to enhance reliability, scalability, and cost-effectiveness.

    About You

    • Bachelor’s degree in Computer Science, Engineering, or a related field.
    • 5+ years of experience managing data centers, particularly in HPC and GPU environments.
    • Proven leadership skills with experience managing and developing teams.
    • Technical expertise in HPC, GPU deployment, and associated hardware/software solutions.
    • Strong understanding of power, cooling, and environmental systems in data centers.
    • Excellent client-facing communication and reporting skills.
    • Experience with inventory and spare parts management.
    • Based on-site at the data center with occasional travel to other locations as required.
    • Participate in an on-call rotation to respond to urgent operational issues.
    • Commitment to continuous learning to stay ahead of evolving technologies and standards.

    Nice to Have

    • Certifications in data center management (e.g., CDCP, CDCS, or similar).
    • Hands-on experience with NVIDIA GPUs, CUDA, and AI frameworks.
    • Familiarity with hybrid cloud/HPC environments.

    What We Can Offer You

    At Nscale, you'll find a collaborative, supportive, and innovative environment where your contributions have real impact. We're building something extraordinary, and we want you at the core. We offer a highly competitive package (base + equity) with reviews every 12 months. Join the fastest-growing tech startup: this is your chance to push boundaries, collaborate with brilliant minds, and make your mark on cutting-edge AI. Expect a dynamic progression plan tailored to your ambitions. Grow by trying new things, leading, challenging the status quo, and owning your impact, always with our full support.

    Human-First Flexibility: We treat you as humans first. Our flexible workplace trusts Nscalers to deliver, giving you the autonomy to shape your day around life's moments. Join our thriving remote-first team. Geography is no barrier to impact or connection. We build seamless virtual collaboration, empowering you, wherever you work.

    We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds. If there’s anything we can do to accommodate your specific situation, please let us know.

    The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.

    Salary Range: Actual compensation may vary based on job-related factors such as skill set, experience, education, and location. In addition to base salary, this role may be eligible for bonus, equity, and/or commission programs. Nscale may offer a competitive benefits package including medical, dental, vision, flexible paid time off, parental leave, and retirement plan participation.

    Salary Range $60,000 - $140,000 USD

    Infrastructure Operations Manager New US employer: Nscale Ltd.

    At Nscale, we pride ourselves on fostering a culture of relentless innovation and accountability, making us an exceptional employer for those passionate about AI infrastructure. Our collaborative and supportive environment not only encourages personal growth through tailored progression plans but also offers competitive compensation packages and flexible working arrangements, allowing you to thrive both professionally and personally. Join us in shaping the future of AI technology while enjoying the benefits of a remote-first team that values diversity and inclusivity.

    Contact Detail:

    Nscale Ltd. Recruiting Team

    StudySmarter Expert Advice 🤫

    We think this is how you could land the Infrastructure Operations Manager New US role

    ✨Tip Number 1

    Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

    ✨Tip Number 2

    Prepare for interviews by researching Nscale and its culture. Understand our focus on innovation and operational excellence, and think about how your experience aligns with that. Tailor your responses to show you’re a perfect fit!

    ✨Tip Number 3

    Practice makes perfect! Do mock interviews with friends or use online platforms to get comfortable with common questions. The more you practice, the more confident you'll feel when it’s time to shine.

    ✨Tip Number 4

    Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining our team at Nscale.

    We think you need these skills to ace the Infrastructure Operations Manager New US role

    Site Management
    Infrastructure Management
    Operational Reliability
    HPC Systems
    GPU Systems
    Team Leadership
    Client Communication
    Inventory Management
    Technical Troubleshooting
    Performance Monitoring
    Data Centre Operations
    Environmental Systems Knowledge
    Continuous Learning
    Collaboration with Vendors

    Some tips for your application 🫡

    Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with HPC and GPU environments. We want to see how your skills align with our needs at Nscale, so don’t hold back on showcasing your relevant achievements!

    Showcase Your Leadership Skills: As a potential Team Lead, it’s crucial to demonstrate your leadership experience. Share examples of how you've mentored teams or managed projects in the past. We love seeing candidates who can inspire and guide others!

    Be Clear and Concise: When writing your application, keep it straightforward and to the point. Use bullet points where possible to make it easy for us to read through your qualifications and experiences. We appreciate clarity!

    Apply Through Our Website: We encourage you to submit your application directly through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it’s super easy!

    How to prepare for a job interview at Nscale Ltd.

    ✨Know Your Infrastructure Inside Out

    Make sure you brush up on your knowledge of HPC and GPU systems. Be ready to discuss specific technologies you've worked with, as well as how you've managed power and cooling in data centres. This will show that you're not just familiar with the concepts but have hands-on experience.

    ✨Demonstrate Leadership Skills

    As a Team Lead, you'll need to showcase your leadership abilities. Prepare examples of how you've mentored teams or handled challenging situations. Highlight your approach to building trust and fostering a collaborative environment, as this aligns with Nscale's culture.

    ✨Prepare for Client-Facing Scenarios

    Since you'll be the primary point of contact for clients, think about how you would handle client communications. Prepare to discuss how you've reported on SLAs and KPIs in the past, and be ready to answer questions about managing client expectations.

    ✨Show Your Commitment to Continuous Learning

    Nscale values innovation and staying ahead of trends. Be prepared to talk about how you keep your skills updated, whether through certifications, courses, or personal projects. Mention any relevant certifications you have, like CDCP or CDCS, to further demonstrate your commitment.
