At a Glance
- Tasks: Lead a team ensuring top-notch infrastructure support for AI workloads.
- Company: CoreWeave, the essential cloud for AI, trusted by innovators worldwide.
- Benefits: Competitive salary, family-level medical and dental insurance, generous pension contributions.
- Why this job: Join a pioneering company at the forefront of AI technology and innovation.
- Qualifications: 5+ years in infrastructure support and strong Linux skills required.
- Other info: Hybrid work environment with opportunities for professional growth and development.
The predicted salary is between £103,000 and £137,000 per year.
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability.
The Customer Experience (CX) Organization at CoreWeave is dedicated to ensuring every client running AI workloads at scale has a seamless, reliable, and high-performance experience. This team supports the infrastructure that powers the AI revolution, working across data centers, hardware systems, and customer workloads to maintain the integrity of our cloud platform. The CX organization works closely with internal and customer engineering teams, offering valuable insights from the field, with the chance to contribute to the CoreWeave product roadmap and development.
As a Manager of Bare Metal Support Engineering, you will be at the center of ensuring our dedicated infrastructure remains stable, reliable, and performant. You will lead daily support operations, triage incidents, drive escalations, and ensure that hardware is monitored, maintained, and delivered effectively for our clients. You will oversee a team of experienced Systems Operations Engineers and help build a new team focused on our Bare Metal support model. This role balances tactical execution with operational maturity, working cross-functionally with engineering, product, and infrastructure teams to scale processes as we grow.
In This Role, You Will:
- Lead a skilled team responsible for maintaining and optimizing physical infrastructure across multiple client environments.
- Build, develop, and lead a dedicated Infrastructure Support team focused on supporting key infrastructure, handling escalations, and ensuring smooth hardware operations.
- Oversee the resolution of infrastructure-related incidents and escalations, and collaborate with internal teams to deliver effective solutions.
- Improve support processes to enhance efficiency and reduce downtime, ensuring the infrastructure meets client expectations.
- Work closely with product, infrastructure, and other teams to ensure seamless delivery of infrastructure resources.
- Manage client communication during escalations and issue resolution to ensure transparency and client satisfaction.
- Mentor team members, developing their skills to manage and maintain critical infrastructure effectively.
Who You Are:
- 5+ years of experience leading teams responsible for infrastructure support, data center operations, or physical compute environments.
- Hands-on experience with Linux system administration and command-line tools.
- Familiarity with hardware-level diagnostics, troubleshooting, and replacement (servers, power, cabling, etc.).
- Experience working with high-performance rack-scale hardware, including CPU and GPU-based compute nodes.
- Understanding of GPU infrastructure (e.g., NVIDIA A100/H100s, PCIe/NVLink, liquid cooling) or a demonstrated ability to quickly learn and adapt to HPC environments.
- Proven track record in incident and escalation management, with direct ownership of client or production-impacting issues.
- Experience managing ticket-based workflows (Jira, Zendesk, etc.) in a high-urgency technical environment.
- Comfortable interpreting and acting on metrics (MTTR, SLOs, backlog, ticket trends) to drive operational improvements.
- Skilled in managing scheduling, shift coverage, and team logistics in 24/7 or hybrid support models.
- Willing to travel up to 30% annually.
Preferred:
- Experience managing infrastructure support teams in high-growth or rapidly evolving environments.
- Proven ability to develop and implement operational processes that scale with business needs.
- Strong familiarity with server and GPU hardware lifecycle management: deployment, maintenance, thermal/power concerns, RMA coordination, and decommissioning.
- Demonstrated success in coaching and growing technical teams through training, mentorship, and performance development.
- Skilled in both developing and interpreting metrics to drive accountability, continuous improvement, and executive visibility.
- Familiarity with AI/ML workloads, cluster utilization patterns, or the infrastructure needs of GPU-heavy clients is a plus.
Wondering if you’re a good fit? You may be if you:
- Have a track record of improving infrastructure reliability through clear processes and team accountability.
- Think critically about how to scale operations without overengineering.
- Care about delivering for customers—but know when to hold the line to protect the team and long-term goals.
- Communicate clearly, especially under pressure.
- Are comfortable getting close to the work, but know when to step back and lead.
- Thrive in fast-paced environments where priorities can shift quickly.
The base salary range for this role is £103,000 to £137,000 GBP. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
CoreWeave is an equal-opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
Manager, Bare Metal Support Engineering in London, employer: CoreWeave
Contact Detail:
CoreWeave Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Manager, Bare Metal Support Engineering role in London
✨Tip Number 1
Network like a pro! Reach out to folks in your industry, especially those at CoreWeave. A friendly chat can open doors and give you insights that a job description just can't.
✨Tip Number 2
Prepare for the interview by diving deep into CoreWeave's tech and culture. Show that you’re not just another candidate; demonstrate your passion for AI and how you can contribute to the company's mission.
✨Tip Number 3
Practice your problem-solving skills! We love candidates who can think on their feet. Be ready to tackle some technical scenarios or case studies during your interview.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets the attention it deserves. Plus, we love seeing candidates who take that extra step.
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter for the Manager, Bare Metal Support Engineering role. Highlight your experience with infrastructure support and team leadership, as these are key aspects of the job.
Showcase Your Technical Skills: Don’t forget to mention your hands-on experience with Linux system administration and hardware diagnostics. We want to see how your technical expertise aligns with our needs in supporting AI workloads.
Be Clear and Concise: When writing your application, keep it straightforward. Use clear language to describe your past experiences and achievements, especially those related to incident management and operational improvements.
Apply Through Our Website: We encourage you to submit your application through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it’s super easy!
How to prepare for a job interview at CoreWeave
✨Know Your Stuff
Make sure you brush up on your knowledge of Linux system administration and hardware diagnostics. CoreWeave is looking for someone who can handle high-performance rack-scale hardware, so be ready to discuss your hands-on experience with servers and GPUs.
✨Show Your Leadership Skills
As a Manager, you'll need to demonstrate your ability to lead a team effectively. Prepare examples of how you've mentored team members or improved processes in previous roles. Highlight your experience in incident management and how you've handled escalations.
✨Understand the Client Perspective
CoreWeave values client satisfaction, so be prepared to talk about how you've managed client communications during critical incidents. Share specific instances where you ensured transparency and resolved issues to keep clients happy.
✨Be Ready for Technical Questions
Expect some technical questions related to infrastructure support and operational processes. Brush up on metrics like MTTR and SLOs, and be ready to discuss how you've used these to drive improvements in past roles.