At a Glance
- Tasks: Deploy and troubleshoot high-bandwidth GPU interconnect platforms in global data centres.
- Company: Join CoreWeave, a leader in innovative tech solutions.
- Benefits: Competitive salary, bonuses, health insurance, and tuition reimbursement.
- Other info: Inclusive workplace with opportunities for career advancement.
- Why this job: Make an impact in cutting-edge technology while growing your skills.
- Qualifications: Strong Linux skills and a passion for troubleshooting and automation.
The predicted salary is between £79,000 and £105,000 per year.
We are seeking an HPC Engineer to deploy, operate, troubleshoot, and improve high‑bandwidth GPU interconnect platforms across our global data center footprint.
What You Will Do
- Deploy, operate, and support NVLink/NVSwitch platforms across large data center environments.
- Troubleshoot Linux, networking, hardware, firmware, performance, and stability issues in production.
- Build automation and improve runbooks, dashboards, alerts, and lifecycle workflows.
- Collaborate with teams across CoreWeave, external vendors, and customer-facing stakeholders.
- Drive assigned work to completion with clear communication, thoughtful prioritization, and early visibility into risks or blockers.
- Participate in on‑call, incident response, root cause analysis, and follow‑up improvements.
- Contribute to reliable workflows that scale across regions, platforms, and fleet growth, with ownership calibrated by level.
What We Are Looking For
- Strong Linux system administration and troubleshooting skills.
- Networking fundamentals and common troubleshooting tools.
- Production debugging experience using logs, metrics, and command‑line tools.
- Server, network, GPU, or data center hardware troubleshooting experience.
- Practical scripting or automation experience in Python, Go, Bash, or similar.
- Clear communication, documentation, collaboration, and on‑call readiness.
- Curiosity to learn specialized GPU interconnect technologies such as NVLink, NVSwitch, and InfiniBand.
Preferred Qualifications
- Ansible or other infrastructure automation tooling.
- Kubernetes application development or operations experience.
- Grafana, Prometheus, PromQL, or similar observability systems.
- Large fleet operations across Linux systems, network devices, GPUs, or infrastructure components.
- InfiniBand, RDMA, HPC networking, or low‑latency/high‑bandwidth fabrics.
- BMC, Redfish, IPMI, firmware lifecycle management, or hardware management APIs.
- NVLink, NVSwitch, NVIDIA GPU platforms, NVUE, SONiC, or network operating systems.
What We Offer
- Competitive salary: £79,000 to £105,000.
- Discretionary bonus and equity awards.
- Family‑level Medical and Dental Insurance.
- Generous Pension Contribution.
- Life Assurance at 4x Salary.
- Critical Illness Cover.
- Employee Assistance Programme.
- Tuition Reimbursement.
- Work culture focused on innovative disruption.
Equal Opportunity
CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
Export Control Compliance
This position requires access to export-controlled information. To conform to U.S. Government export regulations applicable to that information, the applicant must be one of the following:
- A U.S. person, defined as a U.S. citizen or national, lawful permanent resident (green card holder), refugee, or asylee.
- Eligible to access the export-controlled information without a required export authorization.
- Eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.
CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
HPC Engineer, Metal Net in London employer: CoreWeave
CoreWeave is an exceptional employer for HPC Engineers, offering a dynamic work culture that prioritises innovative disruption and collaboration across global data centres. With competitive salaries, generous benefits including family-level medical insurance, pension contributions, and tuition reimbursement, employees are supported in their professional growth while working with cutting-edge GPU interconnect technologies. The inclusive environment fosters a sense of belonging, making it an ideal place for those seeking meaningful and rewarding employment.
StudySmarter Expert Advice🤫
We think this is how you could land the HPC Engineer, Metal Net role in London
✨Tip Number 1
Network like a pro! Attend industry meetups, webinars, or online forums related to HPC and GPU technologies. Engaging with professionals in the field can open doors and give you insights that might just land you that interview.
✨Tip Number 2
Show off your skills! Create a GitHub repository showcasing your automation scripts or any projects you've worked on. This not only demonstrates your technical abilities but also your passion for continuous learning and improvement.
✨Tip Number 3
Prepare for those tricky interviews! Brush up on your Linux troubleshooting skills and be ready to discuss your experience with NVLink or similar technologies. Practising common interview questions can help you articulate your thoughts clearly.
✨Tip Number 4
Don’t forget to apply through our website! We love seeing candidates who are genuinely interested in joining us at CoreWeave. Tailor your application to highlight your relevant experience and how you can contribute to our innovative culture.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your Linux system administration skills and any experience with GPU interconnect technologies. We want to see how your background aligns with the HPC Engineer role, so don’t be shy about showcasing relevant projects!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re excited about the role and how your skills in troubleshooting and automation can benefit us at CoreWeave. Keep it concise but impactful!
Show Off Your Communication Skills: Since clear communication is key for this role, make sure your application reflects that. Whether it’s through your CV, cover letter, or any additional documentation, we want to see how you articulate your thoughts and ideas.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you’re genuinely interested in joining our team!
How to prepare for a job interview at CoreWeave
✨Know Your Tech Inside Out
Make sure you brush up on your Linux system administration skills and be ready to discuss troubleshooting techniques. Familiarise yourself with NVLink, NVSwitch, and other GPU interconnect technologies, as these will likely come up during the interview.
✨Show Off Your Scripting Skills
Be prepared to talk about your experience with automation and scripting in Python, Go, or Bash. Have examples ready that demonstrate how you've built automation or improved workflows in previous roles.
✨Communicate Clearly
Since collaboration is key in this role, practice articulating your thoughts clearly. Think about how you can convey complex technical issues simply and effectively, especially when discussing past experiences with incident response or root cause analysis.
✨Prepare for Real-World Scenarios
Expect to tackle some practical problems during the interview. Brush up on your debugging skills and be ready to walk through how you would approach troubleshooting a performance issue in a data centre environment.