HPC Engineer, Metal Net

Full-Time, £79,000 - £105,000 / year (est.), no working from home possible
CoreWeave

At a Glance

  • Tasks: Deploy and troubleshoot high-bandwidth GPU interconnect platforms in global data centres.
  • Company: CoreWeave, a cloud provider specialising in GPU infrastructure.
  • Benefits: Competitive salary, bonuses, medical insurance, pension contributions, and tuition reimbursement.
  • Other info: Inclusive workplace with opportunities for growth and learning.
  • Why this job: Join a cutting-edge team and work with advanced GPU technologies.
  • Qualifications: Strong Linux skills, networking knowledge, and scripting experience required.

The predicted salary is between £79,000 and £105,000 per year.

We are seeking an HPC Engineer to deploy, operate, troubleshoot, and improve high‑bandwidth GPU interconnect platforms across our global data center footprint.

What You Will Do

  • Deploy, operate, and support NVLink/NVSwitch platforms across large data center environments.
  • Troubleshoot Linux, networking, hardware, firmware, performance, and stability issues in production.
  • Build automation and improve runbooks, dashboards, alerts, and lifecycle workflows.
  • Collaborate with teams across CoreWeave, external vendors, and customer-facing stakeholders.
  • Drive assigned work to completion with clear communication, thoughtful prioritization, and early visibility into risks or blockers.
  • Participate in on‑call, incident response, root cause analysis, and follow‑up improvements.
  • Contribute to reliable workflows that scale across regions, platforms, and fleet growth, with ownership calibrated by level.

What We Are Looking For

  • Strong Linux system administration and troubleshooting skills.
  • Networking fundamentals and common troubleshooting tools.
  • Production debugging experience using logs, metrics, and command‑line tools.
  • Server, network, GPU, or data center hardware troubleshooting experience.
  • Practical scripting or automation experience in Python, Go, Bash, or similar.
  • Clear communication, documentation, collaboration, and on‑call readiness.
  • Curiosity to learn specialized GPU interconnect technologies such as NVLink, NVSwitch, and InfiniBand.

Preferred Qualifications

  • Ansible or other infrastructure automation tooling.
  • Kubernetes application development or operations experience.
  • Grafana, Prometheus, PromQL, or similar observability systems.
  • Large fleet operations across Linux systems, network devices, GPUs, or infrastructure components.
  • InfiniBand, RDMA, HPC networking, or low‑latency/high‑bandwidth fabrics.
  • BMC, Redfish, IPMI, firmware lifecycle management, or hardware management APIs.
  • NVLink, NVSwitch, NVIDIA GPU platforms, NVUE, SONiC, or network operating systems.

What We Offer

  • Competitive salary: £79,000 to £105,000.
  • Discretionary bonus and equity awards.
  • Family‑level Medical and Dental Insurance.
  • Generous Pension Contribution.
  • Life Assurance at 4x Salary.
  • Critical Illness Cover.
  • Employee Assistance Programme.
  • Tuition Reimbursement.
  • Work culture focused on innovative disruption.

Equal Opportunity

CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

Export Control Compliance

This position requires access to export-controlled information. To conform to U.S. Government export regulations applicable to that information, an applicant must be one of the following:

  • A U.S. person, defined as a U.S. citizen or national, lawful permanent resident (green card holder), refugee, or asylee.
  • Eligible to access the export-controlled information without a required export authorization.
  • Eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.

CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.

HPC Engineer, Metal Net employer: CoreWeave

CoreWeave is an exceptional employer for HPC Engineers, offering a dynamic work culture that prioritises innovative disruption and collaboration across global data centres. With competitive salaries, generous benefits including family-level medical insurance and a robust pension contribution, employees are supported in their professional growth and well-being. The inclusive environment fosters continuous learning, particularly in cutting-edge GPU interconnect technologies, making it a rewarding place for those looking to advance their careers in high-performance computing.

Contact Details:

CoreWeave Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land HPC Engineer, Metal Net

Tip Number 1

Network, network, network! Reach out to folks in the industry, especially those at CoreWeave. Use LinkedIn to connect and engage with them. A friendly chat can sometimes lead to opportunities that aren’t even advertised!

Tip Number 2

Show off your skills! If you’ve got experience with NVLink or GPU interconnects, create a project or a blog post about it. Share it on social media or relevant forums. This not only showcases your expertise but also gets you noticed by potential employers.

Tip Number 3

Prepare for interviews like a pro! Research common questions for HPC Engineers and practice your responses. Don’t forget to have examples ready that highlight your troubleshooting skills and teamwork experiences. We want to see how you tackle challenges!

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining our team. Don’t hesitate – get your application in and let’s make some tech magic happen together!

We think you need these skills to ace HPC Engineer, Metal Net

Linux System Administration
Troubleshooting Skills
Networking Fundamentals
Production Debugging
Scripting or Automation in Python, Go, Bash
Clear Communication
Documentation Skills

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your Linux system administration skills and any experience with GPU interconnect technologies. We want to see how your background aligns with the HPC Engineer role, so don’t be shy about showcasing relevant projects or achievements!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re excited about the role and how your skills can contribute to our team. We love seeing genuine enthusiasm, so let your personality come through while keeping it professional.

Showcase Your Troubleshooting Skills: In your application, mention specific examples of how you've tackled troubleshooting challenges in production environments. We’re looking for clear communication and problem-solving abilities, so share those experiences that demonstrate your expertise!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about our culture and values!

How to prepare for a job interview at CoreWeave

Know Your Tech Inside Out

Make sure you brush up on your Linux system administration skills and be ready to discuss troubleshooting techniques. Familiarise yourself with NVLink, NVSwitch, and other GPU interconnect technologies, as these will likely come up during the interview.

Show Off Your Scripting Skills

Be prepared to talk about your experience with scripting or automation in Python, Go, or Bash. Have examples ready that demonstrate how you've built automation or improved workflows in previous roles.

Communicate Clearly

Since clear communication is key, practice explaining complex technical concepts in simple terms. This will help you convey your ideas effectively and show that you can collaborate well with teams and stakeholders.

Prepare for Real-World Scenarios

Think of specific instances where you've troubleshot hardware or networking issues in a production environment. Be ready to walk through your thought process and the steps you took to resolve the problems, as this will showcase your practical experience.