HPC Production Engineer: Scale Compute & Ops in London

HPC Production Engineer: Scale Compute & Ops in London

London Full-Time 60000 - 80000 £ / year (est.) No working from home possible
Jump Trading

At a Glance

  • Tasks: Design and support high-performance computing systems while collaborating on global projects.
  • Company: Join a leading tech company focused on innovative HPC solutions.
  • Benefits: Enjoy private medical insurance, pension schemes, and paid parental leave.
  • Other info: Dynamic role with opportunities for travel and professional growth.
  • Why this job: Make an impact in cutting-edge technology and work with researchers worldwide.
  • Qualifications: 5+ years in HPC, Linux admin, and proficiency in programming languages.

The predicted salary is between 60000 - 80000 £ per year.

What You'll Do

  • Design, implement, maintain, and support high performance compute and storage systems.
  • Implement and support performance monitoring and fault monitoring systems.
  • Monitor systems and storage performance, up to and including network components.
  • Build tooling to compile, package, install, and upgrade software and operating system components at scale.
  • Collaborate with team members and across teams to write code and testing infrastructures spanning both new and existing codebases in multiple programming languages.
  • Develop and improve systems and user documentation.
  • Participate in large, coordinated maintenance operations, including during evenings and weekends.
  • Work on global projects across a wide range of infrastructure.
  • Collaborate directly with researchers to optimize their use of HPC infrastructure.
  • Develop and monitor the tools used to maintain a production computing environment.
  • Provide operational support on a rotating basis and as needed.
  • Manage relationships with outside vendors, including traveling both domestically and internationally to meet with current and potential vendors.
  • Adhere to all company cybersecurity and IT policies, including performing all work using only approved hardware and software.
  • Other duties as assigned or needed.

Skills You’ll Need

  • 5+ years of professional experience in high performance computing (HPC), including parallel filesystems (e.g., Lustre, GPFS), batch systems (e.g., Slurm, Grid Engine), and high-performance network interconnects experience is a plus, but not required.
  • 5+ years of experience with Linux systems administration.
  • High proficiency with at least one programming/scripting language (e.g., Go, Python, C).
  • Extensive experience designing, building, and maintaining complicated, interdependent, and distributed systems.
  • Extensive experience profiling and debugging application stacks (debuggers and profilers).
  • Experience with system configuration management tools (SaltStack, Ansible, Puppet, etc.).
  • A compulsion to perform root cause analysis.
  • Reliable and predictable availability.

Benefits

  • Private Medical, Vision and Dental Insurance.
  • Travel Medical Insurance.
  • Group Pension Scheme.
  • Group Life Assurance and Income Protection Schemes.
  • Paid Parental Leave.
  • Parking and Commuter Benefits.

HPC Production Engineer: Scale Compute & Ops in London employer: Jump Trading

As a leading employer in the high-performance computing sector, we offer an innovative work environment that fosters collaboration and creativity. Our commitment to employee growth is reflected in our comprehensive benefits package, including private medical insurance and generous parental leave, alongside opportunities for professional development through global projects. Located in a vibrant area, our culture prioritises teamwork and support, making it an ideal place for those seeking meaningful and rewarding careers.

Jump Trading

Contact Details:

Jump Trading Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land HPC Production Engineer: Scale Compute & Ops in London

Network Like a Pro

Get out there and connect with folks in the HPC community! Attend meetups, conferences, or even online webinars. Building relationships can open doors to opportunities that aren’t even advertised.

Show Off Your Skills

Don’t just tell them what you can do; show them! Create a portfolio or GitHub repository showcasing your projects, especially those related to high performance computing. This gives potential employers a taste of your coding chops.

Ace the Interview

Prepare for technical interviews by brushing up on your knowledge of parallel filesystems and batch systems. Practice common coding challenges and be ready to discuss your past experiences in detail. Confidence is key!

Apply Through Us!

We’ve got some fantastic roles waiting for you on our website. Don’t hesitate to apply directly through us – it’s the best way to get noticed and land that dream job in HPC!

We think you need these skills to ace HPC Production Engineer: Scale Compute & Ops in London

High Performance Computing (HPC)
Parallel Filesystems (e.g., Lustre, GPFS)
Batch Systems (e.g., Slurm, Grid Engine)
Linux Systems Administration
Programming/Scripting Languages (e.g., Go, Python, C)
System Configuration Management Tools (e.g., SaltStack, Ansible, Puppet)
Profiling and Debugging Application Stacks

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the HPC Production Engineer role. Highlight your experience with high performance computing, Linux systems, and any relevant programming languages. We want to see how your skills match what we're looking for!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about HPC and how your background makes you a great fit for our team. Don’t forget to mention any collaborative projects you've worked on!

Showcase Your Problem-Solving Skills:In your application, give examples of how you've tackled complex issues in previous roles. We love candidates who can perform root cause analysis and come up with innovative solutions, so let us know how you've done this before!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen to join our team at StudySmarter!

How to prepare for a job interview at Jump Trading

Know Your HPC Stuff

Make sure you brush up on your high performance computing knowledge. Be ready to discuss parallel filesystems like Lustre or GPFS, and batch systems such as Slurm. They’ll likely want to know how you’ve tackled complex systems in the past, so have some examples ready!

Show Off Your Coding Skills

Since you'll be collaborating on code across multiple languages, it’s crucial to demonstrate your proficiency in at least one programming language like Go, Python, or C. Bring along a project or two that showcases your coding prowess and problem-solving skills.

Be Ready for Real-World Scenarios

Expect questions about real-world scenarios, especially around system performance monitoring and fault management. Think of specific instances where you’ve had to troubleshoot issues or optimise systems, and be prepared to explain your thought process.

Highlight Your Teamwork

Collaboration is key in this role, so be sure to highlight your experience working with teams. Share examples of how you've worked with researchers or other departments to improve HPC infrastructure, and how you’ve managed relationships with vendors.