HPC Production Engineer: Scale Compute & Ops

HPC Production Engineer: Scale Compute & Ops

Full-Time 60000 - 80000 £ / year (est.) No working from home possible
Jump Trading

At a Glance

  • Tasks: Design and support high-performance compute and storage systems while collaborating on global projects.
  • Company: Leading tech firm focused on innovative HPC solutions.
  • Benefits: Comprehensive health insurance, pension scheme, and paid parental leave.
  • Other info: Opportunity for international travel and excellent career growth.
  • Why this job: Join a dynamic team and optimise cutting-edge technology for real-world impact.
  • Qualifications: 5+ years in HPC, Linux admin, and proficiency in programming languages.

The predicted salary is between 60000 - 80000 £ per year.

What You'll Do

  • Design, implement, maintain, and support high performance compute and storage systems.
  • Implement and support performance monitoring and fault monitoring systems.
  • Monitor systems and storage performance, up to and including network components.
  • Build tooling to compile, package, install, and upgrade software and operating system components at scale.
  • Collaborate with team members and across teams to write code and testing infrastructures spanning both new and existing codebases in multiple programming languages.
  • Develop and improve systems and user documentation.
  • Participate in large, coordinated maintenance operations, including during evenings and weekends.
  • Work on global projects across a wide range of infrastructure.
  • Collaborate directly with researchers to optimize their use of HPC infrastructure.
  • Develop and monitor the tools used to maintain a production computing environment.
  • Provide operational support on a rotating basis and as needed.
  • Manage relationships with outside vendors, including traveling both domestically and internationally to meet with current and potential vendors.
  • Adhere to all company cybersecurity and IT policies, including performing all work using only approved hardware and software.
  • Other duties as assigned or needed.

Skills You’ll Need

  • 5+ years of professional experience in high performance computing (HPC), including parallel filesystems (e.g., Lustre, GPFS), batch systems (e.g., Slurm, Grid Engine), and high-performance network interconnects experience is a plus, but not required.
  • 5+ years of experience with Linux systems administration.
  • High proficiency with at least one programming/scripting language (e.g., Go, Python, C).
  • Extensive experience designing, building, and maintaining complicated, interdependent, and distributed systems.
  • Extensive experience profiling and debugging application stacks (debuggers and profilers).
  • Experience with system configuration management tools (SaltStack, Ansible, Puppet, etc.).
  • A compulsion to perform root cause analysis.
  • Reliable and predictable availability.

Benefits

  • Private Medical, Vision and Dental Insurance.
  • Travel Medical Insurance.
  • Group Pension Scheme.
  • Group Life Assurance and Income Protection Schemes.
  • Paid Parental Leave.
  • Parking and Commuter Benefits.

HPC Production Engineer: Scale Compute & Ops employer: Jump Trading

As a leading employer in the high-performance computing sector, we offer an innovative work environment that fosters collaboration and creativity. Our commitment to employee growth is evident through extensive training opportunities and a supportive culture that values work-life balance, enhanced by comprehensive benefits such as private medical insurance and generous parental leave. Located in a vibrant area, our team enjoys the unique advantage of working on global projects while engaging directly with researchers to drive impactful advancements in technology.

Jump Trading

Contact Details:

Jump Trading Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land HPC Production Engineer: Scale Compute & Ops

Tip Number 1

Network, network, network! Reach out to folks in the HPC community, attend meetups or webinars, and connect with potential colleagues on LinkedIn. You never know who might have the inside scoop on job openings!

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to high performance computing. This gives you a chance to demonstrate your expertise beyond just a CV.

Tip Number 3

Prepare for technical interviews by brushing up on your coding skills and system design principles. Practice common interview questions related to HPC and be ready to discuss your past experiences in detail.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!

We think you need these skills to ace HPC Production Engineer: Scale Compute & Ops

High Performance Computing (HPC)
Parallel Filesystems (e.g., Lustre, GPFS)
Batch Systems (e.g., Slurm, Grid Engine)
Linux Systems Administration
Programming/Scripting Languages (e.g., Go, Python, C)
System Configuration Management Tools (e.g., SaltStack, Ansible, Puppet)
Profiling and Debugging Application Stacks

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the HPC Production Engineer role. Highlight your experience with high performance computing, Linux systems, and any relevant programming languages. We want to see how your skills match what we're looking for!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about HPC and how your background makes you a great fit for our team. Don't forget to mention any collaborative projects you've worked on!

Showcase Your Problem-Solving Skills:In your application, give examples of how you've tackled complex issues in previous roles. We love candidates who can demonstrate their ability to perform root cause analysis and improve systems. Let us know how you’ve made an impact!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy – just follow the prompts!

How to prepare for a job interview at Jump Trading

Know Your HPC Stuff

Make sure you brush up on your high performance computing knowledge. Be ready to discuss parallel filesystems like Lustre or GPFS, and batch systems such as Slurm. They’ll likely want to hear about your hands-on experience, so prepare some examples of how you've tackled challenges in these areas.

Show Off Your Scripting Skills

Since proficiency in programming or scripting languages is key, be prepared to talk about your experience with languages like Go, Python, or C. Maybe even bring a small code snippet or project that showcases your skills. This will help demonstrate your technical prowess and problem-solving abilities.

Collaboration is Key

This role involves a lot of teamwork, so think of examples where you've successfully collaborated with others. Whether it’s working with researchers or managing vendor relationships, showing that you can communicate effectively and work well in a team will set you apart.

Be Ready for Real-World Scenarios

Expect questions that put you in real-world situations, especially around system performance monitoring and fault management. Prepare to discuss how you would approach maintaining a production computing environment and what tools you would use. This shows you’re not just theoretical but practical too!