At a Glance
- Tasks: Manage and optimise high-performance computing systems for cutting-edge research.
- Company: Join Northeastern University London, a prestigious institution in the heart of London.
- Benefits: Enjoy a supportive work/life balance, health and wellness initiatives, and professional growth opportunities.
- Other info: Dynamic environment with rapid growth and exciting career prospects.
- Why this job: Be at the forefront of technology, supporting innovative research and education.
- Qualifications: Bachelor's degree in a computational field or equivalent Linux system administration experience.
The predicted salary is between £40,000 and £50,000 per year.
Manage, monitor, and maintain research computing hardware and software systems, ensuring reliability, performance, scalability, and security across NU London's HPC ecosystem.
Deploy, configure, and maintain HPC workload managers and schedulers (e.g., Slurm, PBS, LSF), including queue configuration, fair-share resource allocation, job monitoring, and performance optimization.
Install, maintain, and optimize GPU software stacks, including NVIDIA drivers, CUDA, cuDNN, NCCL, and GPU-aware MPI libraries.
Develop and maintain automation, configuration management, and infrastructure-as-code solutions to improve reliability and operational efficiency.
Deploy and maintain scientific software environments using tools such as Spack, EasyBuild, Conda, and environment modules.
Design, deploy, configure, and document core services, including cluster resource management and scheduling, high-performance storage and backup systems, data lifecycle management, user lifecycle management, and authentication and authorization frameworks.
Implement and maintain secure access controls for research systems, including identity federation, key-based authentication, and authorization mechanisms, while ensuring compliance with institutional policies, data protection regulations, and research governance requirements.
Diagnose, troubleshoot, and resolve system issues across hardware, software, networking, storage, and distributed computing environments, ensuring a stable and performant research computing platform.
Provide technical support and training to staff and students, supporting skills development and promoting effective use of research computing resources.
Collaborate closely with faculty to understand evolving computational requirements and develop new systems, workflows, and solutions that support both research and teaching.
Work with Northeastern’s Research Computing team to define and deliver short- and long-term strategies for expanding infrastructure, services, and capabilities at NU London.
Write and curate technical documentation, including internal administrative documentation and external user-facing guides.
Communicate progress, risks, and outcomes through regular updates, technical reviews, and strategic discussions with researchers and senior management.
Participate in conferences, workshops, and regional collaborations, contributing to professional development, external partnerships, and funding opportunities while helping expand NU London’s research computing portfolio.
- Bachelor’s degree in a computational field with Linux systems experience, or equivalent professional experience in Linux system administration.
- Experience with or interest in working with automation, configuration management, and infrastructure-as-code tools (e.g., Ansible, Puppet, Chef, Salt).
HPC Systems Administrator employer: Nulondon
Contact Details:
Nulondon Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the HPC Systems Administrator role
✨Tip Number 1
Network like a pro! Reach out to folks in the HPC community, attend meetups or conferences, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects related to HPC systems, automation, or any relevant software stacks. This gives potential employers a tangible look at what you can do.
✨Tip Number 3
Prepare for interviews by brushing up on common technical questions related to HPC systems and Linux administration. Practice explaining your past experiences and how they relate to the role you're applying for—confidence is key!
✨Tip Number 4
Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in being part of the NU London team. Let's get you that job!
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with HPC systems and Linux administration. We want to see how your skills align with the job description, so don’t hold back on showcasing relevant projects!
Show Off Your Technical Skills: When detailing your experience, be specific about the tools and technologies you've used, like Slurm or CUDA. We love seeing candidates who can demonstrate their hands-on experience with automation and configuration management tools.
Keep It Clear and Concise: While we appreciate detail, clarity is key! Use bullet points for easy reading and make sure your application is well-structured. This helps us quickly grasp your qualifications and experience.
Apply Through Our Website: We encourage you to submit your application directly through our website. It’s the best way to ensure it gets into the right hands and helps us keep track of all applications efficiently!
How to prepare for a job interview at Nulondon
✨Know Your HPC Systems
Make sure you brush up on your knowledge of high-performance computing systems. Be ready to discuss your experience with workload managers like Slurm or PBS, and how you've optimised performance in previous roles. This shows you're not just familiar with the tech, but you can also apply it effectively.
✨Showcase Your Automation Skills
Since automation is key for this role, prepare examples of how you've used tools like Ansible or Puppet in past projects. Talk about specific challenges you faced and how your solutions improved operational efficiency. This will demonstrate your hands-on experience and problem-solving abilities.
✨Communicate Clearly
You'll need to collaborate with faculty and provide support to students, so practice explaining complex technical concepts in simple terms. Think of examples where you've successfully communicated technical information to non-technical audiences. This will highlight your ability to bridge the gap between tech and users.
✨Prepare for Scenario Questions
Expect scenario-based questions that test your troubleshooting skills. Think through potential issues you might encounter in a research computing environment and how you'd resolve them. Being able to articulate your thought process will show your analytical skills and readiness for the role.