Senior System Software Engineer, NCCL - Partner Enablement
Senior System Software Engineer, NCCL - Partner Enablement

Senior System Software Engineer, NCCL - Partner Enablement

Full-Time 48000 - 72000 £ / year (est.) Home office (partial)
N

At a Glance

  • Tasks: Engage with partners to solve performance issues and develop automation tools for AI networking.
  • Company: Join NVIDIA, a leader in AI and High Performance Computing innovations.
  • Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
  • Why this job: Be at the forefront of AI technology and make a real impact in the industry.
  • Qualifications: 5+ years in software engineering with strong C/C++ skills and HPC experience.
  • Other info: Collaborative environment with global teams and exciting career advancement opportunities.

The predicted salary is between 48000 - 72000 £ per year.

Overview

NVIDIA is seeking a motivated Partner Enablement Engineer to guide our key partners and customers with NCCL. NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. Most DL/HPC applications run on large clusters with high-speed networking (Infiniband, RoCE, Ethernet). This is an outstanding opportunity to get an end to end understanding of the AI networking stack.

Responsibilities

  • Engage with our partners and customers to root cause functional and performance issues reported with NCCL
  • Conduct performance characterization and analysis of NCCL and DL applications on GPU clusters
  • Develop tools and automation to isolate issues on new systems and platforms, including cloud platforms (Azure, AWS, GCP, etc.)
  • Guide our customers and support teams on HPC knowledge and standard methodologies for running applications on multi-node clusters
  • Document and conduct trainings/webinars for NCCL
  • Engage with internal teams in different time zones on networking, GPUs, storage, infrastructure and support.

Qualifications

  • B.S./M.S. degree in CS/CE or equivalent experience with 5+ years of relevant experience. Experience with parallel programming and at least one communication runtime (MPI, NCCL, UCX, NVSHMEM)
  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design
  • Experience working with engineering or academic research community supporting HPC or AI
  • Practical experience with high performance networking: Infiniband/RoCE/Ethernet networks, RDMA, topologies, congestion control
  • Expert in Linux fundamentals and a scripting language, preferably Python
  • Familiar with containers, cloud provisioning and scheduling tools (Docker, Docker Swarm, Kubernetes, SLURM, Ansible)
  • Adaptability and passion to learn new areas and tools
  • Flexibility to work and communicate effectively across different teams and timezones

Ways To Stand Out From The Crowd

  • Experience conducting performance benchmarking and developing infrastructure on HPC clusters. Prior system administration experience, esp for large clusters. Experience debugging network configuration issues in large scale deployments
  • Familiarity with CUDA programming and/or GPUs. Good understanding of Machine Learning concepts and experience with Deep Learning Frameworks such PyTorch, TensorFlow
  • Deep understanding of technology and passionate about what you do

As an equal opportunity employer, NVIDIA is committed to fostering a supportive and empowering workplace for all.

#J-18808-Ljbffr

Senior System Software Engineer, NCCL - Partner Enablement employer: Nvidia

NVIDIA is an exceptional employer that champions innovation and creativity, providing a dynamic work environment where employees can thrive in the cutting-edge fields of Artificial Intelligence and High Performance Computing. With a strong emphasis on professional development, employees are encouraged to engage in continuous learning and collaboration across diverse teams, ensuring they remain at the forefront of technology. Located in a vibrant tech hub, NVIDIA offers unique advantages such as access to industry-leading resources and a culture that celebrates diversity and inclusion.
N

Contact Detail:

Nvidia Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Senior System Software Engineer, NCCL - Partner Enablement

✨Tip Number 1

Network like a pro! Reach out to current or former employees at NVIDIA on LinkedIn. A friendly chat can give us insider info and might even lead to a referral, which can seriously boost our chances.

✨Tip Number 2

Prepare for the technical interview by brushing up on your C/C++ skills and parallel programming concepts. We should also practice explaining our past projects clearly, especially those involving HPC or AI, as they’ll want to see how we tackle real-world problems.

✨Tip Number 3

Show off our passion for technology! During interviews, let’s share our thoughts on the latest trends in AI and HPC. This not only demonstrates our knowledge but also shows that we’re genuinely excited about the field.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure our application gets seen by the right people. Plus, it shows we’re serious about joining the NVIDIA team.

We think you need these skills to ace Senior System Software Engineer, NCCL - Partner Enablement

NCCL
High Performance Computing (HPC)
Deep Learning (DL) applications
C/C++ programming
Debugging
Profiling
Code optimization
Performance analysis
Parallel programming
MPI
UCX
NVSHMEM
Infiniband
RoCE
Ethernet networking
RDMA
Linux fundamentals
Python scripting
Docker
Kubernetes
SLURM
Ansible
Machine Learning concepts
PyTorch
TensorFlow

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the role of Senior System Software Engineer. Highlight your experience with parallel programming, C/C++, and any relevant HPC or AI projects. We want to see how your skills align with what we're looking for!

Showcase Your Projects: Include specific examples of projects you've worked on that relate to NCCL or high-performance networking. This could be anything from performance benchmarking to developing tools for cloud platforms. Let us see your hands-on experience!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Explain why you're passionate about this role and how your background makes you a great fit. We love seeing enthusiasm and a clear understanding of our work at NVIDIA.

Apply Through Our Website: Don't forget to apply through our website! It’s the best way for us to receive your application and ensures it gets into the right hands. Plus, it shows you're serious about joining our team!

How to prepare for a job interview at Nvidia

✨Know Your NCCL Inside Out

Make sure you have a solid understanding of NCCL and its role in high-performance computing. Brush up on how it interacts with GPUs and the networking stack, as well as common performance issues. Being able to discuss specific examples from your experience will show your expertise.

✨Show Off Your Programming Skills

Prepare to demonstrate your C/C++ programming skills during the interview. Be ready to discuss debugging techniques, profiling, and performance analysis. You might even be asked to solve a coding problem on the spot, so practice some relevant challenges beforehand!

✨Familiarise Yourself with Cloud Platforms

Since the role involves working with cloud platforms like Azure, AWS, and GCP, make sure you understand their basics and how they relate to HPC. Highlight any experience you have with cloud provisioning tools like Docker or Kubernetes, as this will set you apart.

✨Communicate Effectively Across Teams

Given the need to engage with internal teams across different time zones, practice articulating your thoughts clearly and concisely. Prepare examples of how you've successfully collaborated with diverse teams in the past, showcasing your adaptability and communication skills.

Senior System Software Engineer, NCCL - Partner Enablement
Nvidia

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

N
  • Senior System Software Engineer, NCCL - Partner Enablement

    Full-Time
    48000 - 72000 £ / year (est.)

    Application deadline: 2027-09-17

  • N

    Nvidia

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>