Staff Inference Systems & Performance Engineer

Full-Time | £60,000 - £80,000 / year (est.) | No working from home possible

At a Glance

  • Tasks: Optimise performance for inference platforms and manage multi-GPU environments.
  • Company: Join a forward-thinking tech company in London with a focus on innovation.
  • Benefits: Competitive salary, equity options, and an inclusive workplace culture.
  • Other info: Exciting opportunity for growth in a dynamic and supportive environment.
  • Why this job: Make a real impact by enhancing system efficiency in cutting-edge technology.
  • Qualifications: Expertise in LLM inference, distributed GPU workloads, C++, and CUDA required.

The predicted salary is between £60,000 and £80,000 per year.

jobr.pro is seeking a skilled engineer in London to oversee performance for inference platforms. You will be responsible for optimising systems across multi-GPU environments and managing memory allocation to maximise efficiency. The role requires deep expertise in LLM inference and distributed GPU workloads, alongside proficiency in languages such as C++ and CUDA. Competitive salary, equity options, and a commitment to an inclusive workplace are part of the offer.

Staff Inference Systems & Performance Engineer employer: jobr.pro

At jobr.pro, we pride ourselves on being an excellent employer by fostering a collaborative and inclusive work culture in the heart of London. Our commitment to employee growth is reflected in our competitive salary packages, equity options, and opportunities for professional development, making it an ideal environment for skilled engineers looking to make a meaningful impact in the field of inference systems.

Contact Details:

jobr.pro Recruitment Team

Skills we think you need to ace the Staff Inference Systems & Performance Engineer role

Python
Problem-Solving Skills
SQL
Data Engineering
ETL/ELT Processes
Data Pipeline Development
API Integration