Staff Inference Systems & Performance Engineer

Job Board

Companies

jobr.pro

Staff Inference Systems & Performance Engineer

Full-Time 60000 - 80000 £ / year (est.) No working from home possible

At a Glance

Tasks: Optimise performance for inference platforms and manage multi-GPU environments.
Company: Join a forward-thinking tech company in London with a focus on innovation.
Benefits: Competitive salary, equity options, and an inclusive workplace culture.
Other info: Exciting opportunity for growth in a dynamic and supportive environment.
Why this job: Make a real impact by enhancing system efficiency in cutting-edge technology.
Qualifications: Expertise in LLM inference, distributed GPU workloads, C++, and CUDA required.

The predicted salary is between 60000 - 80000 £ per year.

jobr.pro is seeking a skilled engineer in London to oversee performance for inference platforms. You will be responsible for optimising systems across multi-GPU environments and managing memory allocation to maximise efficiency. The role requires deep expertise in LLM inference and distributed GPU workloads, alongside proficiency in languages such as C++ and CUDA. Competitive salary, equity options, and a commitment to an inclusive workplace are part of the offer.