At a Glance
- Tasks: Optimise performance for inference platforms and manage multi-GPU environments.
- Company: Join a forward-thinking tech company in London with a focus on innovation.
- Benefits: Competitive salary, equity options, and an inclusive workplace culture.
- Other info: Exciting opportunity for growth in a dynamic and supportive environment.
- Why this job: Make a real impact by enhancing system efficiency in cutting-edge technology.
- Qualifications: Expertise in LLM inference, distributed GPU workloads, C++, and CUDA required.
The predicted salary is between 60000 - 80000 Β£ per year.
jobr.pro is seeking a skilled engineer in London to oversee performance for inference platforms. You will be responsible for optimising systems across multi-GPU environments and managing memory allocation to maximise efficiency. The role requires deep expertise in LLM inference and distributed GPU workloads, alongside proficiency in languages such as C++ and CUDA. Competitive salary, equity options, and a commitment to an inclusive workplace are part of the offer.
Staff Inference Systems & Performance Engineer employer: jobr.pro
At jobr.pro, we pride ourselves on being an excellent employer by fostering a collaborative and inclusive work culture in the heart of London. Our commitment to employee growth is reflected in our competitive salary packages, equity options, and opportunities for professional development, making it an ideal environment for skilled engineers looking to make a meaningful impact in the field of inference systems.