Senior LLM Inference Systems Engineer in London

London | Full-Time | 70000 - 90000 € / year (est.) | No home office possible

At a Glance

  • Tasks: Develop and optimise cutting-edge LLM inference technology for AI applications.
  • Company: Leading AI tech company based in Greater London with a focus on innovation.
  • Benefits: Competitive compensation and opportunities to tackle challenging AI problems.
  • Other info: Dynamic work environment with potential for significant career advancement.
  • Why this job: Join a team solving complex AI challenges and push the boundaries of technology.
  • Qualifications: Deep understanding of inference workloads, GPU architectures, and experience with PyTorch and TensorRT.

The predicted salary is between 70000 and 90000 € per year.

A leading AI technology company in Greater London is seeking a Senior Research Engineer to develop cutting-edge LLM inference technology. Candidates will work on optimising infrastructure for batch inference workloads and enhancing inference engines in memory-constrained environments.

Ideal candidates will possess a deep understanding of inference workloads and GPU architectures, along with familiarity with tools such as PyTorch and TensorRT. The role offers competitive compensation and is aimed at solving challenging AI problems.

Employer: Doubleword

As a leading AI technology company in Greater London, we pride ourselves on fostering a dynamic work culture that encourages innovation and collaboration. Our employees benefit from competitive compensation, comprehensive growth opportunities, and the chance to tackle some of the most challenging problems in AI, all while working in a vibrant city known for its tech advancements.

Contact Details:

Doubleword Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Senior LLM Inference Systems Engineer role in London

Tip Number 1

Network like a pro! Reach out to folks in the AI community, especially those working with LLMs and inference systems. Attend meetups or webinars to make connections that could lead to job opportunities.

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects related to GPU architectures and inference workloads. This will give potential employers a taste of what you can bring to the table.

Tip Number 3

Prepare for technical interviews by brushing up on your knowledge of PyTorch and TensorRT. Practice coding challenges and system design questions that focus on optimising inference engines.

Tip Number 4

Don’t forget to apply through our website! We’ve got loads of exciting roles, and applying directly can sometimes give you an edge. Plus, it’s super easy to keep track of your applications!

We think you need these skills to ace the Senior LLM Inference Systems Engineer role in London

LLM Inference Technology
Batch Inference Workloads
Inference Engines
Memory-Constrained Environments
Inference Workloads
GPU Architectures
PyTorch

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience with LLM inference technology and GPU architectures. We want to see how your skills align with the role, so don’t be shy about showcasing relevant projects or tools like PyTorch and TensorRT.

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about AI and how your background makes you a perfect fit for our team. We love seeing enthusiasm and a clear understanding of the challenges we tackle.

Showcase Problem-Solving Skills: In your application, highlight specific examples where you've solved complex problems, especially in memory-constrained environments. We’re all about tackling tough challenges, so let us know how you’ve done it before!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about what we do at StudySmarter!

How to prepare for a job interview at Doubleword

Know Your Tech Inside Out

Make sure you have a solid grasp of LLM inference technology and GPU architectures. Brush up on your knowledge of PyTorch and TensorRT, as these tools are likely to come up in conversation. Being able to discuss specific projects or experiences where you've used these technologies will really impress the interviewers.

Showcase Problem-Solving Skills

Prepare to discuss how you've tackled challenging AI problems in the past. Think of examples where you optimised infrastructure for batch inference workloads or enhanced inference engines. This will demonstrate your practical experience and ability to think critically under pressure.

Understand the Company’s Vision

Research the company’s goals and recent projects in AI technology. Understanding their mission will help you align your answers with what they value. It also shows that you're genuinely interested in the role and the company, which can set you apart from other candidates.

Ask Insightful Questions

Prepare thoughtful questions about the team, projects, and challenges they face. This not only shows your enthusiasm but also helps you gauge if the company is the right fit for you. Questions about their approach to memory-constrained environments or future developments in LLM technology can spark engaging discussions.