AI Inference Engineer (London)
AI Inference Engineer (London)

AI Inference Engineer (London)

London Full-Time 108000 - 144000 Β£ / year (est.) No home office possible
Go Premium
P

We are looking for an AI Inference Engineer to join our growing team.

Our current stack includes Python, C++, TensorRT-LLM, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities

  1. Develop APIs for AI inference used by internal and external customers.
  2. Benchmark and address bottlenecks in our inference stack.
  3. Improve system reliability and observability; respond to outages.
  4. Explore research and implement optimizations for LLM inference.

Qualifications

  1. Experience with ML systems and deep learning frameworks (e.g., PyTorch, TensorFlow, ONNX).
  2. Familiarity with LLM architectures and inference optimization techniques (e.g., batching, quantization).
  3. Experience deploying reliable, distributed, real-time model serving at scale.
  4. (Optional) Understanding of GPU architectures or experience with CUDA kernel programming.

The cash compensation range for this role is $190,000 – $240,000.

About Perplexity

Since launching the world\’s first fully functional conversational answer engine over a year ago, we\’ve experienced tremendous growth. Our AI-powered search assistant has 10 million monthly active users as of early 2024, with over 1 million app installations across iOS and Android. In 2023, we served over 500 million queries globally.

We have raised significant funding, including a $73.6 million Series B in January 2024 led by IVP with participation from NVIDIA, Jeff Bezos\’ fund, NEA, Databricks, and others. We also completed a $62.7 million Series B1 in April 2024, valuing Perplexity at over $1 billion.

Our investor base includes IVP, NEA, NVIDIA, Jeff Bezos, Databricks, Bessemer Venture Partners, and prominent individuals like Elad Gil, Nat Friedman, Naval Ravikant, and Tobi Lutke.

Additional Information

Final offer amounts depend on experience and expertise and may vary from listed ranges.

Compensation includes base salary and equity.

Benefits include comprehensive health, dental, and vision insurance, and a 401(k) plan.

#J-18808-Ljbffr

P

Contact Detail:

Perplexity AI Recruiting Team

AI Inference Engineer (London)
Perplexity AI
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

P
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>