Lead AI Inference Engineer 100% Remote
Lead AI Inference Engineer 100% Remote

Lead AI Inference Engineer 100% Remote

Full-Time 80000 - 100000 £ / year (est.) Home office possible
F

At a Glance

  • Tasks: Lead a team to deploy cutting-edge AI models on edge devices and enhance existing products.
  • Company: Join a forward-thinking tech company focused on innovative AI solutions.
  • Benefits: Enjoy a fully remote role with competitive pay and opportunities for growth.
  • Other info: Collaborative environment with a focus on personal and professional development.
  • Why this job: Make a real impact in the AI field while working with the latest technologies.
  • Qualifications: Strong C++ skills and experience with AI inference engines required.

The predicted salary is between 80000 - 100000 £ per year.

You'll lead a cross-functional pod that spans the full stack, from C++ inference engines to JavaScript applications. Your responsibility is to ensure that local AI capabilities ship reliably and perform well across devices. You'll balance hands‑on technical work with team coordination, guiding foundation and middleware engineers toward shared goals. This role is ideal for someone who understands both the low-level challenges of edge AI and the product-facing needs of app developers, and wants to drive the delivery of cohesive, production‑ready local AI systems.

Responsibilities

  • Work on deploying machine learning models to edge devices using the frameworks llama.cpp, ggml, onnx.
  • Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning.
  • Manage a cross‑functional team (pod) made of middleware (JS), foundation (C++), QA and documentation engineers to produce high quality deliverables.
  • Regularly assess, both qualitatively and quantitatively, our position in the market with regards to similar products or platforms.
  • Leverage the expertise of technical architects to ensure robust architectural choices and code quality.
  • Ensure stable releases by following precise internal release processes.

Qualifications

  • Excellent programming skills in C++.
  • Strong experience with Llama.cpp and ggml inference engines, facilitating deployment of models to specific GPU architectures.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers, LLMs, Diffusion Models.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • Experience managing a small, specialized, cross‑functional team (pod) of 3–5 people.
  • Genuine passion for building good products that improve people's lives.
  • Degree in Computer Science, AI, Machine Learning, or related field, complemented by a solid track record in AI R&D.

Bonus points if

  • Extensive experience with Javascript/Typescript.
  • Understanding of difficulties, nuances and importance of p2p technology.
  • Experience with Vulkan, Metal and OpenCL.
  • Experience productionizing models.

Lead AI Inference Engineer 100% Remote employer: Framework Ventures

As a Lead AI Inference Engineer at our innovative company, you'll thrive in a fully remote environment that champions collaboration and creativity. We offer a dynamic work culture that prioritises employee growth through continuous learning opportunities and the chance to lead a talented cross-functional team. Join us to make a meaningful impact in the AI space while enjoying the flexibility and support that comes with working for a forward-thinking organisation.
F

Contact Detail:

Framework Ventures Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Lead AI Inference Engineer 100% Remote

✨Tip Number 1

Network like a pro! Reach out to folks in the industry on LinkedIn or at meetups. A friendly chat can open doors that a CV just can't.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those involving C++ and AI. This gives potential employers a taste of what you can do.

✨Tip Number 3

Prepare for interviews by practising common technical questions and scenarios related to AI inference. We recommend doing mock interviews with friends or using online platforms to get comfortable.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive!

We think you need these skills to ace Lead AI Inference Engineer 100% Remote

C++ Programming
JavaScript
Llama.cpp
ggml Inference Engines
Machine Learning Model Deployment
Deep Learning Concepts
Transformers
LLMs
Diffusion Models
Team Management
Cross-Functional Collaboration
Technical Architecture
Release Management
Adaptability to New Technologies
Product Development

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience with C++ and AI inference engines like Llama.cpp and ggml. We want to see how your skills align with the role, so don’t be shy about showcasing relevant projects!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Tell us why you’re passionate about AI and how your background makes you the perfect fit for leading our cross-functional pod. Keep it engaging and personal!

Showcase Team Management Skills: Since this role involves managing a small team, highlight any previous experience you have in leading cross-functional teams. We love to see examples of how you've guided teams towards shared goals!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy!

How to prepare for a job interview at Framework Ventures

✨Know Your Tech Inside Out

Make sure you brush up on your C++ skills and get familiar with Llama.cpp and ggml inference engines. Be ready to discuss how you've deployed machine learning models to edge devices in the past, as this will show your hands-on experience and technical prowess.

✨Showcase Your Team Leadership Skills

Since you'll be managing a cross-functional team, prepare examples of how you've successfully led teams before. Highlight your ability to coordinate between different roles, like middleware and foundation engineers, and how you’ve driven projects to completion.

✨Understand the Product Landscape

Research the company’s products and their position in the market. Be prepared to discuss how you would assess and improve their offerings compared to competitors. This shows that you’re not just technically savvy but also understand the business side of things.

✨Be Ready for Technical Challenges

Expect some technical questions or challenges during the interview. Practice explaining deep learning concepts and model architectures clearly, as well as discussing any experience you have with transformers and LLMs. This will demonstrate your depth of knowledge and problem-solving skills.

Lead AI Inference Engineer 100% Remote
Framework Ventures

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>