At a Glance
- Tasks: Lead a team to deploy cutting-edge AI models on edge devices and enhance existing products.
- Company: Join a forward-thinking tech company focused on innovative AI solutions.
- Benefits: Enjoy a fully remote role with competitive pay and opportunities for growth.
- Other info: Collaborative environment with a focus on personal and professional development.
- Why this job: Make a real impact in the AI field while working with the latest technologies.
- Qualifications: Strong C++ skills and experience with AI inference engines required.
The predicted salary is between £80,000 and £100,000 per year.
You'll lead a cross-functional pod that spans the full stack, from C++ inference engines to JavaScript applications. Your responsibility is to ensure that local AI capabilities ship reliably and perform well across devices. You'll balance hands‑on technical work with team coordination, guiding foundation and middleware engineers toward shared goals. This role is ideal for someone who understands both the low-level challenges of edge AI and the product-facing needs of app developers, and wants to drive the delivery of cohesive, production‑ready local AI systems.
Responsibilities
- Deploy machine learning models to edge devices using the llama.cpp, ggml, and ONNX frameworks.
- Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
- Integrate AI features into existing products, enriching them with the latest advancements in machine learning.
- Manage a cross‑functional team (pod) made up of middleware (JS), foundation (C++), QA, and documentation engineers to produce high-quality deliverables.
- Regularly assess, both qualitatively and quantitatively, our position in the market relative to similar products or platforms.
- Leverage the expertise of technical architects to ensure robust architectural choices and code quality.
- Ensure stable releases by following precise internal release processes.
Qualifications
- Excellent programming skills in C++.
- Strong experience with the llama.cpp and ggml inference engines, including deploying models to specific GPU architectures.
- Good understanding of deep learning concepts and model architectures.
- Experience with transformers, LLMs, and diffusion models.
- Demonstrated ability to rapidly assimilate new technologies and techniques.
- Experience managing a small, specialized, cross‑functional team (pod) of 3–5 people.
- Genuine passion for building good products that improve people's lives.
- Degree in Computer Science, AI, Machine Learning, or related field, complemented by a solid track record in AI R&D.
Bonus points for
- Extensive experience with JavaScript/TypeScript.
- Understanding of the difficulties, nuances, and importance of P2P technology.
- Experience with Vulkan, Metal and OpenCL.
- Experience productionizing models.
Lead AI Inference Engineer (100% Remote). Employer: Framework Ventures
Contact Detail:
Framework Ventures Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Lead AI Inference Engineer (100% Remote) role
✨Tip Number 1
Network like a pro! Reach out to folks in the industry on LinkedIn or at meetups. A friendly chat can open doors that a CV just can't.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those involving C++ and AI. This gives potential employers a taste of what you can do.
✨Tip Number 3
Prepare for interviews by practising common technical questions and scenarios related to AI inference. We recommend doing mock interviews with friends or using online platforms to get comfortable.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive!
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with C++ and AI inference engines like llama.cpp and ggml. We want to see how your skills align with the role, so don’t be shy about showcasing relevant projects!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Tell us why you’re passionate about AI and how your background makes you the perfect fit for leading our cross-functional pod. Keep it engaging and personal!
Showcase Team Management Skills: Since this role involves managing a small team, highlight any previous experience you have in leading cross-functional teams. We love to see examples of how you've guided teams towards shared goals!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy!
How to prepare for a job interview at Framework Ventures
✨Know Your Tech Inside Out
Make sure you brush up on your C++ skills and get familiar with the llama.cpp and ggml inference engines. Be ready to discuss how you've deployed machine learning models to edge devices in the past, as this will show your hands-on experience and technical prowess.
✨Showcase Your Team Leadership Skills
Since you'll be managing a cross-functional team, prepare examples of how you've successfully led teams before. Highlight your ability to coordinate between different roles, like middleware and foundation engineers, and how you’ve driven projects to completion.
✨Understand the Product Landscape
Research the company’s products and their position in the market. Be prepared to discuss how you would assess and improve their offerings compared to competitors. This shows that you’re not just technically savvy but also understand the business side of things.
✨Be Ready for Technical Challenges
Expect some technical questions or challenges during the interview. Practice explaining deep learning concepts and model architectures clearly, as well as discussing any experience you have with transformers and LLMs. This will demonstrate your depth of knowledge and problem-solving skills.