Lead AI Inference Engineer in London

Lead AI Inference Engineer in London

London Full-Time 48000 - 72000 £ / year (est.) Home office possible
Go Premium
T

At a Glance

  • Tasks: Lead a team to deploy AI models and integrate cutting-edge features into products.
  • Company: Join Tether, a leader in digital finance and blockchain innovation.
  • Benefits: Work remotely, collaborate globally, and enjoy competitive pay and growth opportunities.
  • Why this job: Make a real impact in fintech while working with the latest AI technologies.
  • Qualifications: Strong programming skills in C++, experience with AI models, and team management.
  • Other info: Be part of a dynamic team pushing the boundaries of technology.

The predicted salary is between 48000 - 72000 £ per year.

Join Tether and shape the future of digital finance. At Tether, we’re not just building products; we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.

Innovate with Tether:

  • Tether Finance: Our innovative product suite features the world’s most trusted stablecoin, USDT, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services.
  • Tether Power: Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities.
  • Tether Data: Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like KEET, our flagship app that redefines secure and private data sharing.
  • Tether Education: Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity.
  • Tether Evolution: At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways.

Why join us? Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry. If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you. Are you ready to be part of the future?

About the job: You will lead a cross-functional pod that spans the full stack, from C++ inference engines to JavaScript applications. Your responsibility is to ensure that local AI capabilities ship reliably and perform well across devices. You will balance hands-on technical work with team coordination, guiding foundation and middleware engineers toward shared goals. This role is ideal for someone who understands both the low-level challenges of edge AI and the product-facing needs of app developers, and wants to drive the delivery of cohesive, production-ready local AI systems.

Responsibilities:

  • Work on deploying machine learning models to edge devices using the frameworks: llama.cpp, ggml, onnx.
  • Collaborate closely with researchers to assist in coding, training and transitioning models from research to production environments.
  • Integrate AI features into existing products, enriching them with the latest advancements in machine learning.
  • Manage a cross-functional team (pod) made of middleware (JS), foundation (C++), QA and documentation engineers to produce high-quality deliverables.
  • Regularly assess, both qualitatively and quantitatively, our position in the market with regards to similar products or platforms.
  • Leverage the expertise of technical architects to ensure robust architectural choices and code quality.
  • Ensure stable releases by following precise internal release processes.

Requirements:

  • Excellent programming skills in C++.
  • Strong experience with Llama.cpp and ggml inference engines, which facilitates the deployment of models to specific GPU architectures.
  • Good understanding of deep learning concepts and model architectures.
  • Experience with transformers and LLMs.
  • Demonstrated ability to rapidly assimilate new technologies and techniques.
  • Experience managing a small, specialized, cross-functional team (pod) of 3-5 people.
  • A genuine passion for building good products that improve people's lives.
  • A degree in Computer Science, AI, Machine Learning, or a related field, complemented by a solid track record in AI R&D.

Bonus points if:

  • You have extensive experience with Javascript/Typescript.
  • You have experience with AWS, containerization platforms, orchestration, and automated testing suites (Maestro, Appium).
  • You understand the difficulties, nuances and importance of p2p technology.
  • You have worked with MLC, TVM or similar frameworks.
  • You have experience with Vulkan, CUDA.
  • You have productionized models.

Important information for candidates: Recruitment scams have become increasingly common. To protect yourself, please keep the following in mind when applying for roles: Apply only through our official channels. We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page. Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles. If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website. Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communication is done through official company emails and platforms. Double-check email addresses. All communication from us will come from emails ending in @tether.to or @tether.io. We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam. Please report it immediately. When in doubt, feel free to reach out through our official website.

Lead AI Inference Engineer in London employer: Tether Operations Limited

At Tether, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration among a global team. Our commitment to employee growth is evident through opportunities to work on cutting-edge AI technologies while contributing to meaningful projects that shape the future of digital finance. With a focus on transparency and sustainability, Tether provides a unique environment where your contributions directly impact the fintech landscape, all from the comfort of your own home.
T

Contact Detail:

Tether Operations Limited Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Lead AI Inference Engineer in London

✨Tip Number 1

Network like a pro! Reach out to folks in the fintech and AI space, especially those already at Tether. A friendly chat can open doors and give you insider info on what it’s really like to work there.

✨Tip Number 2

Show off your skills! If you’ve got a portfolio or GitHub with projects related to AI or machine learning, make sure to highlight that. It’s a great way to demonstrate your expertise beyond just words.

✨Tip Number 3

Prepare for the interview by brushing up on your technical knowledge. Be ready to discuss your experience with C++, Llama.cpp, and any relevant frameworks. We want to see how you think and solve problems!

✨Tip Number 4

Don’t forget to apply through our official website! It’s the safest way to ensure your application gets seen and keeps you away from recruitment scams. Plus, we love seeing candidates who follow the right channels!

We think you need these skills to ace Lead AI Inference Engineer in London

C++ Programming
JavaScript
Machine Learning Deployment
Llama.cpp
ggml Inference Engines
Deep Learning Concepts
Transformers
LLMs
Team Management
Cross-Functional Collaboration
AI Feature Integration
Architectural Design
AWS
Containerization Platforms
Automated Testing Suites

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter for the Lead AI Inference Engineer role. Highlight your experience with C++, Llama.cpp, and ggml inference engines, as well as any relevant projects that showcase your skills in deploying machine learning models.

Showcase Your Passion: We love candidates who are genuinely excited about building products that improve lives. Share examples of your past work or projects that demonstrate your passion for AI and fintech, and how you’ve contributed to innovative solutions.

Be Clear and Concise: When writing your application, keep it clear and to the point. Use straightforward language to explain your experience and skills, making it easy for us to see why you’d be a great fit for our team.

Apply Through Our Website: Don’t forget to apply through our official careers page! This ensures your application goes directly to us and helps you avoid any recruitment scams. We’re excited to see what you bring to the table!

How to prepare for a job interview at Tether Operations Limited

✨Know Your Tech Inside Out

Make sure you’re well-versed in the frameworks mentioned in the job description, like llama.cpp and ggml. Brush up on your C++ skills and be ready to discuss how you've deployed machine learning models to edge devices in the past.

✨Showcase Your Team Leadership Skills

Since you'll be managing a cross-functional team, prepare examples of how you've successfully led teams before. Highlight your experience in coordinating between different tech stacks and ensuring high-quality deliverables.

✨Understand the Product Landscape

Research Tether’s products and their market position. Be ready to discuss how your skills can enhance their offerings and what innovations you could bring to the table. This shows you're not just interested in the role but also in the company’s mission.

✨Prepare for Technical Questions

Expect deep dives into AI concepts and model architectures. Brush up on your knowledge of transformers and LLMs, and be prepared to explain complex ideas in simple terms. This will demonstrate your expertise and communication skills.

Lead AI Inference Engineer in London
Tether Operations Limited
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

T
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>