Machine Learning Engineer in England

Machine Learning Engineer in England

England Full-Time 60000 - 80000 € / year (est.) Home office (partial)
P

At a Glance

  • Tasks: Design and optimise systems for real-time AI video generation, enhancing accessibility for the Deaf community.
  • Company: Fast-growing tech startup focused on inclusivity and social impact.
  • Benefits: Competitive salary, equity packages, 24 days holiday, and free sign language classes.
  • Other info: Flexible working hours and a commitment to diversity and inclusion.
  • Why this job: Join a passionate team making a real difference with cutting-edge technology.
  • Qualifications: 3+ years in ML systems engineering, strong Python skills, and experience with production ML models.

The predicted salary is between 60000 - 80000 € per year.

Are you interested in helping us craft exceptional experiences for our clients that deliver genuine social impact? Are you ready to join a small and experienced team of innovators and make a significant contribution to a fast-growing technology company?

About the Company

Our client is a fast-paced startup with the goal of making the world a more inclusive place for the Deaf community. As a technology-for-good organisation, they are using AI to create sign language translations across video, transportation and website platforms. The company is expanding rapidly across these sectors, providing an exciting opportunity to grow its technical team. By joining as an ML Systems Engineer, you will be at the forefront of expansion into new markets, helping shape infrastructure strategy and ensuring systems remain scalable, secure and at the cutting edge of technology.

The Role

We are looking for an ML Systems Engineer to help design and optimise the systems that power real-time AI video generation. The company's models generate sign language video using generative AI pipelines deployed on GPU infrastructure across both cloud and on-prem environments. A key challenge is reducing generation latency and maximising GPU utilisation so the system can deliver real-time video streams. You will work across the full ML inference stack, from model optimisation to deployment infrastructure, ensuring models run efficiently in production environments. This role is ideal for engineers who enjoy performance optimisation, distributed systems, and building production ML infrastructure.

Example Responsibilities

  • ML Inference Optimisation
    • Profile and optimise deep learning models used for sign language video generation
    • Reduce inference latency using techniques such as quantisation, pruning, mixed precision, and kernel optimisation
    • Improve GPU utilisation and throughput across inference pipelines
    • Work closely with ML researchers to ensure models are production-ready
  • ML Infrastructure & Deployment
    • Build and maintain scalable model serving systems
    • Deploy and operate inference services on GPU clusters
    • Design autoscaling infrastructure to meet real-time SLAs
    • Contribute to model deployment pipelines, versioning, and rollback strategies
  • Performance Engineering
    • Develop benchmarking frameworks for tracking inference performance
    • Identify bottlenecks across the ML pipeline and eliminate latency hotspots
    • Implement performance monitoring and alerting for production systems
    • Evaluate new hardware accelerators and inference runtimes

Current Technology Stack

  • Python, Go and Rust production services
  • PyTorch-based generative models
  • GPU inference workloads
  • Kubernetes clusters (cloud and on-prem)
  • AWS infrastructure including SageMaker
  • Real-time streaming systems using protocols such as HLS, LL-HLS, RTMP and SRT

Essential Requirements

  • 3+ years of experience in ML systems engineering, ML infrastructure, or backend systems
  • Strong programming skills in Python (Rust is a plus)
  • Experience working with production ML models
  • Experience optimising ML inference performance
  • Familiarity with containerised systems such as Docker and Kubernetes
  • Strong debugging, profiling, and performance analysis skills
  • Interest in building latency-critical systems

Desirable Requirements

  • Experience with inference optimisation tools such as TensorRT, ONNX, or similar frameworks
  • Experience with model serving systems such as Triton, TorchServe, or Ray Serve
  • Familiarity with GPU architecture and performance optimisation
  • Experience working with video, graphics, or real-time streaming systems
  • Experience deploying ML workloads at scale
  • Experience contributing to open-source ML infrastructure projects

Why Join This Company

  • Work on technology that directly improves accessibility for Deaf communities
  • Help build one of the first real-time AI sign language generation systems
  • Join a small, experienced engineering team solving challenging technical problems
  • Opportunity to take ownership of critical systems as an early engineering hire
  • Work across a modern ML infrastructure stack

Benefits

  • 24 days' holiday plus bank holidays and company pension scheme
  • Competitive compensation and high-value equity packages
  • Opportunity to work on cutting-edge technologies and be involved in the early stages of a high-growth business
  • Free sign language classes

Hours

This is a full-time position with normal virtual office hours of 9am to 6pm, although flexibility is offered to suit reasonable personal circumstances. What matters most is strong collaboration, meeting agreed milestones and delivering high-quality work.

Please note you must have the right to work and live full-time in the UK when applying for this position.

Equality and Diversity

The company is committed to eliminating discrimination and encouraging diversity within its team. The aim is to build a workforce that is truly representative of all sections of society, where every employee feels respected and able to give their best. A culture of encouragement and support has been created to enable employees to focus on what they want to achieve for successful career development. Work-life policies and flexible working practices help employees feel more in control of their personal and professional lives. Any qualified applicants who are native sign language users are guaranteed an interview.

Machine Learning Engineer in England employer: Papillon Talent Strategy Ltd

Join a pioneering technology-for-good startup that is dedicated to enhancing accessibility for the Deaf community through innovative AI solutions. As an ML Systems Engineer, you will be part of a dynamic and supportive team, with opportunities for personal and professional growth while working on cutting-edge technologies. Enjoy a flexible work culture, competitive compensation, and the chance to make a meaningful impact in a rapidly expanding company.

P

Contact Detail:

Papillon Talent Strategy Ltd Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Machine Learning Engineer in England

Tip Number 1

Network like a pro! Reach out to people in the industry, attend meetups, and connect with professionals on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to ML systems engineering. This gives potential employers a taste of what you can do and sets you apart from the crowd.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice common ML scenarios and be ready to discuss how you've tackled challenges in past projects.

Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are genuinely interested in joining our mission to make a difference.

We think you need these skills to ace Machine Learning Engineer in England

Machine Learning Systems Engineering
Deep Learning Model Optimisation
Python Programming
Performance Optimisation
GPU Utilisation
Containerised Systems (Docker, Kubernetes)
Inference Performance Analysis

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the ML Systems Engineer role. Highlight your experience with ML infrastructure, programming skills in Python, and any relevant projects that showcase your ability to optimise performance.

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about using technology for good, especially in making a difference for the Deaf community. Show us how your skills align with our mission.

Showcase Your Projects:If you've worked on any projects related to ML inference optimisation or real-time systems, make sure to mention them. We love seeing practical examples of your work and how you’ve tackled challenges in the past.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity to join our innovative team!

How to prepare for a job interview at Papillon Talent Strategy Ltd

Know Your Tech Stack

Familiarise yourself with the technologies mentioned in the job description, like Python, PyTorch, and Kubernetes. Be ready to discuss your experience with these tools and how you've used them in past projects.

Showcase Your Problem-Solving Skills

Prepare examples of how you've tackled performance optimisation challenges in ML systems. Discuss specific techniques you've used, such as quantisation or pruning, and the impact they had on system performance.

Understand the Company’s Mission

Research the company’s focus on accessibility for the Deaf community. Be prepared to explain why this mission resonates with you and how your skills can contribute to their goals.

Ask Insightful Questions

Prepare thoughtful questions about the team dynamics, current projects, and future challenges. This shows your genuine interest in the role and helps you assess if the company is the right fit for you.