Senior Machine Learning Engineer - AI & GPU Performance
Senior Machine Learning Engineer - AI & GPU Performance

Senior Machine Learning Engineer - AI & GPU Performance

City of London Full-Time 48000 - 84000 ÂŁ / year (est.) No home office possible
Go Premium
S

At a Glance

  • Tasks: Design and develop high-performance solutions in AI and GPU performance.
  • Company: Join Synthesia, a fast-growing AI video communications platform.
  • Benefits: Attractive salary, stock options, private health insurance, and hybrid work.
  • Why this job: Make a real impact in the world of AI and video technology.
  • Qualifications: 3+ years in ML engineering, experience with CUDA/Triton, and strong coding skills.
  • Other info: Great company culture with opportunities for career growth and regular social events.

The predicted salary is between 48000 - 84000 ÂŁ per year.

Who are we?

From your everyday PowerPoint presentations to Hollywood movies, AI will transform the way we create and consume content.

Today, people want to watch and listen, not read — both at home and at work. If you’re reading this and nodding, check out our brand video.

Despite the clear preference for video, communication and knowledge sharing in the business environment are still dominated by text, largely because high‑quality video production remains complex and challenging to scale—until now….

Meet Synthesia

We\’re on amission to make video easy for everyone. Born in an AI lab, our AI video communications platform simplifies the entire video production process, making it easy for everyone, regardless of skill level, to create, collaborate, and share high‑quality videos. Whether it\’s for delivering essential training to employees and customers or marketing products and services, Synthesia enables large organizations to communicate and share knowledge through video quickly and efficiently. We’re trusted by leading brands such as Heineken, Zoom, Xerox, McDonald’s and more. Readstories from happy customers and what1,200+ people say on G2.

In February 2024, G2 named us as the fastest growing company in the world. Today, we\’re at a $2.1bn valuation and we recently raised our Series D. This brings our total funding to over $330M from top‑tier investors, including Accel, Nvidia, Kleiner Perkins, Google and top founders and operators including Stripe, Datadog, Miro, Webflow, and Facebook.

About the role

As a ML Performance Engineer in the AI & GPU Performance team you will contribute to the design and development of high performance solutions. You will join a team of 40+ Researchers and Engineers within the R&D Department working on cutting edge challenges in the Generative AI space, with a focus on creating highly realistic, emotional and life‑like Synthetic humans through text‑to‑video. Within the team you’ll have the opportunity to work on the applied side of our research efforts and directly impact our solutions that are used worldwide by over 60,000 businesses.

This is an opportunity to work for a company that is impacting businesses at a rapid pace across the globe.

What will you be doing?

As a ML Performance Engineer in the AI & GPU Performance team you will contribute to the design and development of high performance solutions. You will own one or more projects for computationally optimizing large‑scale model training and inference pipelines. By partnering with researchers and research teams you’ll identify high‑impact initiatives and push the boundaries of model performance. You will work on re‑implementing models in an efficient manner by using PyTorch and underlying technologies like CUDA/Triton, Torch compilation, etc.

This would include:

  • Evaluating, profiling and optimising compute resource usage (e.g., Hopper & Blackwell GPUs) for cost and time efficiency at training and inference times
  • Developing customised efficient solutions for inference pipelines (CUDA/Triton kernels) as well as introducing or enhancing tooling for achieving optimal computational performance (e.g. DL compilers, ONNX, TensorRT)
  • Driving the adoption of best practices for large‑model training, including checkpointing, gradient accumulation, and memory optimisation among others
  • Introducing or enhancing tooling for distributed training, performance monitoring, and logging (e.g., DeepSpeed, PyTorch Distributed)
  • Designing and implementing techniques for model parallelism, data parallelism, and mixed‑precision training
  • Keeping updated on the latest research in model compression (e.g., quantization, pruning) and advanced optimisation methods

Who are you?

  • You are an ML engineer passionate about high performance computing
  • You have a background in Computer Science / Engineering and 3+ years of industry experience. (PhD preferred)
  • You have worked on optimising large models for over 2 years
  • You have experience developing CUDA/Triton kernels and optimizing models with DL compilers (torch.compile)
  • You have great coding skills in Python and C++ and you care about writing clean, and efficient code
  • You have experience with optimising distributed systems and distributed tools like DDP, Deepspeed, Accelerate or similar
  • You have some experience in the video space (Diffusion models / GAN’s)
  • You are interested in doing research, trying new things and pushing the boundaries, going beyond what\’s already known

The good stuff…

  • Attractive compensation (salary + stock options + bonus)
  • Private Health Insurance in London
  • Hybrid work setting with an office in London
  • 25 days of annual leave + public holidays
  • Work in a great company culture with the option to join regular planning and socials at our hubs.
  • A generous referral scheme when you know people that are amazing for us
  • Strong opportunities for your career growth

Interested in building your career at Synthesia? Get future opportunities sent straight to your email.

By checking this box, I agree to allow Synthesia to retain my data for future opportunities for employment for up to 700 days after the conclusion of consideration of my current application for employment.

#J-18808-Ljbffr

Senior Machine Learning Engineer - AI & GPU Performance employer: synthesia.io

At Synthesia, we pride ourselves on being an innovative leader in AI video communications, offering a dynamic work environment that fosters creativity and collaboration. Our London office provides a hybrid work setting, competitive compensation packages, and ample opportunities for professional growth, all while being part of a rapidly expanding company that is reshaping how businesses communicate globally. Join us to be at the forefront of technology and make a meaningful impact in the world of video production.
S

Contact Detail:

synthesia.io Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Senior Machine Learning Engineer - AI & GPU Performance

✨Tip Number 1

Network like a pro! Reach out to people in the industry, especially those at Synthesia. A friendly chat can open doors that a CV just can't.

✨Tip Number 2

Show off your skills! If you’ve got a project or a portfolio, make sure to highlight it during interviews. We love seeing what you can do in action!

✨Tip Number 3

Prepare for technical challenges! Brush up on your coding skills and be ready to tackle some real-world problems. We want to see how you think on your feet.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we’re always looking for passionate folks to join our team!

We think you need these skills to ace Senior Machine Learning Engineer - AI & GPU Performance

Machine Learning
High Performance Computing
CUDA
Triton
PyTorch
Python
C++
Model Optimisation
Distributed Systems
DeepSpeed
Performance Monitoring
Model Parallelism
Data Parallelism
Mixed-Precision Training
Research and Development

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the role of Senior Machine Learning Engineer. Highlight your experience with CUDA/Triton and any projects where you've optimised large models. We want to see how your skills align with our mission!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for high-performance computing and how you can contribute to our AI & GPU Performance team. Let us know why you're excited about working at Synthesia!

Showcase Relevant Projects: Include specific examples of projects you've worked on that relate to model optimisation and performance. We love seeing real-world applications of your skills, so don’t hold back on the details!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity. Plus, it’s super easy!

How to prepare for a job interview at synthesia.io

✨Know Your Stuff

Make sure you brush up on your knowledge of machine learning, especially in the context of high-performance computing. Be ready to discuss your experience with CUDA/Triton kernels and optimising large models. This is your chance to show off your technical skills!

✨Show Your Passion for AI

Synthesia is all about pushing the boundaries of AI. Share your enthusiasm for generative AI and any personal projects or research you've done in this area. This will help you connect with the interviewers and demonstrate that you're genuinely interested in the field.

✨Prepare for Technical Questions

Expect some deep dives into your coding skills, particularly in Python and C++. Practice explaining your thought process when solving problems, as well as discussing best practices for model training and optimisation. Being articulate about your approach can set you apart.

✨Ask Insightful Questions

Interviews are a two-way street! Prepare thoughtful questions about the team’s current projects, challenges they face, and how they measure success. This shows that you’re not just interested in the role, but also in contributing to the company’s mission.

Senior Machine Learning Engineer - AI & GPU Performance
synthesia.io
Location: City of London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

S
  • Senior Machine Learning Engineer - AI & GPU Performance

    City of London
    Full-Time
    48000 - 84000 ÂŁ / year (est.)
  • S

    synthesia.io

    50-100
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>