Gen AI Audio Researcher in London
Gen AI Audio Researcher

Gen AI Audio Researcher in London

London Full-Time 36000 - 60000 £ / year (est.) No home office possible
D

At a Glance

  • Tasks: Research and develop cutting-edge voice synthesis models for natural-sounding speech.
  • Company: Join a forward-thinking team in the exciting field of AI audio research.
  • Benefits: Remote work, competitive salary, and opportunities for professional growth.
  • Why this job: Make an impact in AI by creating innovative voice technologies that change how we communicate.
  • Qualifications: Strong background in machine learning, deep learning, and experience with voice synthesis.
  • Other info: Collaborative remote-first environment with a focus on creativity and innovation.

The predicted salary is between 36000 - 60000 £ per year.

We are looking for a Gen AI Researcher for Audio to join our team and help develop next-generation voice synthesis models. You'll research and build deep learning systems that can generate expressive, natural-sounding speech from text or audio prompts, and collaborate with cross-functional teams to integrate your work into production-ready pipelines. We are hiring remotely across the EMEA region.

Responsibilities

  • Research and develop state-of-the-art voice synthesis models (e.g., TTS, voice cloning, speech-to-speech).
  • Build and fine-tune models using frameworks like PyTorch and HuggingFace.
  • Design training pipelines and datasets for scalable voice model training.
  • Explore techniques for emotional expressiveness, multilingual synthesis, and speaker adaptation.
  • Work closely with product and creative teams to ensure models meet quality and production constraints.
  • Stay on top of academic and industrial trends in speech synthesis and related fields.

Must Haves

  • Strong background in machine learning and deep learning, with focus on speech/audio.
  • Hands-on experience with TTS, voice cloning, or related voice synthesis tasks.
  • Proficiency with Python and PyTorch; experience with libraries like torchaudio, ESPnet, or similar.
  • Experience training models at scale and working with large audio datasets.
  • Familiarity with vocoders and transformer-based architectures.
  • Strong problem-solving skills, ability to work autonomously in a remote-first environment.

Nice to Have

  • PhD degree in Computer Science/ Machine Learning and publications in top venues.
  • Contributions to open-source speech research or participation in relevant benchmarks.
  • Familiarity with adjacent areas like lip-syncing, audio-driven animation, or expressive speech control.
  • Experience with voice datasets or proprietary pipelines.

Gen AI Audio Researcher in London employer: DNEG

Join a forward-thinking company that values innovation and collaboration, where as a Gen AI Audio Researcher, you will have the opportunity to work on cutting-edge voice synthesis models in a remote-first environment across the EMEA region. We foster a culture of continuous learning and growth, offering employees access to the latest tools and resources, while encouraging contributions to open-source projects and academic research. With a focus on meaningful work and a commitment to employee well-being, we provide a supportive atmosphere that empowers you to excel in your career.
D

Contact Detail:

DNEG Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Gen AI Audio Researcher in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry on LinkedIn or at conferences. A friendly chat can open doors that a CV just can't.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to voice synthesis or deep learning. It’s a great way to demonstrate what you can bring to the table.

✨Tip Number 3

Prepare for interviews by brushing up on common questions in the AI and audio space. We recommend practicing with a friend or even recording yourself to refine your answers.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive!

We think you need these skills to ace Gen AI Audio Researcher in London

Machine Learning
Deep Learning
Voice Synthesis
Text-to-Speech (TTS)
Voice Cloning
Speech-to-Speech
Python
PyTorch
Torchaudio
ESPnet
Model Training at Scale
Large Audio Datasets
Vocoder Familiarity
Transformer-based Architectures
Problem-Solving Skills

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience with voice synthesis and deep learning. We want to see how your skills align with the role, so don’t be shy about showcasing relevant projects or technologies you've worked with!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about voice synthesis and how you can contribute to our team. Keep it engaging and personal – we love to see your personality come through.

Showcase Your Projects: If you've got any projects related to TTS or voice cloning, make sure to mention them! Whether it's a GitHub repo or a paper you've published, we want to see what you've been up to in the field of audio research.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to keep track of your application and ensure it gets the attention it deserves. Plus, it’s super easy!

How to prepare for a job interview at DNEG

✨Know Your Tech Inside Out

Make sure you’re well-versed in the latest advancements in voice synthesis and deep learning. Brush up on frameworks like PyTorch and HuggingFace, and be ready to discuss your hands-on experience with TTS and voice cloning. This will show that you’re not just familiar with the tools, but that you can use them effectively.

✨Showcase Your Problem-Solving Skills

Prepare to share specific examples of challenges you've faced in previous projects and how you overcame them. This is especially important in a remote-first environment where autonomy is key. Highlight your ability to think critically and adapt to new situations.

✨Collaborate Like a Pro

Since the role involves working closely with product and creative teams, be ready to discuss your experience in cross-functional collaboration. Share examples of how you’ve integrated your research into production-ready pipelines and how you’ve ensured quality meets production constraints.

✨Stay Ahead of the Curve

Demonstrate your passion for the field by discussing recent trends in speech synthesis and related areas. Mention any relevant publications or contributions to open-source projects. This shows that you’re not only knowledgeable but also actively engaged in the community.

Gen AI Audio Researcher in London
DNEG
Location: London

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

D
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>