At a Glance
- Tasks: Research and develop next-gen voice synthesis models for expressive speech.
- Company: Join a cutting-edge tech team focused on AI audio innovation.
- Benefits: Remote work, competitive salary, and opportunities for professional growth.
- Why this job: Make an impact in the exciting field of AI voice technology.
- Qualifications: Strong background in machine learning and experience with voice synthesis.
- Other info: Collaborative remote-first environment with a focus on innovation.
The predicted salary is between £36,000 and £60,000 per year.
We are looking for a Gen AI Researcher for Audio to join our team and help develop next-generation voice synthesis models. You will research and build deep learning systems that can generate expressive, natural-sounding speech from text or audio prompts, and collaborate with cross-functional teams to integrate your work into production-ready pipelines. We are hiring remotely across the EMEA region.
Responsibilities
- Research and develop state-of-the-art voice synthesis models (e.g., TTS, voice cloning, speech-to-speech).
- Build and fine-tune models using frameworks like PyTorch and HuggingFace.
- Design training pipelines and datasets for scalable voice model training.
- Explore techniques for emotional expressiveness, multilingual synthesis, and speaker adaptation.
- Work closely with product and creative teams to ensure models meet quality and production constraints.
- Stay on top of academic and industrial trends in speech synthesis and related fields.
Must Haves
- Strong background in machine learning and deep learning, with a focus on speech/audio.
- Hands-on experience with TTS, voice cloning, or related voice synthesis tasks.
- Proficiency with Python and PyTorch; experience with libraries like torchaudio, ESPnet, or similar.
- Experience training models at scale and working with large audio datasets.
- Familiarity with vocoders and transformer-based architectures.
- Strong problem-solving skills and the ability to work autonomously in a remote-first environment.
Nice to Have
- PhD in Computer Science/Machine Learning and publications in top venues.
- Contributions to open-source speech research or participation in relevant benchmarks.
- Familiarity with adjacent areas like lip-syncing, audio-driven animation, or expressive speech control.
- Experience with voice datasets or proprietary pipelines.
Gen AI Audio Researcher employer: DNEG
Contact Detail:
DNEG Recruiting Team
StudySmarter Expert Advice
We think this is how you could land the Gen AI Audio Researcher role
✨ Tip Number 1
Network like a pro! Reach out to folks in the industry on LinkedIn or at conferences. A friendly chat can open doors that a CV just can't.
✨ Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to voice synthesis. It's a great way to demonstrate what you can do beyond the written application.
✨ Tip Number 3
Prepare for interviews by brushing up on common questions in AI and audio research. Practice explaining your past projects clearly and confidently; we want to see your passion!
✨ Tip Number 4
Don't forget to apply through our website! It's the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take that extra step.
Some tips for your application
Tailor Your CV: Make sure your CV highlights your experience with voice synthesis and deep learning. We want to see how your skills align with the role, so don't be shy about showcasing relevant projects or research!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about voice synthesis and how your background makes you a perfect fit for our team. Keep it engaging and personal!
Showcase Your Projects: If you've worked on any cool projects related to TTS or voice cloning, make sure to mention them! We love seeing practical applications of your skills, so include links or descriptions of your work.
Apply Through Our Website: We encourage you to apply directly through our website. It's the best way to ensure your application gets into the right hands. Plus, it shows us you're keen on joining our team!
How to prepare for a job interview at DNEG
✨ Know Your Tech Inside Out
Make sure you're well-versed in the latest advancements in voice synthesis and deep learning. Brush up on frameworks like PyTorch and HuggingFace, and be ready to discuss your hands-on experience with TTS and voice cloning. This will show that you're not just familiar with the tools, but you can also apply them effectively.
✨ Showcase Your Problem-Solving Skills
Prepare to share specific examples of challenges you've faced in previous projects and how you tackled them. Highlight your ability to work autonomously, especially in a remote setting, as this is crucial for the role. Companies love candidates who can think on their feet and come up with innovative solutions.
✨ Stay Current with Trends
Familiarise yourself with the latest research and trends in speech synthesis. Mention any relevant publications or contributions to open-source projects during your interview. This demonstrates your passion for the field and your commitment to continuous learning, which is highly valued.
✨ Collaborate and Communicate
Since you'll be working closely with product and creative teams, practice articulating your ideas clearly. Be prepared to discuss how you would integrate your models into production-ready pipelines. Good communication skills can set you apart from other candidates, so don't underestimate their importance!