At a Glance
- Tasks: Design and optimise TTS models for natural-sounding voice assistants.
- Company: Join Aiphoria, a team of award-winning AI professionals.
- Benefits: Competitive pay, remote work, and rapid career progression.
- Why this job: Work with cutting-edge tech in voice-assisted banking and make a real impact.
- Qualifications: 3+ years in TTS, proficiency in Python, and deep learning frameworks.
- Other info: Dynamic start-up culture with opportunities to work on diverse projects.
The predicted salary is between 36000 - 60000 Β£ per year.
We are now expanding our team and are looking for skilled, goal-oriented MLE (TTS) to join our teams.
Requirements
- 3+ years of hands-on experience with Text-to-Speech (TTS) / speech synthesis
- Proficiency in Python and deep learning frameworks (especially, PyTorch)
- Strong understanding of speech synthesis processing techniques
- Experience with Fast Attention-Based Models: (FastPitch, FastSpeech 2) and modern variative approaches: (e.g., VITS, Glow-TTS)
- Strong understanding of techniques to control prosody, rhythm, and emotional tone for expressive speech synthesis
- Knowledge of normalization techniques, FSTs, NN for normalization
- Familiarity with TTS evaluation techniques, including MOS and A/B testing
- Familiarity with vocoder models (e.g. Vocos, HiFi-GAN, mimi)
- Knowledge of signal processing, statistical modeling, and language structure
Responsibilities
- Design and optimize TTS models to ensure our voice assistant sounds as natural and accurate as possible
- Collaborate closely with product managers and engineers to integrate TTS tech, making it seamless and intuitive for users
- Partner with data teams to build efficient audio data pipelines, from speaker recording/preprocessing to model training
- Regularly update and refine TTS models to adapt to various accents, dialects, and speech styles, enhancing user satisfaction and responsiveness
- Keep up-to-date with the latest TTS advancements, bringing in innovative techniques and tools to keep us at the forefront of voice-assisted banking
- Rigorously test and validate models to meet strict standards
What We Offer
- Experienced team, Aiphoria is formed by a team of enthusiastic professionals who created award-winning devices, voice assistants and other AI-driven products for BigTech corporations
- Cutting-edge technologies, we build a technology using our areas of expertise including Computer Vision, Speech Technologies, Natural Language Understanding, Generative AI incl. LLM and Diffusion models
- Rapid career progression, facilitated by our team of seasoned senior professionals who hail from prestigious, industry-leading companies
- Remote work opportunities
- Company has prominent clients with an opportunity for you to work on different projects and/or to be involved in developing our proprietary own products
- Competitive compensation surpassing market standards
- A company with entrepreneurial spirit. We offer a unique mix of a secure workspace thanks to the big clients raised along with a true start-up culture!
Machine Learning Engineer TTS employer: Aiphoria
Contact Detail:
Aiphoria Recruiting Team
StudySmarter Expert Advice π€«
We think this is how you could land Machine Learning Engineer TTS
β¨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with other Machine Learning Engineers. You never know who might have the inside scoop on job openings or can refer you directly.
β¨Tip Number 2
Show off your skills! Create a portfolio showcasing your TTS projects, especially those using FastPitch or FastSpeech 2. Having tangible examples of your work can really set you apart during interviews.
β¨Tip Number 3
Prepare for technical interviews by brushing up on your Python and deep learning frameworks. Practice coding challenges and be ready to discuss your understanding of speech synthesis techniques and models.
β¨Tip Number 4
Donβt forget to apply through our website! Itβs the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!
We think you need these skills to ace Machine Learning Engineer TTS
Some tips for your application π«‘
Tailor Your CV: Make sure your CV highlights your experience with TTS and Python. We want to see how your skills align with our needs, so donβt be shy about showcasing relevant projects or achievements!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Tell us why youβre passionate about TTS and how you can contribute to our team. Keep it engaging and personal β we love to see your personality come through.
Showcase Your Technical Skills: Be specific about your experience with deep learning frameworks like PyTorch and any TTS models you've worked on. Weβre looking for hands-on experience, so include examples that demonstrate your expertise!
Apply Through Our Website: We encourage you to apply directly through our website. Itβs the best way for us to receive your application and ensures youβre considered for the role. Plus, itβs super easy!
How to prepare for a job interview at Aiphoria
β¨Know Your TTS Inside Out
Make sure you brush up on your Text-to-Speech knowledge before the interview. Familiarise yourself with the latest advancements in speech synthesis, especially Fast Attention-Based Models like FastPitch and FastSpeech 2. Being able to discuss these topics confidently will show that you're not just skilled but also genuinely interested in the field.
β¨Showcase Your Python Skills
Since proficiency in Python is a must-have, be prepared to discuss your experience with it in detail. Bring examples of projects where you've used Python and deep learning frameworks like PyTorch. If possible, demonstrate your understanding of how these tools can optimise TTS models.
β¨Discuss Prosody and Emotion Control
Understanding how to control prosody, rhythm, and emotional tone is crucial for expressive speech synthesis. Be ready to share your insights or experiences related to these techniques. This will highlight your expertise and ability to enhance user satisfaction through natural-sounding voice assistants.
β¨Prepare for Technical Questions
Expect technical questions about TTS evaluation techniques, vocoder models, and signal processing. Brush up on concepts like MOS and A/B testing, and be ready to explain how you've applied these in past projects. This preparation will help you tackle any curveballs during the interview.