At a Glance
- Tasks: Join us to research and develop cutting-edge speech-to-speech capabilities in multimodal LLMs.
- Company: ConnexAI is pioneering innovative language models with a focus on speech technologies.
- Benefits: Enjoy a collaborative environment, flexible work options, and opportunities for professional growth.
- Why this job: Be at the forefront of AI technology, shaping impactful products while working with passionate experts.
- Qualifications: PhD or MSc in machine learning/data science, with experience in LLMs and strong Python skills required.
- Other info: Ideal for those eager to innovate and improve workflows in a dynamic team setting.
The predicted salary is between 36000 - 60000 £ per year.
About the Role
ConnexAI is developing an ambitious new product to enhance our large language models with speech-to-speech capabilities. This greenfield project offers a unique opportunity to help define its research direction and build the machine learning systems that will power it.
We’re seeking a data scientist with a strong research background in machine learning and a focus on speech or multimodal systems. In this role, you’ll work at the intersection of speech and language technologies, exploring how to integrate these modalities into deployable models. You’ll collaborate closely with engineers, researchers, and product leaders to design, prototype, and deploy state-of-the-art models.
What You'll Be Doing
- Researching the state-of-the-art approaches for incorporating audio data into multimodal LLMs for speech-to-text, text-to-speech, and speech-to-speech tasks
- Implementing and adapting techniques from recent academic papers into practical, production-ready solutions
- Training and fine-tuning models, and iterating on architectures to improve performance and scalability
- Sourcing, curating, and preparing datasets for model training and evaluation
- Defining evaluation metrics and testing frameworks for multimodal systems
- Collaborating with product and engineering teams to translate research concepts into deployable features
- Contributing to improving the team’s workflows to help foster a healthy, productive, and innovation-focused environment
What We're Looking For
- Background in machine learning or data science, ideally with a research focus (PhD or MSc with equivalent industry experience)
- Experience working with LLMs, speech technologies (ASR, TTS), or multimodal systems
- Strong programming skills in Python, with experience in using PyTorch
- Hands-on experience with training and fine-tuning ML models, including setting up experiments and evaluating results
- Ability to read, interpret, and implement techniques from recent academic papers into practical, working solutions
- Strong communicator, comfortable working across interdisciplinary teams
- A collaborative mindset and interest in helping improve team workflows
- Curiosity and a willingness to learn
Data Marketing Scientist employer: ConnexAI
Contact Detail:
ConnexAI Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Data Marketing Scientist
✨Tip Number 1
Familiarise yourself with the latest research in multimodal LLMs and speech technologies. This will not only help you understand the current landscape but also allow you to engage in meaningful conversations during interviews, showcasing your knowledge and enthusiasm for the field.
✨Tip Number 2
Network with professionals in the machine learning and speech technology sectors. Attend relevant conferences, webinars, or meetups to connect with industry experts. This can lead to valuable insights and potentially even referrals for the position at ConnexAI.
✨Tip Number 3
Demonstrate your programming skills by working on personal projects or contributing to open-source initiatives related to LLMs or speech technologies. Having a portfolio of practical work can set you apart from other candidates and show your hands-on experience.
✨Tip Number 4
Prepare to discuss how you would approach integrating audio data into multimodal systems. Think about specific challenges you might face and how you would overcome them, as this will demonstrate your problem-solving abilities and innovative thinking during the interview process.
We think you need these skills to ace Data Marketing Scientist
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience in machine learning, data science, and any specific projects related to speech technologies or multimodal systems. Use keywords from the job description to align your skills with what ConnexAI is looking for.
Craft a Compelling Cover Letter: In your cover letter, express your enthusiasm for the role and the company. Discuss your research background and how it relates to the development of multimodal LLMs. Mention specific experiences that demonstrate your ability to work collaboratively and innovate.
Showcase Your Projects: If you have worked on relevant projects, especially those involving LLMs or speech technologies, include them in your application. Provide links to any code repositories or publications that showcase your work and expertise in this area.
Prepare for Technical Questions: Be ready to discuss your technical skills and experiences in detail. Review recent academic papers related to multimodal systems and be prepared to explain how you would implement their techniques in practical applications.
How to prepare for a job interview at ConnexAI
✨Showcase Your Research Background
Make sure to highlight your research experience in machine learning, especially if it relates to speech or multimodal systems. Be prepared to discuss specific projects or papers you've worked on and how they relate to the role.
✨Demonstrate Technical Proficiency
Since strong programming skills in Python and experience with PyTorch are essential, be ready to discuss your coding experience. You might even want to prepare for a technical assessment or coding challenge during the interview.
✨Prepare for Collaborative Scenarios
Given the emphasis on collaboration with engineers and product leaders, think of examples where you've successfully worked in interdisciplinary teams. Be ready to discuss how you contributed to team workflows and fostered a productive environment.
✨Stay Updated on Recent Research
Familiarise yourself with the latest academic papers related to audio data integration in multimodal LLMs. Being able to discuss recent advancements and how they could apply to the company's projects will demonstrate your genuine interest and expertise.