At a Glance
- Tasks: Develop and optimise AI models for Arabic speech synthesis and recognition.
- Company: CNTXT, a pioneering voice AI company focused on the Arabic-speaking world.
- Benefits: Competitive pay, remote work options, and a chance to shape impactful technology.
- Other info: Opportunity to work on cutting-edge projects with significant career growth.
- Why this job: Join a small team making a real difference in Arabic voice AI technology.
- Qualifications: Strong machine learning background and experience with Python and neural models.
The predicted salary is between 50000 - 70000 Β£ per year.
About CNTXT
CNTXT is building voice AI infrastructure for the Arabic-speaking world. We work on the hard problems β natural speech synthesis, real-time transcription, and conversational voice systems β with a focus on Arabic language quality that actually serves the region's speakers.
The Role
We're looking for an AI engineer or researcher who is passionate about voice and speech technology. You'll work directly on the models and systems that power our speech products β evaluating architectures, running fine-tuning experiments, and shipping improvements to production. This is a hands-on role that sits at the intersection of research and engineering.
What Our Team Works On
- Speech Synthesis (TTS) - We build and fine-tune Arabic TTS systems based on state-of-the-art generative architectures β both autoregressive models that generate speech token by token and non-autoregressive models that produce full utterances in parallel. This includes working with neural vocoders (HiFi-GAN, MelGAN, WaveGlow), audio codecs and tokenizers (EnCodec, DAC, RVQ-based systems), acoustic encoders (HuBERT, wav2vec), and diffusion-based audio decoders. A significant focus is voice cloning and zero-shot speaker adaptation for Arabic voices.
- Speech Recognition (ASR) - We work with encoder-decoder and CTC-based ASR models (Whisper, Conformer, wav2vec 2.0) to build accurate, low-latency Arabic transcription. This includes streaming inference, domain adaptation, and language model integration for Arabic dialect robustness.
- Speech-to-Speech - We are building end-to-end voice interaction pipelines that chain ASR, language understanding, and TTS β with hard constraints on latency. This involves voice activity detection (VAD), speaker diarization, speech enhancement, and optimizing the full stack for real-time performance.
- Arabic Language Challenges - Arabic presents unique challenges across the whole stack: diacritization (tashkil) is critical for TTS pronunciation accuracy, dialect variation (MSA, Gulf, Levantine, Egyptian, Maghrebi) affects both synthesis and recognition quality, and training data for many dialects remains scarce. A big part of our work is closing these gaps.
What You'll Work On
- Benchmark and evaluate TTS and ASR models on Arabic test sets β measuring WER, speaker similarity (SIM), naturalness, and dialect coverage across MSA and regional varieties.
- Fine-tune pretrained TTS models on curated Arabic data β including ablations on diacritized vs. undiacritized input, dialect-specific training splits, and voice prompt conditioning.
- Experiment with audio tokenizer and codec configurations β comparing discrete RVQ representations against continuous latent approaches and their effect on Arabic phoneme accuracy.
- Build and maintain Arabic speech data pipelines β audio sourcing, normalization, diacritization, quality filtering, and manifest generation for model training.
- Optimize models for production serving β streaming chunk generation, KV cache tuning, quantization, and batched inference for low-latency Arabic TTS and ASR.
- Evaluate and adapt speech-to-speech pipelines β integrating ASR, LLM, and TTS components with attention to end-to-end latency and Arabic conversational quality.
What We're Looking For
- Strong foundations in machine learning and deep learning.
- Hands-on experience training or fine-tuning neural models β domain matters less than depth.
- Comfortable with Python, PyTorch, and the HuggingFace ecosystem.
- Able to read research papers and translate ideas into experiments independently.
- Clear communicator who can work across research and engineering.
Nice to Have
- Native or fluent Arabic speaker β a real advantage when evaluating synthesis naturalness and dialect quality.
- Prior work with speech or audio models (ASR, TTS, speaker verification, codec, VAD, enhancement, or similar).
- Familiarity with Arabic linguistic structure, diacritization tools, and NLP preprocessing for Arabic.
- Experience with inference optimization β quantization, speculative decoding, CUDA kernels, or serving frameworks (vLLM, TensorRT).
- Publications or open-source contributions in speech or audio.
What We Offer
- Work at the frontier of Arabic voice AI β a genuinely underserved, high-impact area.
- Direct influence on product and research direction.
- Small, focused team β your work ships and matters.
- Competitive compensation and remote flexibility.
AI Engineer β Speech & Voice Intelligence in Newport employer: CNTXT AI
CNTXT is an exceptional employer for those passionate about voice AI, particularly in the Arabic-speaking world. With a focus on cutting-edge technology and a small, dedicated team, employees have the opportunity to make a significant impact on product development while enjoying competitive compensation and remote work flexibility. The company fosters a collaborative work culture that encourages innovation and personal growth, making it an ideal place for professionals looking to advance their careers in a meaningful way.
StudySmarter Expert Adviceπ€«
We think this is how you could land AI Engineer β Speech & Voice Intelligence in Newport
β¨Get Involved in Data Science Meetups
Tap into local data science meetups or workshops to connect with fellow enthusiasts and professionals. These events are goldmines for networking, and sometimes even lead directly to job openings at companies like CNTXT AI!
β¨Show Off Your Projects
Start building a public portfolio showcasing your data science projects on platforms like GitHub or personal websites. Highlight unique analyses or models you've developed. This not only demonstrates your skills but also gets your name out there for roles like AI Engineer β Speech & Voice Intelligence at CNTXT AI.
β¨Leverage Professional Networks
Join professional bodies related to data science, like the Data Science Society or similar organisations. Getting involved can lead to mentorship opportunities and insider knowledge about full-time positions at companies like CNTXT AI.
β¨Apply Directly through Our Website
When you find a suitable opening like AI Engineer β Speech & Voice Intelligence at CNTXT AI, make sure to apply directly through our website. It gives you an edge and shows you're keen to join our team. Plus, who doesnβt love a direct application? Itβs easier than navigating through job boards!
We think you need these skills to ace AI Engineer β Speech & Voice Intelligence in Newport
Some tips for your application π«‘
Show Off Your Projects:In the world of data science, your projects can speak volumes about your skills. Make sure to showcase a few key projects in your CV or portfolio, especially those that highlight your ability to work with data sets, build models, or use relevant tools like Python, R, or SQL. Donβt forget to include links to any GitHub repositories if applicable!
Quantify Your Achievements:Employers love numbers! When drafting your CV, highlight your achievements with quantifiable results. For instance, mention how your data analysis led to a certain percentage increase in efficiency or revenue at a previous job or project. These details can really make your application pop!
Craft a Tailored Cover Letter:For a full-time role at CNTXT AI, your cover letter should reflect your passion for data science and your excitement about the specific projects or values of the company. Dive into why youβre a good fit, how your skills align with their needs, and any unique perspectives you can bring to the team.
Stand Out with Relevant Courses and Certifications:Although experience talks, relevant courses or certifications can be your ticket to impressing hiring managers at CNTXT AI. Mention any standout courses you've completed that equipped you with essential skills, such as machine learning certifications or data visualisation courses. This shows your commitment to continuously developing your skills in the field!
How to prepare for a job interview at CNTXT AI
β¨Brush Up on Your Statistics
For a data science role, we need to seriously sharpen our statistics skills. Get ready to tackle technical questions on probability distributions, hypothesis testing, and regression analysis. These are often the bread and butter of data science interviews, so don't just skim over them!
β¨Showcase Your Projects
Prepare a killer portfolio showcasing your data science projects. We should include details about the datasets used, the tools and techniques applied, and the impact of your findings. If we can walk them through a particularly challenging project or a cool visualisation that had real-world implications, itβll really make us stand out!
β¨Get Comfortable with Python and R
Most data science positions require us to be proficient in programming languages like Python and R. We should practice common libraries like pandas, NumPy, and scikit-learn, and be ready for live coding exercises or algorithm questions. Showing off our coding chops can really impress the interviewers at CNTXT AI!
β¨Prepare for Case Studies
Expect to encounter real-world case studies during the interview. We might be asked how weβd approach a data problem or analyse a dataset to extract insights. It's essential to think out loud and demonstrate our problem-solving process so that the interviewer can see our logical thinking in action.