At a Glance
- Tasks: Revolutionise the AI industry by developing efficient inference pipelines for speech recognition and synthesis.
- Company: Multi-award-winning AI and SaaS provider based in Manchester.
- Benefits: Hybrid work model, competitive salary, and opportunities for professional growth.
- Other info: Dynamic research environment with strong focus on collaboration and cutting-edge technology.
- Why this job: Join a team of innovators and make a real impact in the Agentic AI space.
- Qualifications: MSc with 2+ years in Speech or Generative AI, expert in Python and PyTorch.
The predicted salary is between 50000 - 60000 £ per year.
Location: Manchester, UK (Hybrid)
About the job
Can you see yourself revolutionising the Agentic AI industry? We are a multi-award-winning AI and SaaS provider based in Manchester, dedicated to boosting productivity and efficiency across our global customer base spanning five continents. As a Machine Learning Engineer, you will be the architect responsible for maximising the fluidity, scale, and performance of our ASR and TTS products within the Agentic AI pipeline. This role offers a world-class research and production environment: you will work alongside like-minded scientists who develop state-of-the-art models, acting as the vital link between experimental research and enabling these bespoke systems to be used by our customers. Your primary focus will be implementing highly efficient inference pipelines for real-time and offline speech recognition and synthesis, ensuring our Agentic vision translates into a seamless user experience.
Responsibilities
- Liaise with ASR and TTS technical leads, software engineers, and DevOps teams to deploy models efficiently on Hopper and Blackwell GPU architectures.
- Maintain steady biweekly progression within our sprint-based research environment.
- Implement low-latency, real-time inference ML pipelines using tools such as Triton Inference Server, vLLM, or SGLang.
- Optimise model performance across cloud platforms using frameworks like TensorRT and ONNX.
- Build and maintain robust API services using Python-based web frameworks (e.g., FastAPI).
- Manage containerisation and orchestration workflows using Docker and Kubernetes.
- Ensure system reliability through observability and monitoring tools like Prometheus, Grafana, and OpenTelemetry.
- Write concise technical documentation and research papers.
Requirements
- MSc plus 2+ years of hands‑on experience in Speech or Generative AI.
- Deep understanding of Generative AI, Neural Networks, and the latest LLM architectures.
- Expert‑level proficiency in Python and PyTorch.
- Proven experience in performance optimisation and cloud platform deployment.
- Strong background in containerisation and orchestration (Docker, K8s, etc.).
- Demonstrated ability to deploy and scale low‑latency ML pipelines in production.
- Strong oral and written communication skills.
Inference Engineer in Manchester employer: ConnexAI
Contact Detail:
ConnexAI Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Inference Engineer in Manchester
✨Tip Number 1
Network like a pro! Reach out to folks in the AI and tech scene, especially those who work at companies you're eyeing. LinkedIn is your best mate here; drop them a message, ask about their experiences, and see if they can give you the inside scoop on job openings.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects related to speech recognition or generative AI. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on technical questions specific to inference pipelines and machine learning frameworks. Practice explaining complex concepts in simple terms, as communication is key in this field. We want to see how you think and solve problems!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team and contributing to the Agentic AI revolution.
We think you need these skills to ace Inference Engineer in Manchester
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Inference Engineer role. Highlight your experience with ASR, TTS, and any relevant projects that showcase your skills in Python and performance optimisation. We want to see how you can contribute to our Agentic AI vision!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about AI and how your background aligns with our mission at StudySmarter. Keep it concise but impactful – we love a good story!
Showcase Your Technical Skills: Don’t forget to mention your hands-on experience with tools like Triton Inference Server, Docker, and Kubernetes. We’re looking for someone who can hit the ground running, so make sure we know what you bring to the table!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates. Plus, we love seeing applications come in through our own channels!
How to prepare for a job interview at ConnexAI
✨Know Your Tech Inside Out
Make sure you’re well-versed in the tools and technologies mentioned in the job description, like Triton Inference Server and Docker. Brush up on your Python and PyTorch skills, as you'll likely be asked to demonstrate your expertise during the interview.
✨Showcase Your Problem-Solving Skills
Prepare to discuss specific challenges you've faced in previous roles, especially related to low-latency ML pipelines or performance optimisation. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your contributions.
✨Familiarise Yourself with Their Products
Research the company’s ASR and TTS products thoroughly. Understanding their applications and how they fit into the Agentic AI pipeline will help you articulate how you can contribute to enhancing user experience and system reliability.
✨Communicate Clearly and Confidently
Since strong communication skills are a must, practice explaining complex technical concepts in simple terms. This will not only show your expertise but also your ability to collaborate effectively with cross-functional teams.