Job Board

Companies

ConnexAI

Inference Engineer

Inference Engineer in Manchester

Manchester Full-Time 50000 - 60000 £ / year (est.) No home office possible

Apply now

At a Glance

Tasks: Revolutionise the AI industry by developing efficient inference pipelines for speech recognition and synthesis.
Company: Multi-award-winning AI and SaaS provider based in Manchester.
Benefits: Hybrid work model, competitive salary, and opportunities for professional growth.
Other info: Dynamic research environment with strong focus on collaboration and cutting-edge technology.
Why this job: Join a team of innovators and make a real impact in the Agentic AI space.
Qualifications: MSc with 2+ years in Speech or Generative AI, expert in Python and PyTorch.

The predicted salary is between 50000 - 60000 £ per year.

Location: Manchester, UK (Hybrid)

About the job

Can you see yourself revolutionising the Agentic AI industry? We are a multi-award-winning AI and SaaS provider based in Manchester, dedicated to boosting productivity and efficiency across our global customer base spanning five continents. As a Machine Learning Engineer, you will be the architect responsible for maximising the fluidity, scale, and performance of our ASR and TTS products within the Agentic AI pipeline. This role offers a world-class research and production environment: you will work alongside like-minded scientists who develop state-of-the-art models, acting as the vital link between experimental research and enabling these bespoke systems to be used by our customers. Your primary focus will be implementing highly efficient inference pipelines for real-time and offline speech recognition and synthesis, ensuring our Agentic vision translates into a seamless user experience.

Responsibilities

Liaise with ASR and TTS technical leads, software engineers, and DevOps teams to deploy models efficiently on Hopper and Blackwell GPU architectures.
Maintain steady biweekly progression within our sprint-based research environment.
Implement low-latency, real-time inference ML pipelines using tools such as Triton Inference Server, vLLM, or SGLang.
Optimise model performance across cloud platforms using frameworks like TensorRT and ONNX.
Build and maintain robust API services using Python-based web frameworks (e.g., FastAPI).
Manage containerisation and orchestration workflows using Docker and Kubernetes.
Ensure system reliability through observability and monitoring tools like Prometheus, Grafana, and OpenTelemetry.
Write concise technical documentation and research papers.

Requirements

MSc plus 2+ years of hands‑on experience in Speech or Generative AI.
Deep understanding of Generative AI, Neural Networks, and the latest LLM architectures.
Expert‑level proficiency in Python and PyTorch.
Proven experience in performance optimisation and cloud platform deployment.
Strong background in containerisation and orchestration (Docker, K8s, etc.).
Demonstrated ability to deploy and scale low‑latency ML pipelines in production.
Strong oral and written communication skills.

Inference Engineer in Manchester employer: ConnexAI

Join a pioneering AI and SaaS provider in Manchester, where innovation meets collaboration in a vibrant hybrid work environment. We prioritise employee growth through continuous learning opportunities and a supportive culture that values creativity and teamwork, making it an ideal place for those looking to make a significant impact in the Agentic AI industry.

Contact Detail:

ConnexAI Recruiting Team

View ConnexAI Profile

StudySmarter Expert Advice 🤫

We think this is how you could land Inference Engineer in Manchester

✨Tip Number 1

Network like a pro! Reach out to folks in the AI and tech scene, especially those who work at companies you're eyeing. LinkedIn is your best mate here; drop them a message, ask about their experiences, and see if they can give you the inside scoop on job openings.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects related to speech recognition or generative AI. This gives potential employers a taste of what you can do and sets you apart from the crowd.

✨Tip Number 3

Prepare for interviews by brushing up on technical questions specific to inference pipelines and machine learning frameworks. Practice explaining complex concepts in simple terms, as communication is key in this field. We want to see how you think and solve problems!

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team and contributing to the Agentic AI revolution.

We think you need these skills to ace Inference Engineer in Manchester

Machine Learning

Speech Recognition

Text-to-Speech (TTS)

Inference Pipelines

Generative AI

Neural Networks

Large Language Models (LLM)

Python

PyTorch

Performance Optimisation

Cloud Platform Deployment

Containerisation

Orchestration

Docker

Kubernetes

API Development

Technical Documentation

Communication Skills

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Inference Engineer role. Highlight your experience with ASR, TTS, and any relevant projects that showcase your skills in Python and performance optimisation. We want to see how you can contribute to our Agentic AI vision!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about AI and how your background aligns with our mission at StudySmarter. Keep it concise but impactful – we love a good story!

Showcase Your Technical Skills: Don’t forget to mention your hands-on experience with tools like Triton Inference Server, Docker, and Kubernetes. We’re looking for someone who can hit the ground running, so make sure we know what you bring to the table!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates. Plus, we love seeing applications come in through our own channels!

How to prepare for a job interview at ConnexAI

✨Know Your Tech Inside Out

Make sure you’re well-versed in the tools and technologies mentioned in the job description, like Triton Inference Server and Docker. Brush up on your Python and PyTorch skills, as you'll likely be asked to demonstrate your expertise during the interview.

✨Showcase Your Problem-Solving Skills

Prepare to discuss specific challenges you've faced in previous roles, especially related to low-latency ML pipelines or performance optimisation. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your contributions.

✨Familiarise Yourself with Their Products

Research the company’s ASR and TTS products thoroughly. Understanding their applications and how they fit into the Agentic AI pipeline will help you articulate how you can contribute to enhancing user experience and system reliability.

✨Communicate Clearly and Confidently

Since strong communication skills are a must, practice explaining complex technical concepts in simple terms. This will not only show your expertise but also your ability to collaborate effectively with cross-functional teams.

Inference Engineer in Manchester

ConnexAI

Location: Manchester

Apply now

Inference Engineer in Manchester

At a Glance

Inference Engineer in Manchester employer: ConnexAI

StudySmarter Expert Advice 🤫

✨Tip Number 1

✨Tip Number 2

✨Tip Number 3

✨Tip Number 4

We think you need these skills to ace Inference Engineer in Manchester

Some tips for your application 🫡

How to prepare for a job interview at ConnexAI

Inference Engineer in Manchester

Land your dream job quicker with Premium