Mid-Level and Senior ML Runtime Engineer in Bristol

Bristol Full-Time 80000 - 100000 £ / year (est.) No working from home possible

Apply Now

At a Glance

Tasks: Integrate cutting-edge AI hardware with leading inference frameworks and tackle complex ML challenges.
Company: Fractile, a revolutionary tech company in AI acceleration hardware.
Benefits: Competitive salary, equity, hybrid work, and a culture of learning and collaboration.
Other info: Diverse and inclusive workplace welcoming applicants from all backgrounds.
Why this job: Join a small expert team and make a real impact on ambitious AI projects.
Qualifications: Experience in ML inference, strong software engineering skills, and a passion for innovation.

The predicted salary is between 80000 - 100000 £ per year.

About Fractile

We’re taking a revolutionary approach to computing — building AI acceleration hardware that runs the world’s largest language models 100× faster than existing systems. Our team works at the cutting edge of both hardware and software AI development, and we’re growing fast.

The Role

We’re looking for a Senior ML Runtime Engineer to help us integrate Fractile’s AI accelerators with the latest inference frameworks and build the runtime stack that makes them fly. You’ll work on genuinely hard problems — KV cache management, scalable multi‑user inference, and the internals of transformer model execution — alongside a collaborative team that values curiosity and rigor equally. This is a hybrid role, with offices in London and Bristol — your choice of base.

What You’ll Do

Integrate Fractile’s AI acceleration hardware with leading inference engines including vLLM and SGLang
Research KV cache management technologies (including paged attention) and build proof‑of‑concept implementations tailored to our hardware
Work closely with the runtime team to design and build a scalable, bare‑bones reference inference engine
Focus primarily on the transformer ML architecture
Share your expertise to help shape the direction of our runtime stack

What We’re Looking For

We care most about depth of knowledge and a genuine interest in the problem space. You’ll be a strong fit if you have:

Solid experience with ML inference at scale, including multi‑user serving
A deep understanding of paged attention and inference engines such as vLLM
Familiarity with key components of the ML software ecosystem
Strong software engineering skills and an instinct for clean, maintainable systems

Bonus Points

Experience with Rust
Having built your own inference engine from scratch
A degree in Computer Science or a related field

Why Fractile

Work on one of the most technically ambitious projects in AI infrastructure
A small, expert team where your contributions are visible and valued
Hybrid working — split your time between home and our London or Bristol office
Competitive salary and equity
A culture that values learning, directness, and collaboration

Fractile is committed to building a diverse and inclusive team. We welcome applications from people of all backgrounds and actively encourage candidates from underrepresented groups to apply.

Mid-Level and Senior ML Runtime Engineer in Bristol employer: Fractile

Fractile is an exceptional employer, offering a unique opportunity to work on groundbreaking AI acceleration hardware in a collaborative and innovative environment. With a hybrid working model based in London or Bristol, employees enjoy a competitive salary, equity options, and a culture that prioritises learning and teamwork, making it an ideal place for those seeking meaningful contributions in the tech industry.

Contact Details:

Fractile Recruitment Team

View Fractile profile

We think you need these skills to ace Mid-Level and Senior ML Runtime Engineer in Bristol

ML Inference at Scale

KV Cache Management

Paged Attention

Inference Engines (e.g., vLLM, SGLang)

Transformer ML Architecture

Software Engineering

Clean Code Practices

Scalable System Design

Proof-of-Concept Implementation

Rust Programming

ML Software Ecosystem Familiarity

Mid-Level and Senior ML Runtime Engineer in Bristol

Fractile

Location: Bristol

Apply Now

Mid-Level and Senior ML Runtime Engineer in Bristol

At a Glance

Mid-Level and Senior ML Runtime Engineer in Bristol employer: Fractile

We think you need these skills to ace Mid-Level and Senior ML Runtime Engineer in Bristol

Company

Product

Help