Job Board

Companies

Nscale

Specialised AI Engineer

Specialised AI Engineer in London

London Full-Time 80000 - 100000 £ / year (est.) Home office (partial)

Apply now

At a Glance

Tasks: Design and optimise cutting-edge AI systems for a revolutionary GenAI cloud platform.
Company: Join Nscale, a leader in sustainable AI technology and innovation.
Benefits: Competitive salary, inclusive culture, and opportunities for professional growth.
Other info: Diverse and inclusive workplace encouraging applications from all backgrounds.
Why this job: Be at the forefront of AI technology and make a real impact in the industry.
Qualifications: 5+ years in machine learning and strong expertise in AI systems.

The predicted salary is between 80000 - 100000 £ per year.

About Nscale

Nscale is taking on the hyperscalers by building a vertically integrated GenAI cloud platform. We own the data centres, software, and applications that power today's AI stack using sustainable technology solutions. We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As a Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. Collaboration is key, and we work together swiftly and respectfully, embracing adaptability and resilience in all we do.

About the Role

Nscale is looking for Senior / Staff AI Engineers to join our core AI team and build the systems that power our GenAI cloud platform. This role sits at the heart of our AI services platform, designing and optimising distributed systems that power large-scale training, post-training, evaluation, and low-latency, high-throughput inference under strict performance and efficiency constraints. You may specialise deeply in areas such as inference optimisation, large-scale training, post-training (fine-tuning, alignment), or evaluation systems, or operate across multiple parts of the stack. In all cases, you’ll work on hard systems problems at scale, where performance, efficiency, and developer experience are critical. This is a hands-on role for engineers who want to push the boundaries of how AI systems are built, optimised, and consumed by other AI engineers.

Responsibilities

Design, build, and optimise scalable AI platform systems spanning (one or more):

Drive inference performance and efficiency, including:

KV cache management, continuous batching, speculative decoding
Quantisation (INT8/4, FP8), sparsity, pruning, and model compression

Build and improve post-training services, including:

Fine-tuning (LoRA, QLoRA, adapters, full fine-tuning)
Alignment (RLHF, DPO, reward modelling)
Agentic RL (tool calling, off-policy training, parallel thinking, decoupled sampling and updating)
Dataset curation and data processing workflows

Develop evaluation and benchmarking systems to measure:

Model quality, safety, and regression
System performance (latency, throughput, cost)
Real-world behaviour and feedback loops

Develop and optimise distributed systems for GPU/accelerator workloads, focusing on scalability, reliability, and efficiency

Conduct performance analysis and bottleneck investigations across multiple components and stacks spanning training, post-training, and inference

Collaborate with research, infrastructure, and product teams to build the right platform components based on customer demand and industry direction

Build developer-facing APIs, SDKs, and tooling that enable other engineers to effectively use Nscale’s AI services

Requirements

5+ years of experience building production systems in machine learning, distributed systems, or high-performance infrastructure
4+ years of hands-on experience in at least one core area, within large-scale, production AI environments (e.g., AI labs, hyperscalers), such as:

Inference optimisation
Large-scale training / pre-training systems
Post-training (fine-tuning, alignment, distillation)
Evaluation and benchmarking frameworks

Strong hands-on expertise in at least one of the above areas, with working knowledge across others

Proven ability to design, optimise, and operate systems at scale, with a strong understanding of performance trade-offs across latency, throughput, cost, and model quality

Deep understanding of transformer architectures, LLMs, and/or multimodal models, including their behaviour in production systems

Strong proficiency in Python and PyTorch, with a track record of building production-grade ML systems

Experience with distributed compute and training paradigms (e.g., data/model parallelism, sharding, scheduling)

Experience working close to the hardware/software boundary, such as:

GPU/accelerator optimisation (CUDA, ROCm, or similar)
Memory management and system-level performance tuning

Experience building or operating production inference or training systems at scale

Ability to design clean abstractions, APIs, and reusable systems for other engineers

Strong engineering fundamentals, with a track record of writing maintainable, well-tested, production-quality code

Preferred

Experience developing large-scale and high-load production systems.
Experience working in containerised, distributed environments (e.g., Kubernetes, large-scale clusters)
Experience contributing to or working with widely used/open-source AI frameworks or systems is strongly preferred
Hands-on experience with advanced inference optimisation techniques, such as KVCache, MoE, adaptive batching, or gradient checkpointing.
Experience developing APIs using OpenAPI 3.0+ specifications.
Knowledge of efficient training and inference evaluation strategies, with demonstrated success in improving model efficiency.

At Nscale, we are committed to fostering an inclusive, diverse, and equitable workplace. We believe that a variety of perspectives enriches our work environment, and we encourage applications from candidates of all backgrounds, experiences, and abilities. We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds. If there’s anything we can do to accommodate your specific situation, please let us know.

The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.

Specialised AI Engineer in London employer: Nscale

Nscale is an exceptional employer that champions a culture of innovation, collaboration, and accountability, making it an ideal place for Specialised AI Engineers to thrive. With a commitment to sustainable technology solutions and a focus on employee growth, Nscale offers unique opportunities to work on cutting-edge AI systems while fostering an inclusive environment where diverse perspectives are valued. Located in a dynamic tech hub, employees benefit from a supportive atmosphere that encourages professional development and the pursuit of excellence.

Contact Detail:

Nscale Recruiting Team

View Nscale Profile

StudySmarter Expert Advice 🤫

We think this is how you could land Specialised AI Engineer in London

✨Tip Number 1

Network like a pro! Reach out to folks in the AI field, especially those at Nscale. A friendly chat can open doors and give you insights that a job description just can't.

✨Tip Number 2

Show off your skills! If you've got a project or two that highlights your experience with distributed systems or inference optimisation, share them. A portfolio can speak volumes about your capabilities.

✨Tip Number 3

Prepare for the interview by diving deep into Nscale's tech stack. Understand their approach to AI and be ready to discuss how your expertise aligns with their goals. It shows you're genuinely interested!

✨Tip Number 4

Don't forget to apply through our website! It's the best way to ensure your application gets seen. Plus, it shows you're keen on being part of the Nscale team right from the start.

We think you need these skills to ace Specialised AI Engineer in London

Inference Optimisation

Large-Scale Training

Post-Training (Fine-Tuning, Alignment)

Evaluation and Benchmarking Frameworks

Distributed Systems

Performance Analysis

Python

PyTorch

GPU/Accelerator Optimisation

Memory Management

API Development

Containerised Environments (e.g., Kubernetes)

Advanced Inference Optimisation Techniques

Data Processing Workflows

Transformer Architectures

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience in AI systems, especially in areas like inference optimisation or large-scale training. We want to see how your skills align with what we’re doing at Nscale!

Show Your Passion: Let us know why you’re excited about working in AI and what drives you to push the boundaries of technology. A bit of personality goes a long way in making your application stand out!

Be Clear and Concise: When writing your application, keep it straightforward and to the point. We appreciate clarity, so make sure your achievements and experiences shine through without unnecessary fluff.

Apply Through Our Website: We encourage you to submit your application directly through our website. It’s the best way for us to receive your details and ensures you’re considered for the role you’re interested in!

How to prepare for a job interview at Nscale

✨Know Your Stuff

Make sure you brush up on the latest trends in AI and distributed systems. Be ready to discuss your hands-on experience with inference optimisation, large-scale training, or any other relevant area. Nscale values deep expertise, so show them you know your stuff inside out!

✨Showcase Your Problem-Solving Skills

Prepare to tackle some tough systems problems during the interview. Think of examples from your past work where you optimised performance or solved complex issues. Nscale is all about innovation, so demonstrate how you can think outside the box!

✨Collaboration is Key

Nscale thrives on teamwork, so be ready to discuss how you've collaborated with others in previous roles. Share specific instances where you worked closely with research, infrastructure, or product teams to build effective solutions. Highlight your adaptability and respect for diverse perspectives.

✨Ask Insightful Questions

Prepare thoughtful questions that show your interest in Nscale's mission and culture. Inquire about their approach to sustainable technology solutions or how they foster innovation within the team. This not only shows your enthusiasm but also helps you gauge if it's the right fit for you!

Specialised AI Engineer in London

Nscale

Location: London

Apply now

Specialised AI Engineer in London

At a Glance

Specialised AI Engineer in London employer: Nscale

StudySmarter Expert Advice 🤫

✨Tip Number 1

✨Tip Number 2

✨Tip Number 3

✨Tip Number 4

We think you need these skills to ace Specialised AI Engineer in London

Some tips for your application 🫡

How to prepare for a job interview at Nscale

Specialised AI Engineer in London

Land your dream job quicker with Premium