Senior / Staff Software Engineer (AI / Compiler)

London | Full-Time | £104,000 - £186,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Design and build high-performance systems for AI workloads across distributed clusters.
  • Company: Flux is revolutionising AI with cutting-edge Optical Tensor Processing Units in London.
  • Benefits: Competitive salary, stock options, and a £24k/year incentive for living within 20 minutes of the office.
  • Why this job: Join a fast-paced environment that values innovation and bold thinking in AI technology.
  • Qualifications: 5+ years in HPC or AI infrastructure, strong C++ and Python skills required.
  • Other info: Work in a vibrant office in Kings Cross, London, at the heart of the AI hub.

The predicted salary is between £104,000 and £186,000 per year.

Company Overview

Flux is pioneering a new class of AI accelerators called Optical Tensor Processing Units (OTPUs). We’ve already developed functioning prototypes and are now scaling our operations in London. Our work environment rewards innovation, speed, and bold thinking.

The role

We’re hiring Senior and Staff Software Engineers to build the high-performance computing infrastructure that powers our Optical Tensor Processing Units (OTPUs). This isn’t just about scaling models—it’s about rethinking how AI workloads are executed at speed and scale.

You’ll lead the design and implementation of software systems that run distributed, low-latency inference across clusters. You’ll work closely with hardware and ML teams to optimise every layer of the stack—from model representation and execution to data movement and scheduling. Whether it’s through compiler techniques, systems-level tuning, or custom runtime design, you’ll play a critical role in shaping the performance layer of our AI platform. This is a role for engineers who think in microseconds, not just model accuracy. If you’ve worked in HFT, large-scale scientific compute, or AI infrastructure at serious scale, we’d love to talk.

Responsibilities

  • Design and build high-performance systems for running AI/ML workloads across distributed compute clusters
  • Optimise for ultra-low latency and real-time inference at scale—profiling, tuning, and rewriting critical systems as needed
  • Identify and resolve performance bottlenecks across the stack, from model execution and scheduling to hardware-level constraints
  • Collaborate with compiler engineers to improve code generation, execution paths, and memory layouts using tools like LLVM or MLIR
  • Work with hardware teams to ensure the software stack fully leverages the capabilities of our OTPU architecture
  • Extend ML frameworks (e.g. PyTorch, ONNX, OpenXLA) to better support performance-critical inference paths
  • Lead design reviews, mentor engineers, and promote best practices in HPC and performance engineering
  • Stay on the frontier of new developments in AI infrastructure, compute systems, and compiler tooling

Skills & Experience

  • 5+ years of experience building performance-critical systems in HPC, HFT, large-scale simulation, or AI infrastructure
  • Deep understanding of distributed systems, with a focus on real-time or near real-time data processing
  • Strong programming skills in C++ and Python, especially for performance-sensitive applications
  • Hands-on experience with ML compilers (e.g. LLVM, MLIR), and knowledge of runtime and scheduling optimisations
  • Practical knowledge of ML frameworks like PyTorch, ONNX, or OpenXLA, and how to optimise their execution
  • Experience scaling AI workloads across clusters or custom infrastructure—not just deploying on standard cloud setups
  • Strong debugging, profiling, and performance-tuning skills across the stack
  • Degree in Computer Science, Engineering, Mathematics, or a related field

Details

  • Competitive salary starting from £145k, depending on experience.
  • Stock options in a rapidly growing AI company.
  • Based in our new 5,000 sq. ft. office in the AI hub of Kings Cross, London.
  • Flux hires candidates within a 45-minute commute of our office—offering an extra £24k/year incentive if you choose to live within 20 minutes.

Contact Details:

Flux Computing Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Senior / Staff Software Engineer (AI / Compiler) role

✨Tip Number 1

Familiarise yourself with the latest advancements in AI infrastructure and compiler technologies. Being well-versed in tools like LLVM or MLIR will not only help you understand the role better but also demonstrate your commitment to staying at the forefront of the field.

✨Tip Number 2

Network with professionals in the AI and HPC communities. Attend relevant meetups, webinars, or conferences to connect with potential colleagues and learn more about the challenges they face. This can give you valuable insights that you can discuss during interviews.

✨Tip Number 3

Prepare to discuss specific examples of how you've optimised performance in previous roles. Be ready to explain your thought process and the impact of your work on system efficiency, as this aligns closely with the responsibilities of the position.

✨Tip Number 4

Showcase your collaborative skills by highlighting experiences where you've worked closely with hardware teams or contributed to cross-functional projects. This is crucial for the role, as collaboration is key to optimising the software stack for the OTPU architecture.

We think you need these skills to ace the Senior / Staff Software Engineer (AI / Compiler) interview

High-Performance Computing (HPC)
Distributed Systems
Real-Time Data Processing
C++ Programming
Python Programming
ML Compilers (LLVM, MLIR)
Performance Tuning
Debugging Skills
Profiling Techniques
AI/ML Frameworks (PyTorch, ONNX, OpenXLA)
Cluster Scaling
Custom Infrastructure Development
Systems-Level Optimisation
Mentoring and Leadership

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience in building performance-critical systems, especially in HPC, HFT, or AI infrastructure. Emphasise your programming skills in C++ and Python, and any hands-on experience with ML compilers like LLVM or MLIR.

Craft a Compelling Cover Letter: In your cover letter, express your passion for AI and high-performance computing. Discuss specific projects where you've optimised systems for low-latency inference and how your skills align with the responsibilities outlined in the job description.

Showcase Relevant Projects: Include a section in your application that showcases relevant projects or experiences. Highlight any work you've done with distributed systems, ML frameworks, or performance tuning, and explain the impact of your contributions.

Prepare for Technical Questions: Anticipate technical questions related to distributed systems, compiler techniques, and performance optimisation. Be ready to discuss your problem-solving approach and provide examples from your past work that demonstrate your expertise.

How to prepare for a job interview at Flux Computing

✨Showcase Your Technical Expertise

Be prepared to discuss your experience with performance-critical systems, especially in HPC or AI infrastructure. Highlight specific projects where you've optimised for low latency and real-time inference, as this aligns closely with what the company is looking for.

✨Demonstrate Collaboration Skills

Since the role involves working closely with hardware and ML teams, be ready to share examples of how you've successfully collaborated in cross-functional teams. Discuss any experiences where you’ve led design reviews or mentored other engineers.

✨Familiarise Yourself with Relevant Tools

Brush up on your knowledge of ML compilers like LLVM or MLIR, and be ready to discuss how you've used these tools in past projects. Understanding how to extend ML frameworks such as PyTorch or ONNX will also be beneficial.

✨Prepare for Problem-Solving Questions

Expect technical questions that assess your debugging, profiling, and performance-tuning skills. Practice explaining your thought process when identifying and resolving performance bottlenecks, as this will demonstrate your analytical abilities.

Company: Flux Computing (size listed as 50-100)