Staff/Senior Platform Integration Engineer in London

Job Board

Companies

OLIX

Staff/Senior Platform Integration Engineer

Staff/Senior Platform Integration Engineer in London

London Full-Time 80000 - 100000 £ / year (est.) No working from home possible

Apply Now

At a Glance

Tasks: Lead the technical vision for cutting-edge AI infrastructure and distributed inference systems.
Company: Join OLIX, a pioneering tech company shaping the future of AI.
Benefits: Competitive salary, equity options, premium healthcare, and generous time off.
Other info: Enjoy a dynamic work environment with elite hardware and a supportive team culture.
Why this job: Be at the forefront of AI technology and make a significant impact on the industry.
Qualifications: Deep expertise in distributed inference and strong technical communication skills required.

The predicted salary is between 80000 - 100000 £ per year.

About OLIXAI is growing faster than any technology in history and the explosion in demand has created a massive infrastructure gap; we can no longer build chips or power stations fast enough to keep up. The industry is still leaning on a ten-year-old hardware blueprint that has reached its limit. A new paradigm that is faster and more efficient will be the biggest economic opportunity of the next century and create the most important company of the next decade. The OLIX Decode Accelerator 1 (DX-1) is the first accelerator architected specifically for decode. Rack-scale co-design of logic, data movement, packaging, optics and interconnect enables a step change in system level performance.

Role: As Staff/Senior Software Platform Integration Engineer, you will be the technical authority on how OLIX serves large models as hyperscale AI infrastructure - spanning distributed inference engines, serving-runtime integration, KV cache and memory hierarchy, and the orchestration and networking layers that make serving real. We are looking for experienced Staff, Senior and/or Principal-level engineers who have shipped distributed inference at scale and have strong opinions about how modern serving stacks - vLLM, SGLang, NVIDIA Dynamo - should be extended onto novel accelerators. You will partner closely with leadership and cross-functional engineering teams to set the technical direction for distributed inference on DX-1, define the architectural contracts the rest of the platform builds against, and make the hard technical calls across the serving stack. You bring rare depth across the full stack, the judgment to know what matters and why, and the influence to drive alignment across engineering without relying on authority.

Responsibilities:

Shape the technical vision. Partnering with leadership to set long-term technical direction across serving-engine integration (vLLM, SGLang, NVIDIA Dynamo), disaggregated prefill/decode, KV cache management (NIXL / Mooncake TE), cluster orchestration, fleet management, networking, and deployment - and own the architectural integrity of that vision across the full platform lifecycle.
Translate strategy into architecture. Work with cross-functional partners to turn long-term business direction into concrete architectural priorities, and identify where technical investments will have the highest leverage.
Set the architectural bar. Define the principles, interface contracts, and standards the organisation builds to - across scheduling, fleet operations, ingress/egress, and platform management - and ensure they hold across teams.
Make the hard calls. Own the technical decision-making across the platform stack: orchestration and scheduling architecture, fleet management systems, networking design, and deployment strategy.
Lead through influence. Drive alignment across teams without direct authority - through rigour, clarity, and the quality of your technical thinking.
Raise the technical ceiling. Mentor and stretch engineers across the organisation - not as a manager, but as a technical leader who holds the bar high and helps others reach it.

Skills & Experience:

Deep expertise in distributed inference infrastructure (vLLM, SGLang, Nvidia Dynamo) as well as associated networking (NCCL, RoCE, Infiniband) and KV cache management (NIXL, Mooncake TE) technologies, and rail optimisation to link up accelerator clusters.
Deep expertise in cluster management at hyperscale on bare-metal, custom-accelerator fleets - provisioning, scheduling, and lifecycle ownership across thousands of nodes, including safe firmware update orchestration rolled out at fleet scale without compromising production SLOs.
Track record driving technical outcomes in high-reliability production inference environments: latency and throughput SLOs, capacity and cost modelling, observability, incident management, and security at scale across fleets of accelerators.
Full lifecycle experience from early architecture through to production operations and long-tail reliability.
Outstanding technical communicator. You articulate architectural decisions clearly to engineers, managers, and senior leadership alike, and write design thinking that becomes the organisational reference point.

Compensation & Equity:

Competitive Salary, commensurate with your experience, skills, and location.
Equity & Ownership: Meaningful stock options. You’re not just joining the mission; you’re owning a piece of it.
Proximity Bonus: We value your time. To minimise your commute and maximise your life, we offer a £24k annual Living-Local Bonus if your residence is within 20 minutes of the office.

Health & Wellbeing:

Premium Healthcare: Comprehensive BUPA medical and dental cover, including Medical History Disregarded (MHD), for complete peace of mind.
Time Off: 25 days of annual leave, plus all UK bank holidays.

The Workspace & Tech:

Elite Hardware: M4 Macs come as standard, with M4 Pro upgrades for our engineering team. We will provide whatever you need to do your best work.
Optimal Environment: High-spec noise-cancelling headphones and a fully ergonomic workstation designed for deep focus.
Rapid Prototyping: Access to our high-performance 3D printing lab for work, experimentation, and personal creative projects.

Life at the Office:

Chef-prepared meals: if you need to work late.
Caffeine on Us: We’ve got you covered with a tab at our favourite local coffee shop.

Relocation & Global Mobility:

Visa Sponsorship: We hire the best in the world. We offer full UK and international visa sponsorship.
Seamless Relocation: Whether you’re moving across the country or across the globe, our dedicated relocation partner provides funding and concierge support to get you settled.

Due to U.S. export control regulations, candidates’ eligibility to work at OLIX depends on their most recent citizenship or permanent residency status. We are generally unable to consider applicants whose most recent citizenship or permanent residence is in certain restricted countries (currently including Iran, North Korea, Syria, Cuba, Russia, Belarus, China, Hong Kong, Macau, and Venezuela). Applicants who have subsequently obtained citizenship or permanent residency in another country not subject to these restrictions may still be eligible.

Staff/Senior Platform Integration Engineer in London employer: OLIX

At OLIX, we are at the forefront of a technological revolution, offering our Staff/Senior Platform Integration Engineers an unparalleled opportunity to shape the future of AI infrastructure. With a strong emphasis on employee growth, competitive compensation, and a vibrant work culture that includes chef-prepared meals and premium healthcare, we ensure our team members thrive both professionally and personally. Our commitment to innovation is matched by our dedication to providing a supportive environment where your contributions are valued and rewarded.

Contact Details:

OLIX Recruitment Team

View OLIX profile

StudySmarter Expert Advice🤫

We think this is how you could land Staff/Senior Platform Integration Engineer in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with OLIXAI employees on LinkedIn. A personal touch can make all the difference when it comes to landing that interview.

✨Tip Number 2

Show off your skills! If you’ve got a portfolio or any projects that highlight your expertise in distributed inference or platform integration, make sure to share them during your conversations. It’s a great way to demonstrate your capabilities.

✨Tip Number 3

Prepare for technical discussions! Brush up on the latest trends in AI infrastructure and be ready to discuss how you’d tackle challenges at OLIXAI. They’re looking for someone who can think critically and influence without authority.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of the OLIXAI mission.

We think you need these skills to ace Staff/Senior Platform Integration Engineer in London

Distributed Inference Infrastructure

vLLM

SGLang

NVIDIA Dynamo

Networking (NCCL, RoCE, Infiniband)

KV Cache Management (NIXL, Mooncake TE)

Cluster Management at Hyperscale

Provisioning and Scheduling

Lifecycle Ownership

Technical Decision-Making

Architectural Design

Technical Communication

Incident Management

Observability

Security at Scale

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter to highlight your experience with distributed inference infrastructure and the specific technologies mentioned in the job description. We want to see how your skills align with our needs!

Showcase Your Technical Expertise:Don’t hold back on detailing your technical skills! Whether it’s your experience with vLLM, SGLang, or NVIDIA Dynamo, we want to know how you’ve applied these in real-world scenarios. Be specific about your contributions and outcomes.

Communicate Clearly:As an outstanding technical communicator, your application should reflect that! Use clear and concise language to articulate your architectural decisions and experiences. This will help us see your thought process and how you can influence teams.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our mission at OLIX!

How to prepare for a job interview at OLIX

✨Know Your Stuff

Make sure you have a solid grasp of distributed inference infrastructure and the specific technologies mentioned in the job description, like vLLM, SGLang, and NVIDIA Dynamo. Brush up on your knowledge of KV cache management and networking technologies too, as these will likely come up during the interview.

✨Show Your Architectural Vision

Be prepared to discuss how you would translate OLIX's long-term business direction into concrete architectural priorities. Think about how you can set the architectural bar high and what principles and standards you believe are essential for the organisation.

✨Demonstrate Leadership Through Influence

Since the role requires driving alignment across teams without direct authority, think of examples from your past where you've successfully influenced others. Be ready to share how you’ve made hard technical calls and led projects through clarity and rigorous thinking.

✨Communicate Clearly

As an outstanding technical communicator, you’ll need to articulate your architectural decisions clearly. Practice explaining complex concepts in simple terms, as you’ll be talking to engineers, managers, and senior leadership alike. Prepare to showcase your design thinking as a reference point for the organisation.

Staff/Senior Platform Integration Engineer in London

OLIX

Location: London

Apply Now

Staff/Senior Platform Integration Engineer in London

At a Glance

Staff/Senior Platform Integration Engineer in London employer: OLIX

StudySmarter Expert Advice🤫

We think you need these skills to ace Staff/Senior Platform Integration Engineer in London

Some tips for your application 🫡

How to prepare for a job interview at OLIX

Company

Product

Help