AI/ML Engineer in London

AI/ML Engineer in London

London Full-Time 60000 - 80000 € / year (est.) Home office (partial)
Improbable

At a Glance

  • Tasks: Build AI systems that revolutionise supply chain intelligence with cutting-edge technology.
  • Company: Join Kallikor, a forward-thinking tech company shaping the future of logistics.
  • Benefits: Enjoy competitive pay, flexible work options, and opportunities for professional growth.
  • Other info: Collaborate with a diverse team and mentor junior engineers while advancing your career.
  • Why this job: Make a real impact by developing innovative AI solutions in a dynamic environment.
  • Qualifications: Strong Python skills and experience with machine learning models are essential.

The predicted salary is between 60000 - 80000 € per year.

At Kallikor, we're building the future of supply chain intelligence through AI-powered simulation digital twins. We create living digital representations of real-world operations (warehouses, distribution networks, global logistics) that help organisations make better decisions faster. We're at an inflection point: moving from AI-assisted tools to domain-specific AI that understands supply chains as deeply as our best engineers do. You'll be instrumental in building our first domain-specific language model (DSLM) and the foundation for Project Genome, an ambitious initiative to capture and synthesise the world's supply chain knowledge into actionable intelligence.

This is a production engineering role first. You'll build robust Python systems that happen to train and serve LLMs, not the other way around. We need someone who writes production-quality code, debugs complex distributed systems, and thinks about reliability, who has learned ML/LLMs as powerful tools in their engineering arsenal.

You'll work across our entire AI stack: building FastAPI services that serve models, creating training pipelines that process production data, deploying inference endpoints with proper monitoring, and integrating all of this into our existing Python backend. The ML is important, but the engineering discipline is what makes it production-ready.

Your Opportunity

  • Build production AI systems: Design and implement the full stack, from FastAPI endpoints that handle requests, to training pipelines that process data, to inference services that serve predictions. You'll own the architecture, not just the model weights.
  • Train and deploy our DSLM: Fine-tune models using Unsloth/Axolotl, but more importantly, build the robust infrastructure around it - data pipelines that feed training, evaluation frameworks that catch regressions, deployment systems that handle failover. Make it production-grade.
  • Integrate ML into our backend: We use FastAPI, PydanticAI, FastMCP, Memgraph. You'll extend these systems with ML capabilities, not as a separate "ML service" but as a natural part of our backend architecture. Clean abstractions, proper error handling, observability.
  • Own inference performance: Get models running fast, whether that's vLLM deployment, quantization strategies, batching optimizations, or caching. Hit our <200ms>
  • Shape Project Genome's foundation: Work with our Principal Engineer to architect how we ingest, process, and learn from global supply chain data. This is systems design as much as ML with data pipelines, graph databases, incremental learning strategies being just as important.
  • Mentor through code review and pairing: Raise the bar on code quality, testing, and production practices across the team. Teach mid and junior engineers how to build ML systems that don't fall over.

Why you're made for this

  • You're a strong production Python engineer: You write clean, maintainable, tested code. You understand async/await, know when to use generators vs lists, can profile performance bottlenecks. You've built FastAPI services (or similar) that handle production traffic. Your code passes review without drama.
  • You've built with LLMs in production: You've integrated GPT-4/Claude into real applications, handled streaming responses, dealt with rate limits and retries, cached intelligently. You know the practical challenges: prompt engineering, context management, error handling, cost control.
  • You've trained or fine-tuned models: Whether it's fine-tuning LLMs, training classifiers, or running experiments, you understand the workflow. You've dealt with training data quality, evaluation metrics, and overfitting. You can debug why a model isn't learning what you expected.
  • You think like a systems engineer: You design for failure, add instrumentation, consider edge cases. You know that "the model works on my laptop" isn't shipping. You care about monitoring, logging, alerting, and graceful degradation.
  • You can navigate the ML landscape pragmatically: You know enough about transformers, attention mechanisms, and training dynamics to make informed decisions. But you're not precious about it. If a simple heuristic beats a complex model, you ship the heuristic.
  • You balance velocity with quality: You ship incrementally and iterate based on production data. But you don't accumulate tech debt, you refactor proactively, write tests that matter, and leave the codebase better than you found it.
  • You communicate trade-offs clearly: You can explain to the team why we're choosing LoRA over full fine-tuning, why we're deploying on Fireworks instead of self-hosting, or why a 7B model might beat a 70B model. You help everyone make informed decisions.

What we're looking for specifically

  • Must have:
  • 5+ years building production Python systems (backend services, APIs, data processing)
  • Strong software engineering fundamentals: design patterns, testing, debugging, profiling
  • Experience integrating LLMs into applications (OpenAI/Anthropic APIs, prompt engineering, streaming, PydanticAI)
  • Understanding of ML training workflows (even if you're not an expert. You need to know enough to build the infrastructure)
  • Docker, CI/CD, production deployment experience
  • Can read and understand PyTorch code (you don't need to write novel architectures)
  • Nice to have:
  • Fine-tuning experience (LoRA, full fine-tuning, QLoRA)
  • Distributed training basics (DeepSpeed, FSDP)
  • Graph databases (Memgraph, Neo4j)
  • Supply chain or logistics domain knowledge
  • Experience with agent frameworks (LangChain, PydanticAI, etc.)

What you'll work with

  • Backend Stack: Python, FastAPI, PydanticAI, FastMCP, Memgraph, Postgres
  • ML Stack: PyTorch, Unsloth/Axolotl for training, vLLM for inference, Weights & Biases
  • Models: Qwen 2.5, Llama 3.1, GPT-4, Claude (for now)
  • Infrastructure: AWS (flexible), Docker, Kubernetes, GPUs when needed
  • Team: Principal Engineer (your partner on architecture), Mid Data/ML Engineer (your data pipeline partner), Junior AI Engineer (your mentee)

Example projects you'll own

  • Build a FastAPI service that handles streaming LLM responses with correct error handling and retry logic
  • Create a training pipeline that processes production logs, validates data quality, and triggers fine-tuning runs
  • Deploy a fine-tuned 7B model with vLLM that beats GPT-4 latency while maintaining quality on our domain
  • Design the data ingestion architecture for Project Genome, how we process papers, documentation, and operational data at scale
  • Implement evaluation frameworks that catch model regressions before they reach production

About Us

Kallikor is determined to foster an environment where people can do their best work and feel like they belong. We believe a healthy culture, strong values and contribution from a diverse range of individuals will help us to achieve success. We do not discriminate based on race, ethnicity, gender, ancestry, national origin, religion, sex, sexual orientation, gender identity, age, disability, veteran status, genetic information, marital status or any other legally protected status.

AI/ML Engineer in London employer: Improbable

At Kallikor, we pride ourselves on being an exceptional employer that champions innovation and collaboration in the field of AI and supply chain intelligence. Our vibrant work culture encourages continuous learning and mentorship, providing ample opportunities for professional growth while working alongside industry experts. Located in a dynamic environment, we offer competitive benefits and a commitment to diversity, ensuring every team member feels valued and empowered to contribute to our ambitious projects.

Improbable

Contact Detail:

Improbable Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI/ML Engineer in London

Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with Kallikor employees on LinkedIn. A friendly chat can sometimes lead to opportunities that aren’t even advertised!

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving Python and ML. Share it during interviews or on your LinkedIn profile to grab attention.

Tip Number 3

Prepare for technical interviews by practicing coding challenges and system design questions. Use platforms like LeetCode or HackerRank to sharpen your skills and get comfortable with problem-solving under pressure.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, you’ll be one step closer to joining us at Kallikor and making an impact in supply chain intelligence!

We think you need these skills to ace AI/ML Engineer in London

Production Python Engineering
FastAPI
LLM Integration
Prompt Engineering
Data Processing
ML Training Workflows
Docker

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter for the AI/ML Engineer role. Highlight your experience with Python systems, LLMs, and any relevant projects that showcase your skills in production engineering.

Showcase Your Projects:Include specific examples of your work with FastAPI, data pipelines, or ML models. We love seeing real-world applications of your skills, so don’t hold back on the details!

Be Clear and Concise:When writing your application, keep it straightforward. Use clear language to explain your experience and how it relates to the job. We appreciate a well-structured application that gets to the point.

Apply Through Our Website:We encourage you to submit your application directly through our website. It’s the best way to ensure we see your application and can get back to you quickly!

How to prepare for a job interview at Improbable

Know Your Tech Stack

Familiarise yourself with the specific technologies mentioned in the job description, like FastAPI, PydanticAI, and PyTorch. Be ready to discuss how you've used these tools in your previous projects, especially in building production-quality Python systems.

Showcase Your Problem-Solving Skills

Prepare to discuss real-world challenges you've faced while integrating LLMs into applications. Highlight your experience with prompt engineering, error handling, and performance optimisation. Use examples that demonstrate your ability to think like a systems engineer.

Demonstrate Your Understanding of ML Workflows

Be ready to explain the ML training workflows you’ve worked with, even if you're not an expert. Discuss how you’ve handled data quality, evaluation metrics, and overfitting in your past projects. This shows you understand the infrastructure needed for successful ML deployment.

Communicate Clearly About Trade-offs

Practice explaining technical decisions in simple terms. For instance, why you might choose LoRA over full fine-tuning or the benefits of deploying on Fireworks. Clear communication will show that you can help the team make informed decisions and navigate the complexities of ML.