Researcher, Post Training in London

Job Board

Companies

Lovable

Researcher, Post Training

Researcher, Post Training in London

London Full-Time 60000 - 80000 £ / year (est.) No working from home possible

Apply Now

At a Glance

Tasks: Own the post-training pipeline, translating research into production for AI models.
Company: Lovable, a fast-growing AI company transforming software creation.
Benefits: Competitive salary, remote work, and opportunities for rapid career growth.
Other info: Dynamic team culture focused on ownership, speed, and collaboration.
Why this job: Join a mission to revolutionise software development for millions worldwide.
Qualifications: Experience with post-training on large language models and solid coding skills.

The predicted salary is between 60000 - 80000 £ per year.

Lovable lets over 2 million people build software using plain language, and the models behind it need to be exceptional. We're hiring an engineer who has gotten their hands dirty with post‑training at scale and wants to do it again for one of the fastest‑growing AI products in the world. You'll own our full post‑training pipeline: translating the latest research into production training recipes, adapting them for code generation and agent workloads, and putting improved models in front of users fast. The goal is to get promising research into production within days or weeks, not months. This isn't an academic research position – you'll spend as much time in production infrastructure as in training configs, and your success is measured by what ships.

Why Lovable? Lovable lets anyone and everyone build software with any language. From solopreneurs to Fortune 100 teams, millions of people use Lovable to transform raw ideas into real products – fast. We are at the forefront of a foundational shift in software creation, which means you have an unprecedented opportunity to change the way the digital world works. Over 2 million people in more than 200 countries already use Lovable to launch businesses, automate work, and bring their ideas to life. And we’re just getting started. We’re a small, talent‑dense team building a generation‑defining company from Stockholm. We value extreme ownership, high velocity, and low‑ego collaboration. We seek out people who care deeply, ship fast, and are eager to make a dent in the world.

What We’re Looking For:

You've personally run post‑training jobs on large language models – RFT, RLVR, preference optimization, or similar – not just called APIs or written prompts, but actually trained and iterated on models.
You can write solid production code. The systems you build need to run reliably, not just produce interesting research artifacts.
You're fluent in at least one major ML framework (PyTorch, JAX) and comfortable working with distributed training setups and GPU clusters.
You understand the math behind preference optimization, reward modeling, and alignment techniques, and can reason about when each approach fits.
You've built or significantly contributed to evaluation systems that capture real‑world quality, not just benchmark scores.
You can trace a model‑quality regression from user‑facing symptoms back through serving, inference, and training – and you enjoy doing it.
You want to ship. Research taste matters, but at Lovable the question is always "how fast can we get this to users?"

Preferred:

You've worked on code generation or agentic use cases specifically.
You've put post‑trained models into the hands of real users and seen how they hold up at scale.
You've owned the full loop: curating data, running training, evaluating results, deploying, and monitoring in production.
You have a habit of reading a paper on Monday and having a prototype running by Friday.
You've experimented with speculative decoding or similar techniques to improve model efficiency.
You have strong views on evaluation methodology and have built evals that actually predict user satisfaction.
You've published or contributed meaningfully to the open‑source ML ecosystem.

What You’ll Do:

Own the full lifecycle of Lovable's post‑training pipeline – from data curation and training runs through evaluation and deployment.
Apply and adapt reinforcement learning, preference optimization, and supervised fine‑tuning methods to make our models better at generating code, reasoning about user intent, and acting as reliable agents.
Build the evaluation and experimentation infrastructure that tells us whether a model change actually helps users – covering helpfulness, safety, latency, and reliability.
Develop and operate the production systems that run training jobs at scale, including GPU orchestration and data pipelines.
Work across team boundaries with our agent, product, and infrastructure engineers to turn model gains into product improvements users can feel.
Investigate and resolve failures end‑to‑end – whether the root cause is in a training recipe, a data issue, or a serving regression.
Read papers, run experiments, and move fast: the goal to get promising research into production within days or weeks, not months.

Researcher, Post Training in London employer: Lovable

At Lovable, we are redefining software creation with a fast-paced, collaborative culture that empowers our team to make a significant impact on millions of users worldwide. Located in Stockholm, we offer an environment that values extreme ownership and rapid innovation, providing ample opportunities for personal and professional growth. Join us to be part of a talented team where your contributions directly influence the future of AI-driven software development.

Contact Details:

Lovable Recruitment Team

View Lovable profile

StudySmarter Expert Advice🤫

We think this is how you could land Researcher, Post Training in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio that highlights your projects, especially those related to post-training and model optimisation. This is your chance to demonstrate what you can do beyond just a CV.

✨Tip Number 3

Prepare for interviews by practising common questions and scenarios specific to the role. Think about how you can showcase your experience with ML frameworks and production systems during the chat.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Lovable.

We think you need these skills to ace Researcher, Post Training in London

Post-Training on Large Language Models

Production Code Writing

ML Frameworks (PyTorch, JAX)

Distributed Training Setups

GPU Clusters Management

Preference Optimization

Reward Modeling

Model Alignment Techniques

Evaluation Systems Development

Model Quality Regression Tracing

Data Curation

Reinforcement Learning

Supervised Fine-Tuning

Experimentation Infrastructure

Open-Source ML Contributions

Some tips for your application 🫡

Show Your Hands-On Experience:Make sure to highlight your practical experience with post-training jobs on large language models. We want to see that you've not just dabbled in theory but have actually rolled up your sleeves and got involved in the nitty-gritty of training and iterating on models.

Demonstrate Your Coding Skills:We’re looking for solid production code, so don’t shy away from showcasing your coding abilities. Include examples of systems you’ve built that run reliably, as this will show us you can deliver results that matter in a production environment.

Emphasise Your Speed and Agility:At Lovable, we value speed. If you’ve got a knack for turning research into prototypes quickly, let us know! Share instances where you’ve read a paper and had a working prototype ready in no time – it’s all about getting things to users fast.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to keep track of your application and ensure it gets the attention it deserves. Plus, it shows you’re keen to join our team!

How to prepare for a job interview at Lovable

✨Know Your Stuff

Make sure you’re well-versed in the post-training processes and techniques mentioned in the job description. Brush up on your experience with large language models, reinforcement learning, and preference optimization. Be ready to discuss specific projects where you've applied these skills.

✨Showcase Your Code

Bring examples of your production code to the interview. This isn’t just about theory; they want to see how you write solid, reliable code. If you’ve worked with ML frameworks like PyTorch or JAX, be prepared to talk about your experiences and any challenges you faced.

✨Talk About Impact

Be ready to discuss how your work has directly impacted users. They value speed and effectiveness, so share instances where you’ve taken research from concept to production quickly. Highlight any metrics or feedback that demonstrate the success of your models in real-world applications.

✨Collaborate and Communicate

Since this role involves working across teams, emphasise your collaboration skills. Share examples of how you’ve worked with product and infrastructure engineers to turn model improvements into tangible user benefits. Show that you can communicate complex ideas clearly and effectively.

Researcher, Post Training in London

Lovable

Location: London

Apply Now

Researcher, Post Training in London

At a Glance

Researcher, Post Training in London employer: Lovable

StudySmarter Expert Advice🤫

We think you need these skills to ace Researcher, Post Training in London

Some tips for your application 🫡

How to prepare for a job interview at Lovable

Company

Product

Help