AI Researcher, Post Training
AI Researcher, Post Training

AI Researcher, Post Training

Full-Time 60000 - 80000 ÂŁ / year (est.) No home office possible
Lovable

At a Glance

  • Tasks: Own the post-training pipeline, translating research into production for AI models.
  • Company: Join Lovable, a fast-growing tech company transforming software creation.
  • Benefits: Competitive salary, flexible work environment, and opportunities for rapid career growth.
  • Other info: Dynamic team culture focused on innovation and collaboration.
  • Why this job: Make a real impact on AI products used by millions worldwide.
  • Qualifications: Experience with post-training on large language models and solid coding skills.

The predicted salary is between 60000 - 80000 ÂŁ per year.

Lovable lets over 2 million people build software using plain language, and the models behind it need to be exceptional. We're hiring an engineer who has gotten their hands dirty with post‑training at scale and wants to do it again for one of the fastest‑growing AI products in the world.

You will own our full post‑training pipeline: translating the latest research into production training recipes, adapting them for code generation and agent workloads, and putting improved models in front of users fast. The goal is to get promising research into production within days or weeks, not months. This isn't an academic research position – you'll spend as much time in production infrastructure as in training configs, and your success is measured by what ships.

Why Lovable? Lovable lets anyone and everyone build software with any language. From solopreneurs to Fortune 100 teams, millions of people use Lovable to transform raw ideas into real products – fast. We are at the forefront of a foundational shift in software creation, which means you have an unprecedented opportunity to change the way the digital world works. Over 2 million people in more than 200 countries already use Lovable to launch businesses, automate work, and bring their ideas to life. And we’re just getting started. We’re a small, talent‑dense team building a generation‑defining company from Stockholm. We value extreme ownership, high velocity, and low‑ego collaboration. We seek out people who care deeply, ship fast, and are eager to make a dent in the world.

What We’re Looking For

  • You've personally run post‑training jobs on large language models – RFT, RLVR, preference optimization, or similar – not just called APIs or written prompts, but actually trained and iterated on models.
  • You can write solid production code. The systems you build need to run reliably, not just produce interesting research artifacts.
  • You're fluent in at least one major ML framework (PyTorch, JAX) and comfortable working with distributed training setups and GPU clusters.
  • You understand the math behind preference optimization, reward modeling, and alignment techniques, and can reason about when each approach fits.
  • You've built or significantly contributed to evaluation systems that capture real‑world quality, not just benchmark scores.
  • You can trace a model‑quality regression from user‑facing symptoms back through serving, inference, and training – and you enjoy doing it.
  • You want to ship. Research taste matters, but at Lovable the question is always 'how fast can we get this to users?'.

Preferred

  • You've worked on code generation or agentic use cases specifically.
  • You've put post‑trained models into the hands of real users and seen how they hold up at scale.
  • You've owned the full loop: curating data, running training, evaluating results, deploying, and monitoring in production.
  • You have a habit of reading a paper on Monday and having a prototype running by Friday.
  • You've experimented with speculative decoding or similar techniques to improve model efficiency.
  • You have strong views on evaluation methodology and have built evals that actually predict user satisfaction.
  • You've published or contributed meaningfully to the open‑source ML ecosystem.

What You’ll Do

  • Own the full lifecycle of Lovable's post‑training pipeline – from data curation and training runs through evaluation and deployment.
  • Apply and adapt reinforcement learning, preference optimization, and supervised fine‑tuning methods to make our models better at generating code, reasoning about user intent, and acting as reliable agents.
  • Build the evaluation and experimentation infrastructure that tells us whether a model change actually helps users – covering helpfulness, safety, latency, and reliability.
  • Develop and operate the production systems that run training jobs at scale, including GPU orchestration and data pipelines.
  • Work across team boundaries with our agent, product, and infrastructure engineers to turn model gains into product improvements users can feel.
  • Investigate and resolve failures end‑to‑end – whether the root cause is in a training recipe, a data issue, or a serving regression.
  • Read papers, run experiments, and move fast: the goal to get promising research into production within days or weeks, not months.

AI Researcher, Post Training employer: Lovable

Lovable is an exceptional employer that empowers its employees to make a significant impact in the rapidly evolving field of AI. With a strong focus on collaboration, ownership, and speed, team members are encouraged to innovate and bring research to production quickly, all while working in a dynamic environment in Stockholm. The company offers ample opportunities for professional growth and development, making it an ideal place for those looking to advance their careers in a meaningful way.
Lovable

Contact Detail:

Lovable Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land AI Researcher, Post Training

✨Tip Number 1

Network like a pro! Reach out to folks in the AI and ML community, attend meetups, and engage on platforms like LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your post-training projects, experiments, and any code you've written. This is your chance to demonstrate your hands-on experience and make a lasting impression.

✨Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Be ready to discuss your past experiences with model training and deployment, and how you’ve tackled challenges in production environments.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of the Lovable team and contributing to our mission.

We think you need these skills to ace AI Researcher, Post Training

Post-Training on Large Language Models
Production Code Development
ML Frameworks (PyTorch, JAX)
Distributed Training Setups
GPU Clusters Management
Preference Optimization
Reward Modeling
Model Alignment Techniques
Evaluation Systems Development
Model Quality Regression Tracing
Data Curation
Reinforcement Learning
Supervised Fine-Tuning
Experimentation Infrastructure
End-to-End Failure Investigation

Some tips for your application 🫡

Show Your Hands-On Experience: Make sure to highlight your practical experience with post-training jobs on large language models. We want to see that you've not just dabbled in theory but have actually rolled up your sleeves and got stuck in!

Demonstrate Your Coding Skills: We’re looking for solid production code, so don’t shy away from showcasing your coding abilities. Include examples of systems you’ve built that run reliably, not just interesting research projects.

Talk About Your Evaluation Systems: If you've built evaluation systems that capture real-world quality, make sure to mention them! We care about how well models perform in the wild, so share any insights or methodologies you've developed.

Apply Through Our Website: Finally, don’t forget to apply through our website! It’s the best way for us to keep track of your application and ensure it gets the attention it deserves. We can’t wait to see what you bring to the table!

How to prepare for a job interview at Lovable

✨Know Your Models Inside Out

Make sure you can talk confidently about your experience with post-training jobs on large language models. Be ready to discuss specific techniques like RFT, RLVR, and preference optimization, and how you've applied them in real-world scenarios.

✨Showcase Your Production Code Skills

Prepare examples of solid production code you've written. Highlight how your systems have run reliably in production, not just in theory. This will demonstrate your ability to bridge the gap between research and practical application.

✨Understand Evaluation Methodologies

Be prepared to discuss how you've built evaluation systems that capture real-world quality. Share your thoughts on what makes an effective evaluation methodology and how it relates to user satisfaction.

✨Emphasise Speed and Agility

Lovable values getting research into production quickly. Share instances where you've taken a paper from concept to prototype in a short timeframe. This will show your alignment with their fast-paced environment and commitment to shipping improvements.

AI Researcher, Post Training
Lovable

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>