Machine Learning Scientist – Reasoning Systems & RL (LLMs / Agents) in London
Machine Learning Scientist – Reasoning Systems & RL (LLMs / Agents)

Machine Learning Scientist – Reasoning Systems & RL (LLMs / Agents) in London

London Full-Time 60000 - 80000 £ / year (est.) No home office possible
relationrx

At a Glance

  • Tasks: Develop advanced AI systems to solve complex biological problems and support scientific discovery.
  • Company: Join Relation, a pioneering TechBio company transforming medicine through technology.
  • Benefits: Competitive salary, inclusive culture, and opportunities for impactful work in drug discovery.
  • Other info: Collaborative environment with excellent career growth and interdisciplinary teamwork.
  • Why this job: Make a real difference in healthcare by leveraging cutting-edge machine learning and biology.
  • Qualifications: PhD or MSc in Machine Learning or related field, with strong programming skills.

The predicted salary is between 60000 - 80000 £ per year.

About Relation

Relation is a sector defining TechBio company developing transformational medicines, with technology at our core. Our ambition is to understand human biology in unprecedented ways, discovering therapies to treat some of life’s most devastating diseases. We leverage single-cell multi-omics from patient tissue, functional assays, and machine learning to drive disease understanding, from cause to cure. We are scaling rapidly and building a team of exceptional individuals to push the boundaries of drug discovery. You will work in highly interdisciplinary teams where biology, computation, and engineering come together to solve complex problems that have not been solved before. Our state-of-the-art wet and dry labs in the heart of London are designed to accelerate this integration and translate insight into impact.

We are committed to building diverse and inclusive teams. Relation is an equal opportunities employer and does not discriminate on the basis of gender, sexual orientation, marital or civil partnership status, gender reassignment, race, colour, nationality, ethnic or national origin, religion or belief, disability, or age. By joining Relation, you will help define how medicines are discovered and deliver meaningful impact for patients.

The opportunity

Join the Turing team as a Machine Learning Scientist, where you will develop advanced AI systems that help scientists reason about complex biological problems. This role focuses on building LLM-driven reasoning systems and intelligent agents, using approaches such as reinforcement learning, RLHF, symbolic reasoning, and agentic architectures. Rather than applying standard ML pipelines, you will work on training and shaping models that can reason over evidence, explore knowledge, and support scientific discovery. You will work closely with computational scientists and biologists to develop systems that integrate large-scale biomedical data, scientific literature, and experimental insights to support target discovery and disease understanding. This is an individual contributor role suited to a mid- to senior-level ML scientist who enjoys solving challenging applied research problems at the intersection of AI and biology.

Day to day, you will:

  • Design and develop agentic ML systems that can reason, plan, and interact with tools and data sources.
  • Train and refine LLM-based reasoning models using approaches such as reinforcement learning, RLHF, or other alignment techniques.
  • Develop algorithms that enable agents to explore and reason over complex scientific evidence.
  • Build systems that integrate large-scale biological data, knowledge sources, and scientific literature.
  • Collaborate closely with computational scientists, engineers, and biologists to translate scientific questions into ML systems.
  • Prototype and iterate on new approaches for reasoning, decision-making, and hypothesis generation in scientific domains.
  • Contribute to the technical direction of the team through experiments, publications, or new methodological ideas.

We are particularly interested in candidates who have previously built systems such as:

  • Training reasoning or tool-using language models using RL, RLHF, or similar approaches.
  • Developing agents that plan, explore, and interact with tools or environments.
  • Designing learning loops where models improve through feedback or interaction.
  • Building multi-step decision-making systems (e.g., scientific discovery systems, robotics policies, simulation agents, or planning systems).
  • Developing evaluation frameworks for reasoning or agentic models.
  • Applying advanced ML techniques to complex real-world domains such as science, robotics, healthcare, or autonomous systems.

Professionally, you will have:

  • A PhD or MSc with substantial experience in Machine Learning, Computer Science, or a related quantitative field.
  • Strong experience working with large language models, including training, fine-tuning, or evaluation.
  • Experience with reinforcement learning, such as policy optimisation, actor–critic methods, or RLHF-style training pipelines.
  • Hands-on experience building agentic or decision-making systems (e.g., tool-using LLMs, planning agents, or multi-agent systems).
  • Strong programming skills in Python and modern ML frameworks.
  • Experience developing applied ML systems in complex domains.

Bonus experience:

  • Experience designing evaluation frameworks for reasoning or agentic systems.
  • Experience applying ML to scientific, biomedical, or healthcare problems.
  • Experience working in interdisciplinary environments combining ML and science.
  • Publications or open-source contributions related to LLMs, reinforcement learning, agentic systems, or applied AI.

Personally, you:

  • Are comfortable working in a matrixed environment, balancing multiple stakeholders and contributing effectively across teams.
  • Take ownership of your work, proactively seek opportunities to contribute, and enable others to do their best work.
  • Communicate openly and directly, give and receive feedback constructively, and handle challenging conversations with respect.
  • Actively seek out diverse perspectives, build strong working relationships, and contribute to shared goals across teams.
  • Embrace challenges with openness and resilience, set high standards for yourself, and strive to deliver meaningful outcomes.

Working Style & Culture at Relation

At Relation, we operate in a matrixed, interdisciplinary environment, where impact is driven through collaboration across scientific, technical, and operational domains. We collaborate, and you will partner with colleagues across multiple teams and projects, contributing your expertise while aligning to shared company priorities. We work together and win together! The patient is waiting!

Recruitment Agencies

Please note that Relation does not accept unsolicited resumes from agencies. Resumes should not be forwarded to our job aliases or employees. Relation will not be liable for any fees associated with unsolicited CVs. Relation is a committed equal opportunities employer.

Machine Learning Scientist – Reasoning Systems & RL (LLMs / Agents) in London employer: relationrx

Relation is an exceptional employer that fosters a collaborative and inclusive work culture, where interdisciplinary teams unite to tackle complex biological challenges using cutting-edge technology. Located in the heart of London, our state-of-the-art labs provide an inspiring environment for innovation, while our commitment to employee growth ensures that you will have ample opportunities to develop your skills and contribute to groundbreaking advancements in drug discovery. Join us to make a meaningful impact on patient lives and be part of a diverse team that values every voice.
relationrx

Contact Detail:

relationrx Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Machine Learning Scientist – Reasoning Systems & RL (LLMs / Agents) in London

Tip Number 1

Network like a pro! Reach out to people in the industry, attend meetups, and connect with professionals on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to machine learning and AI. This is your chance to demonstrate your expertise and make a lasting impression.

Tip Number 3

Prepare for interviews by practising common questions and scenarios specific to machine learning and reasoning systems. Mock interviews with friends or mentors can help you feel more confident and ready to tackle any question thrown your way.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Relation.

We think you need these skills to ace Machine Learning Scientist – Reasoning Systems & RL (LLMs / Agents) in London

Machine Learning
Reinforcement Learning
Large Language Models (LLMs)
Symbolic Reasoning
Agentic Architectures
Algorithm Development
Data Integration
Scientific Literature Analysis
Python Programming
ML Frameworks
Decision-Making Systems
Evaluation Frameworks
Interdisciplinary Collaboration
Applied Research Problem Solving
Prototyping and Iteration

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that are most relevant to the Machine Learning Scientist role. Highlight your work with LLMs, reinforcement learning, and any interdisciplinary projects you've been part of. We want to see how you can contribute to our mission!

Craft a Compelling Cover Letter: Your cover letter is your chance to tell us why you're the perfect fit for this role. Share your passion for AI and biology, and give us examples of how you've tackled complex problems in the past. Be genuine and let your personality shine through!

Showcase Your Projects: If you've worked on any relevant projects, whether in academia or industry, make sure to include them! We love seeing practical applications of your skills, especially if they involve reasoning systems or agentic architectures. Links to GitHub or publications are a bonus!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us that you're genuinely interested in joining our team at Relation!

How to prepare for a job interview at relationrx

Know Your Stuff

Make sure you brush up on the latest advancements in machine learning, especially around LLMs and reinforcement learning. Be ready to discuss your previous projects and how they relate to the role. This shows you're not just a fan of the theory but have practical experience too!

Show Your Collaborative Spirit

Since this role involves working closely with biologists and computational scientists, be prepared to talk about your teamwork experiences. Share examples of how you've successfully collaborated across disciplines and how you can contribute to a diverse team environment.

Prepare for Technical Questions

Expect some deep dives into technical topics like training models or developing algorithms. Practice explaining complex concepts in simple terms, as you'll need to communicate effectively with non-technical team members. It’s all about making your expertise accessible!

Ask Insightful Questions

At the end of the interview, don’t shy away from asking questions that show your interest in the company’s mission and the role. Inquire about their current projects or challenges they face in integrating AI with biology. This demonstrates your enthusiasm and forward-thinking mindset!

Machine Learning Scientist – Reasoning Systems & RL (LLMs / Agents) in London
relationrx
Location: London

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>