At a Glance
- Tasks: Develop agentic systems for scientific hypothesis testing and create synthetic data pipelines.
- Company: LILA is focused on advancing learning systems and reasoning algorithms for large language models.
- Benefits: Enjoy a talent-dense environment with opportunities for collaboration across research and engineering teams.
- Other info: Candidates should be comfortable with long iteration cycles and ambiguous research environments.
- Why this job: Join a cutting-edge team tackling unique challenges in scientific reasoning and machine learning.
- Qualifications: Advanced degree in computer science or machine learning, with strong LLM and empirical research foundation.
The predicted salary is between 60000 - 80000 £ per year.
Your Impact at LILA
We’re building a talent-dense, high-agency research team to develop the next generation of learning systems and reasoning algorithms for agentic LLMs. Our work sits at the intersection of large language models, post-training, and scientific reasoning, with the goal of enabling systems that learn from experience, reason effectively, and improve through interaction.
Scientific domains present a distinct set of challenges that make this problem uniquely hard. Feedback is sparse and delayed — experiments take days or weeks, not milliseconds. Ground truth is expensive or contested. Distribution shift is structural, as instruments, techniques, and knowledge bases evolve continuously. The hypothesis space is vast and reward signal is thin. Existing benchmarks do not capture these nuances. The goal is to build systems that can operate effectively in this scientific regime.
This role spans a few complementary directions. Candidates are expected to bring deep expertise in one (or more) of the following areas. In the event of cross-track expertise, please select the one you align to the most. Our interview process will be catered to verifying the chosen expertise area.
- Expertise Area 1 — Agentic System Building
- Focus: Build systems that autonomously propose, execute, and verify scientific hypotheses over long time horizons.
- Create and analyze long-running auto-research systems that propose and verify hypotheses.
- Design planning frameworks for agentic systems operating over long, sparse feedback loops.
- Design memory architectures that allow agents to build and retrieve structured knowledge over time.
- Explore algorithms in recursive self-improvement, multi-agent coordination, and continual learning.
- Expertise Area 2: Distillation
- Focus: Translate strong inference-time behaviours and reasoning traces into efficient, trainable models.
- Develop distillation strategies from large or ensemble models into deployable systems.
- Research methods for self-improvement, including iterative self-distillation and critique loops.
- Investigate how to preserve generalisation and reduce catastrophic forgetting through the distillation process.
- Expertise Area 3 — Scalable Experience Generation
- Focus: Develop inference-time algorithms and synthetic data pipelines that generate high-quality training signal for scientific reasoning.
- Design and benchmark inference-time search, sampling, and verification strategies.
- Propose new techniques in synthetic environment creation and curriculum learning.
- Develop synthetic data generation strategies that capture high-quality scientific reasoning for agentic model training.
- Measure the end-to-end impact of inference-time improvements on real scientific tasks.
What You’ll Need to Succeed:
- An advanced degree in computer science, machine learning, or a related field, or comparable experience.
- Strong foundation in LLMs and empirical research.
- Experience designing and executing rigorous ML experiments, including benchmarking and ablations.
- Experience working with large-scale training or evaluation pipelines.
- Ability to define and pursue research directions in open-ended, rapidly evolving spaces.
- Strong collaboration and communication skills across research and engineering teams.
Bonus Points For:
- Experience with synthetic data generation, distillation, or self-improvement loops.
- Familiarity with reinforcement learning (e.g., RLHF, on-policy methods).
- Experience with planning, search, or decision-making systems at scale.
- Experience in building agentic systems with tool use, or multi-agent workflows.
- Background in program synthesis, coding benchmarks, or long-horizon tasks.
- Experience building evaluation frameworks or large-scale benchmarks.
Scientific Rigor & Persistence:
- You take a principled approach to experimentation, with careful baselines, ablations, and evaluation design.
- You are motivated by understanding why systems work, not just improving metrics.
- You prioritise clarity, reproducibility, and intellectual honesty in research.
- You are comfortable working through long, nonlinear iteration cycles.
- You operate effectively in ambiguous, fast-evolving research environments.
Research Scientist, Frontier Capabilities in Cambridge employer: Gravity Engineering Services Pvt Ltd.
LILA offers a dynamic work environment in the heart of tech innovation. Employees benefit from collaboration opportunities and a focus on scientific rigor. The team is dedicated to developing next-generation learning systems that push the boundaries of AI.
Contact Details:
Gravity Engineering Services Pvt Ltd. Recruitment Team