Research Scientist - Post-Training
Research Scientist - Post-Training

Research Scientist - Post-Training

Full-Time 65000 - 75000 £ / year (est.) No home office possible
A

At a Glance

  • Tasks: Join our Post-Training Team to enhance AI systems using cutting-edge machine learning techniques.
  • Company: AISI focuses on optimizing AI performance across various risk domains with a talented team.
  • Benefits: Competitive salaries, mentorship from industry leaders, and opportunities for growth in a dynamic environment.
  • Why this job: Work on impactful AI projects while collaborating with top researchers from renowned organizations.
  • Qualifications: Ideal candidates have experience in machine learning research or engineering, especially with LLMs.
  • Other info: Positions available for all seniority levels; salary ranges from £65,000 to £145,000.

The predicted salary is between 65000 - 75000 £ per year.

About the Team

The Post-Training Team is dedicated to optimising AI systems to achieve state-of-the-art performance across the various risk domains that AISI focuses on. This is accomplished through a combination of scaffolding, prompting, supervised and RL fine-tuning of the AI models which AISI has access to.

One of the main focuses of our evaluation teams is estimating how new models might affect the capabilities of AI systems in specific domains. To improve confidence in our assessments, we make significant effort to enhance the model’s performance in the domains of interest.

For many of our evaluations, this means taking a model we have been given access to and embedding it as part of a wider AI system-for example, in our cybersecurity evaluations, we provide models with access to tools for interacting with the underlying operating system and repeatedly call models to act in such environment. In our evaluations which do not require agentic capabilities, we may use elicitation techniques like fine-tuning and prompt engineering to ensure assessing the model at its full capacity.

About the Role

As a member of this team, you will use cutting-edge machine learning techniques to improve model performance in our domains of interest. The work is split into two sub-teams: Agents and Finetuning. Our Agents sub-team focuses on developing the LLM tools and scaffolding to create highly capable LLM-based agents, while our Finetuning Team builds out finetuning pipelines to improve models on our domains of interest.

The Post-Training Team is seeking strong Research Scientists to join the team. The priorities of the team include both research-oriented tasks-such as designing new techniques for scaling inference-time computation or developing methodologies for in-depth analysis of agent behaviour-and engineering-oriented tasks-like implementing new tools for our LLM agents or creating pipelines for supporting and fine-tuning large open-source models. We recognise that some technical staff may prefer to span or alternate between engineering and research responsibilities, and this versatility is something we actively look for in our hires.

You’ll receive mentorship and coaching from your manager and the technical leads on your team, and regularly interact with world-class researchers and other exceptional staff, including alumni from Anthropic, DeepMind and OpenAI.

In addition to junior roles, we offer Senior, Staff, and Principal Research Engineer positions for candidates with the requisite seniority and experience.

Person Specification

You may be a good fit if you have some of the following skills, experience and attitudes:

  • Experience conducting empirical machine learning research (e.g. PhD in a technical field and/or papers at top ML conferences), particularly on LLMs.
  • Experience with machine learning engineering, or extensive experience as a software engineer with a strong demonstration of relevant skills/knowledge in the machine learning.
  • An ability to work autonomously and in a self-directed way with high agency, thriving in a constantly changing environment and a steadily growing team, while figuring out the best and most efficient ways to solve a particular problem.

Particularly strong candidates also have the following experience:

  • Building LLM agents in industry or open-source collectives, particularly in areas adjacent to the main interests of one of our workstreams e.g. in-IDE coding assistants, research assistants etc. (for our Agents subteam)
  • Leading research on improving and measuring the capabilities of LLM agents (for our Agents sub-team)
  • Building pipelines for fine-tuning (or pretraining LLMs). Finetuning with RL techniques is particularly relevant (for our finetuning subteam).
  • Finetuning or pretraining LLMs in a research context, particularly to achieve increased performance in specific domains (for our finetuning subteam).

Salary & Benefits

We are hiring individuals at all ranges of seniority and experience within the research unit, and this advert allows you to apply for any of the roles within this range. We will discuss and calibrate with you as part of the process. The full range of salaries available is as follows:

  • L3: £65,000 – £75,000
  • L4: £85,000 – £95,000
  • L5: £105,000 – £115,000
  • L6: £125,000 – £135,000
  • L7: £145,000

There are a range of pension options available which can be found through the Civil Service website.

Selection Process

In accordance with the Civil Service Commission rules, the following list contains all selection criteria for the interview process.

Required Experience

We select based on skills and experience regarding the following areas:

  • Research problem selection
  • Research Engineering
  • Writing code efficiently
  • Python
  • Frontier model architecture knowledge
  • Frontier model training knowledge
  • Model evaluations knowledge
  • AI safety research knowledge
  • Written communication
  • Verbal communication
  • Teamwork
  • Interpersonal skills
  • Tackle challenging problems
  • Learn through coaching

Desired Experience

We additionally may factor in experience with any of the areas that our work-streams specialise in:

  • Cyber security
  • Chemistry or Biology
  • Safeguards
  • Safety Cases
  • Societal Impacts

Research Scientist - Post-Training employer: AI Safety Institute

At AISI, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration among top-tier researchers and engineers. Our Post-Training Team is dedicated to pushing the boundaries of AI technology, providing employees with unparalleled opportunities for professional growth and mentorship from industry leaders. With competitive salaries and a commitment to employee well-being, AISI is the ideal place for those looking to make a meaningful impact in the field of machine learning.
A

Contact Detail:

AI Safety Institute Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Research Scientist - Post-Training

✨Tip Number 1

Familiarize yourself with the latest advancements in machine learning, especially in LLMs. This will not only help you understand the technical requirements of the role but also allow you to engage in meaningful conversations during interviews.

✨Tip Number 2

Showcase your ability to work autonomously by discussing past projects where you took initiative. Highlight any experience you have in rapidly changing environments, as this is a key aspect of the role.

✨Tip Number 3

If you have experience with fine-tuning or pretraining LLMs, be prepared to discuss specific techniques you've used. This will demonstrate your hands-on expertise and align with the team's focus on improving model performance.

✨Tip Number 4

Network with professionals in the AI and machine learning community, especially those who have worked at companies like Anthropic, DeepMind, or OpenAI. This can provide valuable insights and potentially lead to referrals.

We think you need these skills to ace Research Scientist - Post-Training

Empirical Machine Learning Research
PhD in a Technical Field
Machine Learning Engineering
Software Engineering
Autonomous Work
Problem-Solving Skills
LLM Development
Research on LLM Capabilities
Pipeline Building for Fine-Tuning
Reinforcement Learning Techniques
Pretraining LLMs
Model Evaluation Knowledge
AI Safety Research
Written Communication
Verbal Communication
Teamwork
Interpersonal Skills

Some tips for your application 🫡

Highlight Relevant Experience: Make sure to emphasize your experience in empirical machine learning research, especially if you have a PhD or publications in top ML conferences. Mention any specific projects related to LLMs that showcase your expertise.

Showcase Technical Skills: Detail your machine learning engineering skills and software development experience. Include specific programming languages, tools, and frameworks you are proficient in, particularly Python, as it is crucial for this role.

Demonstrate Problem-Solving Abilities: Provide examples of how you've tackled complex problems autonomously. Highlight your ability to adapt in a fast-paced environment and your approach to finding efficient solutions.

Communicate Clearly: Ensure your application is well-structured and free of jargon. Use clear and concise language to convey your ideas, as strong written communication skills are essential for this position.

How to prepare for a job interview at AI Safety Institute

✨Showcase Your Research Experience

Be prepared to discuss your previous research projects, especially those related to machine learning and LLMs. Highlight any publications or presentations at top ML conferences, as this will demonstrate your expertise and commitment to the field.

✨Demonstrate Technical Proficiency

Make sure to brush up on your coding skills, particularly in Python. Be ready to solve coding problems during the interview and explain your thought process clearly. Familiarity with frontier model architecture and training will also be beneficial.

✨Emphasize Problem-Solving Skills

Prepare examples of how you've tackled challenging problems in your past work. Discuss your approach to research problem selection and how you adapt to changing environments, showcasing your ability to work autonomously and efficiently.

✨Engage in Team Dynamics

Since teamwork is crucial for this role, be ready to share experiences where you've collaborated effectively with others. Highlight your interpersonal skills and how you've learned from coaching or mentoring relationships in the past.

Research Scientist - Post-Training
AI Safety Institute
A
  • Research Scientist - Post-Training

    Full-Time
    65000 - 75000 £ / year (est.)

    Application deadline: 2027-03-20

  • A

    AI Safety Institute

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>