Research Engineer/Research Scientist – Model Transparency

Full-Time · £65,000 – £145,000 / year (est.) · Home office (partial)
AI Security Institute

At a Glance

  • Tasks: Join us in researching AI safety and transparency, shaping the future of advanced AI.
  • Company: Be part of the AI Security Institute, a leader in AI risk management.
  • Benefits: Enjoy competitive salary, generous leave, and professional development opportunities.
  • Other info: Flexible working options and a vibrant office environment in central London.
  • Why this job: Make a real impact on AI governance while working with top experts in the field.
  • Qualifications: Experience in AI safety or related fields; strong research or engineering skills required.

The predicted salary is between £65,000 and £145,000 per year.

About the AI Security Institute

The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We’re in the heart of the UK government with direct lines to No. 10 (the Prime Minister's office), and we work with frontier developers and governments globally. We’re here because governments are critical to making advanced AI go well, and UK AISI is uniquely positioned to mobilise them. With our resources, unique agility and international influence, this is the best place to shape both AI development and government action.

Deadline for applying: Sunday 24th May 2026, end of day

Team Description

The ability to effectively evaluate and monitor AI systems will grow in importance as models become more capable, autonomous, and integrated into society. If models can detect and game evaluations, obscure their reasoning, or behave differently under observation, the safety claims that governments and developers rely on become unreliable. Understanding and addressing these risks is essential to ensuring that oversight of advanced AI systems keeps pace with their capabilities.

The Model Transparency team is a research team within AISI focused on ensuring that evaluations, assessments, and monitoring of frontier AI systems remain reliable as models become less transparent. We research how and why oversight is becoming less reliable – through phenomena such as evaluation awareness, unfaithful chain-of-thought reasoning, and changes in model architectures – and develop methods (both white-box and black-box) to detect, measure, and mitigate these issues. We share our findings with frontier AI companies (including Anthropic, OpenAI, and DeepMind), UK government officials, allied governments, and publicly, to inform deployment, research, and policy decisions. We also work directly with safety teams at frontier labs, contributing to safety case reviews and helping improve their alignment evaluation methodology.

We’re looking for Research Scientists and Research Engineers for the Model Transparency team with expertise in technical AI safety – such as interpretability, capability or alignment evaluations, or model transparency – or with broader experience in frontier LLM research and development. An ideal candidate has a strong track record of high-quality research in technical AI safety or adjacent fields.

Research Scientists drive the technical substance of our work – staying abreast of the literature, proposing and designing experiments, conducting rigorous analyses, and owning the evidence stack from experiment through to written output. They write, critique, and strengthen the team's reports and publications.

Research Engineers build the systems and tooling that make our research possible and fast – scaling experimental workflows, automating processes, solving infrastructure challenges, and creating systems that accelerate the entire team's output.

We're interested in candidates along the spectrum between Research Engineers and Research Scientists. The application form will ask you to indicate which role you lean towards. The team is led by Joseph Bloom, advised by Geoffrey Irving. You'll work with talented, mission-driven technical staff across AISI, including alumni from Anthropic, OpenAI, DeepMind, and top universities. You may also collaborate with external research teams including those at frontier AI labs, METR, and FAR. We are open to hires across a range of experience levels.

Representative Projects You Might Work On

  • Developing a chain-of-thought monitorability benchmark and comparing monitorability properties across frontier AI systems, leveraging AISI’s unique access to reasoning traces from multiple labs.
  • Designing and running experiments on open-weight models to study alignment and oversight-relevant phenomena – such as reproducing emergent misalignment from reward hacking, or red‑teaming techniques like inoculation prompting and character training.
  • Using white‑box and interpretability methods – such as activation oracles, sparse auto‑encoders or probes – to detect misalignment that isn’t visible through behavioural evaluation alone (a minimal probe sketch follows this list).
  • Building tooling and infrastructure for our research – including agent orchestration, large‑scale RL pipelines, mechanistic interpretability methodologies, and auditing agents.
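
To give a concrete flavour of the white-box direction above, here is a minimal sketch of training a linear probe on model activations. It is illustrative only: the activations are synthesised rather than hooked from a real model, and none of the names refer to AISI tooling.

    import torch
    import torch.nn as nn

    # Stand-in for cached residual-stream activations: in practice these
    # would be extracted by hooking a model layer; here we synthesise them.
    torch.manual_seed(0)
    d_model, n = 512, 1000
    acts = torch.randn(n, d_model)             # [n_examples, d_model]
    direction = torch.randn(d_model)           # hidden "behaviour" direction
    labels = (acts @ direction > 0).float()    # 1 = behaviour present

    probe = nn.Linear(d_model, 1)              # linear probe on activations
    opt = torch.optim.Adam(probe.parameters(), lr=1e-2)
    loss_fn = nn.BCEWithLogitsLoss()

    for _ in range(300):                       # fit the probe
        opt.zero_grad()
        loss = loss_fn(probe(acts).squeeze(-1), labels)
        loss.backward()
        opt.step()

    with torch.no_grad():                      # check it recovered the direction
        preds = (probe(acts).squeeze(-1) > 0).float()
        print(f"train accuracy: {(preds == labels).float().mean():.2f}")

In real use, the interesting question is whether such a probe fires on held-out behaviour that behavioural evaluations miss; the sketch only shows the mechanics.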

The work could also involve:

  • Reviewing frontier lab risk assessments and safety cases, providing independent analysis of alignment claims before deployment decisions.
  • Conducting literature reviews and expert interviews to map the state of model transparency risks and inform AISI’s strategic priorities.
  • Translating technical findings into actionable insights for AISI evaluation teams, UK government officials, and international partners.

What we’re looking for

If you’re unsure whether you meet the criteria below, we’d encourage you to apply anyway – we’d rather you err on the side of applying than not.

Requirements for both roles:

  • A get-things-done mindset – you take ownership, move fast, and care about shipping work that matters.
  • A combination of self-sufficiency and enthusiasm for teamwork – you’re equally happy defining your own agenda and contributing to shared goals.
  • You’re excited about growing, giving and receiving feedback, and building something together.
  • An ability to build, supervise and orchestrate AI agents to complete tasks effectively, while verifying and maintaining the quality of their work.
  • A demonstrated track record of relevant, high-quality work – whether technical publications, blog posts, or other publicly visible contributions.

Research Scientists – our requirements are:

  • Hands‑on research experience with large language models (LLMs) – such as evaluating or fine‑tuning models, developing and testing monitors, or auditing models with white-box or black-box techniques.
  • Ability and experience in writing research code for machine learning experiments, including experience with ML frameworks like PyTorch or evaluation frameworks like Inspect (see the sketch after this list).
  • An ability to write high‑quality, concise research proposals that are well‑motivated, tractable, and coherent.
  • Good research taste – an ability to identify what’s important, choose productive directions, and avoid getting lost in dead ends.
  • An ability to read research critically, identify flawed arguments, and poke holes in safety claims.
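
For reference, much evaluation work in this space builds on AISI's open-source Inspect framework (inspect_ai). The sketch below is a toy task based on its public API – the task name and sample are invented for illustration, not an actual AISI evaluation.

    # Toy Inspect (inspect_ai) task; run with:
    #   inspect eval this_file.py --model <provider/model>
    from inspect_ai import Task, task
    from inspect_ai.dataset import Sample
    from inspect_ai.scorer import match
    from inspect_ai.solver import generate

    @task
    def toy_eval() -> Task:
        # A single hand-written sample; a real evaluation would load a
        # dataset of prompts probing the behaviour under study.
        return Task(
            dataset=[Sample(input="What is 17 * 23?", target="391")],
            solver=generate(),   # one model turn
            scorer=match(),      # exact-match scoring against the target
        )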

We don’t expect RS candidates to meet all of the following, but they are useful signals:

  • Experience designing and running alignment evaluations or working on model transparency research.
  • Experience with interpretability or white-box methods – such as mechanistic interpretability, sparse autoencoders, probing, or activation analysis.
  • Familiarity with alignment literature, current methods for post‑training and aligning LLMs, and the current state of the field.
  • Prior mentorship or training within technical AI safety – such as through the MATS program or similar.

Research Engineers – our requirements are:

  • Strong software engineering skills and experience building systems that support ML research – infrastructure, pipelines, tooling, or experimental platforms.
  • Ability and experience writing production‑quality code in Python and familiarity with ML frameworks like PyTorch.
  • Experience working with LLMs at scale in some capacity – fine‑tuning, deploying, evaluating, or building scaffolds around them (a minimal scaffold sketch follows this list).
  • An understanding of the needs of research scientists, and experience working within and supporting a research team or building tools to support research.
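
As a rough illustration of what “building scaffolds” can mean, here is a minimal agent-loop sketch. It uses the OpenAI Python client purely for concreteness; the stopping convention and model name are assumptions for the example, not AISI infrastructure.

    # Minimal agent-scaffold sketch (illustrative only).
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    def run_agent(goal: str, max_turns: int = 5) -> str:
        messages = [
            {"role": "system",
             "content": "Work step by step. Reply 'DONE: <answer>' when finished."},
            {"role": "user", "content": goal},
        ]
        reply = ""
        for _ in range(max_turns):
            reply = client.chat.completions.create(
                model="gpt-4o-mini",  # placeholder model name
                messages=messages,
            ).choices[0].message.content
            if reply.startswith("DONE:"):
                return reply.removeprefix("DONE:").strip()
            # Feed the intermediate step back and ask the model to continue.
            messages.append({"role": "assistant", "content": reply})
            messages.append({"role": "user", "content": "Continue."})
        return reply  # best effort after max_turns

Production scaffolds add the pieces this sketch omits – tool calls, sandboxing, retries, logging, and verification of the agent's output.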

We don’t expect RE candidates to meet all of the following, but they are useful signals:

  • A track record of scaling AI automation – getting agents to do useful work, building orchestration systems, or accelerating research workflows with AI tooling.
  • Experience working with very large models (~100B+) at scale, including post‑training (RL, RLHF, DPO), fine‑tuning pipelines, or distributed interpretability work on models that don’t fit into memory.
  • Experience with mechanistic interpretability tooling or white-box analysis infrastructure at scale.
  • Strong open‑source contributions, particularly related to LLMs or AI safety.
  • Proficient usage of LLM coding tools and agents.

What We Offer

  • Impact you could not have elsewhere.
  • Incredibly talented, mission-driven and supportive colleagues.
  • Direct influence on how frontier AI is governed and deployed globally.
  • Work with the Prime Minister’s AI Advisor and leading AI companies.
  • Opportunity to shape the first and best-resourced public-interest research team focused on AI security.

Resources & Access

  • Pre‑release access to multiple frontier models and ample compute.
  • Extensive operational support so you can focus on research and ship quickly.
  • Work with experts across national security, policy, AI research and adjacent sciences.
  • If you’re talented and driven, you’ll own important problems early.
  • 5 days off and annual stipends for learning and development, plus funding for conferences and external collaborations.
  • Freedom to pursue research bets without product pressure.
  • Opportunities to publish and collaborate externally.

Life & Family

  • Modern central London office (cafes, food court, gym), or where applicable, option to work in similar government offices in Birmingham, Cardiff, Darlington, Edinburgh, Salford or Bristol.
  • Hybrid working, flexibility for occasional remote work abroad and stipends for work‑from‑home equipment.
  • At least 25 days’ annual leave, 8 public holidays, extra team-wide breaks and 3 days off for volunteering.
  • Generous paid parental leave (36 weeks of UK statutory leave shared between parents + 3 extra paid weeks + option for additional unpaid time).
  • On top of your salary, we contribute 28.97% of your base salary to your pension.
  • Discounts and benefits for cycling to work, donations and retail/gyms.

*These benefits apply to direct employees. Benefits may differ for individuals joining through other employment arrangements such as secondments. Annual salary is benchmarked to role scope and relevant experience. Most offers land between £65,000 and £145,000, made up of a base salary plus a technical allowance (take‑home salary = base + technical allowance). An additional 28.97% employer pension contribution is paid on the base salary. This role sits outside the DDaT pay framework because it requires in-depth technical expertise in frontier AI safety, robustness and advanced AI architectures.

Research Engineer/Research Scientist – Model Transparency employer: AI Security Institute

The AI Security Institute is an exceptional employer, offering a unique opportunity to work at the forefront of AI safety and governance in London. With a mission-driven culture, employees benefit from direct influence on global AI deployment, access to cutting-edge resources, and generous support for professional development. The organisation fosters a collaborative environment with talented colleagues, flexible working arrangements, and a strong commitment to employee well-being, making it an ideal place for those passionate about impactful research.

Contact Detail:

AI Security Institute Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Research Engineer/Research Scientist – Model Transparency role

Tip Number 1

Network like a pro! Reach out to folks in the AI safety space, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

Tip Number 2

Show off your skills! Create a portfolio showcasing your research projects, experiments, or any relevant work you've done. This is your chance to demonstrate your expertise in model transparency and AI safety beyond just a CV.

Tip Number 3

Prepare for interviews by diving deep into the latest trends in AI safety and model transparency. Be ready to discuss your thoughts on current challenges and how you can contribute to the team at AISI. Confidence is key!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of our mission-driven team.

We think you need these skills to ace your Research Engineer/Research Scientist – Model Transparency application

Technical AI Safety
Model Transparency
Interpretability
Capability Evaluations
Alignment Evaluations
Large Language Models (LLMs)
Research Code Development
Machine Learning Frameworks (e.g., PyTorch)
Experimental Design
Data Analysis
Software Engineering
Infrastructure Development
Automation of Research Workflows
Critical Research Evaluation
Collaboration with Research Teams

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your application to highlight how your skills and experiences align with the role of Research Engineer or Research Scientist. We want to see how you can contribute to our mission at the AI Security Institute!

Showcase Your Work: Include examples of your previous research or projects that demonstrate your expertise in AI safety, model transparency, or related fields. This is your chance to shine, so don’t hold back on sharing your achievements!

Be Clear and Concise: When writing your application, keep it clear and to the point. We appreciate well-structured proposals that are easy to read and understand. Remember, quality over quantity!

Apply Through Our Website: Don’t forget to submit your application through our official website! It’s the best way for us to receive your details and ensures you’re considered for this exciting opportunity.

How to prepare for a job interview at AI Security Institute

Know Your Stuff

Make sure you’re well-versed in the latest research and developments in AI safety, model transparency, and related fields. Brush up on key concepts like interpretability and alignment evaluations, as these will likely come up during your interview.

Showcase Your Experience

Prepare to discuss your previous work in detail, especially any hands-on experience with large language models or relevant projects. Be ready to share specific examples of how you've tackled challenges in AI research or engineering.

Ask Insightful Questions

Demonstrate your interest in the role and the organisation by asking thoughtful questions about their current projects, team dynamics, and future goals. This shows that you’re not just looking for any job, but are genuinely interested in contributing to their mission.

Emphasise Teamwork and Ownership

Highlight your ability to work both independently and collaboratively. Share examples of how you’ve taken ownership of projects while also being a supportive team player. This balance is crucial for success in a dynamic environment like the AI Security Institute.
