At a Glance
- Tasks: Join a dynamic team to develop safeguards against AI misuse and protect society.
- Company: AI Security Institute, leading the charge in AI safety and governance.
- Benefits: Competitive salary, generous leave, remote work options, and professional development opportunities.
- Why this job: Make a real impact on AI safety while collaborating with top experts and government officials.
- Qualifications: 3+ years in applied ML or related fields; strong Python skills and creativity required.
- Other info: Flexible working arrangements and a supportive, mission-driven culture.
The predicted salary is between £65,000 and £145,000 per year.
The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We’re in the heart of the UK government with direct lines to No. 10 (the Prime Minister's office), and we work with frontier developers and governments globally.
Societal Resilience is a multidisciplinary team that studies how advanced AI models can impact people and society. We research the prevalence and severity of high-impact societal risks caused by frontier AI deployment, and develop mitigations to address these risks. Core research topics include the use of AI to assist criminal activity, undermine trust in information, jeopardise psychological wellbeing, or conduct malicious social engineering, as well as preventing critical overreliance on insufficiently robust systems. We are interested in both immediate and medium-term risks.
One emerging risk area we are concerned with is the use of open-weight models to generate child sexual abuse material (CSAM) and non-consensual intimate imagery (NCII). AISI has previously published research on methods for making open-weight models more robust against malicious tampering. In this role, you’ll join a strongly collaborative technical research team to help design and develop technical safeguards for open-weight models that reduce the risk of CSAM and NCII generation, among other harms. We do not expect this role to handle this kind of content directly.
This is a research scientist position focused on developing technical safeguards against tampering with open-weight models. The role targets AI-generated CSAM and NCII by addressing the real-world supply chain driving harm: open-weight models, adaptation artefacts (LoRAs, guides), and downstream distribution infrastructure (hosting platforms, app stores, operating systems).
Our approach prioritises downstream mitigations and actors beyond frontier model developers. You will build technical tools, protocols, and evidence that platforms and OS/app ecosystems can adopt. This work belongs inside the UK government because effective mitigation requires cross-agency coordination (Home Office, DSIT, Ofcom), engagement with regulated platforms under the Online Safety Act, and credible evidence to inform policy trade-offs across innovation, competition, and child protection.
This role will synthesise threat intelligence on how AI-generated CSAM and NCII are developed, create scalable screening methodologies that platforms can realistically run, and publish best-practice protocols with NGOs to raise the floor across the ecosystem. You’ll work closely with engineers and domain experts across AISI, as well as external research collaborators at the Home Office, the Internet Watch Foundation, and Ofcom. Researchers on this team have substantial freedom to shape independent research agendas, lead collaborations, and initiate projects that push the frontier of what evaluations can reveal.
Example Projects:
- Publish a Problem Book framing the technical challenges and research directions for preventing CSAM/NCII misuse across model and hosting layers.
- Develop threat models for how AI-generated CSAM and NCII are created and shared.
- Design and pilot scalable, automated screening methodologies that platforms can run pre-publication on uploads (topic-general prototypes that avoid exposure to illegal content).
- Develop approaches for identifying and tracking known or novel CSAM LoRAs to enable platform blocking at upload (see the illustrative sketch after this list).
- Co-develop best-practice protocols with NGOs (e.g., Thorn/IWF) for hosting, app store, and OS enforcement.
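To make the upload-screening and LoRA-tracking ideas above concrete, here is a deliberately simplified Python sketch of what artefact screening at upload time could look like: an exact hash check against known-bad artefacts plus a coarse weight-space similarity check for near-duplicates. Everything here (function names, the random-projection signature, the 0.9 threshold) is an illustrative assumption for this advert, not AISI's actual methodology.

```python
# Hypothetical sketch only: upload-time screening of a LoRA-style adapter
# against an index of known-bad fingerprints. All names and thresholds are
# illustrative assumptions, not AISI's actual methodology.
import hashlib
import numpy as np

def exact_fingerprint(tensors: dict[str, np.ndarray]) -> str:
    """SHA-256 over all tensor bytes, visited in sorted-key order."""
    h = hashlib.sha256()
    for name in sorted(tensors):
        h.update(name.encode())
        h.update(np.ascontiguousarray(tensors[name]).tobytes())
    return h.hexdigest()

def weight_signature(tensors: dict[str, np.ndarray], dim: int = 256) -> np.ndarray:
    """Coarse signature that tolerates small edits: project the concatenated
    weights onto a fixed, seeded random basis so every platform computes the
    same signature. (Illustrative only: a production system would stream or
    chunk this rather than materialise the full basis in memory.)"""
    flat = np.concatenate([t.ravel() for _, t in sorted(tensors.items())])
    rng = np.random.default_rng(0)  # shared seed -> shared basis
    basis = rng.standard_normal((dim, flat.size))
    sig = basis @ flat
    return sig / np.linalg.norm(sig)

def screen_upload(tensors, known_hashes, known_sigs, threshold=0.9):
    """Block exact re-uploads of known-bad artefacts; flag near-duplicates
    (high cosine similarity to a known signature) for human review."""
    if exact_fingerprint(tensors) in known_hashes:
        return "block"
    sig = weight_signature(tensors)
    if known_sigs.size and float(np.max(known_sigs @ sig)) > threshold:
        return "review"
    return "allow"

# Toy usage: an unknown adapter checked against an empty index passes through.
adapter = {"lora_A": np.ones((4, 4), dtype=np.float32)}
print(screen_upload(adapter, known_hashes=set(), known_sigs=np.empty((0, 256))))  # allow
```

A real system would additionally need to handle adversarial perturbation of weights, very large checkpoints, and governance of the fingerprint index itself; those open problems are part of what this role would work on.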
This is an individual contributor role with no line management responsibilities. You will report to a senior Research Scientist overseeing our team’s misuse workstream.
Your work will raise safety standards across hosting and distribution layers, reduce the availability of CSAM/NCII-generating artefacts (e.g., LoRAs) on major platforms, inform industry protocols and possibly standards, and provide actionable evidence for government decisions. Crucially, we do not expect this role to handle NCII or CSAM material.
Role Requirements:
We’re flexible on the exact profile and expect successful candidates will meet many (but not necessarily all) of the criteria below. Depending on experience, we will consider candidates at either the RS or Senior RS level.
Essential:
- 3+ years of relevant experience in applied ML, trust & safety tooling, content moderation, security engineering, or adjacent technical fields; we also welcome strong earlier-career applicants (2–3 years) with demonstrated impact in open-source technical work.
- Deep familiarity with open-weight image/video models (diffusion, LoRA), model hosting ecosystems (e.g., Hugging Face, GitHub), and the limitations of pre-deployment safeguards.
- Strong methodological rigour and creativity; able to design automated, scalable evaluations and detection methods that generalise and avoid reliance on illegal content.
- Strong Python and ML stack (PyTorch/JAX), data engineering, and systems skills; experience building pipelines and tooling that run at platform scale.
- Knowledge of fingerprinting and detection approaches (e.g., perceptual hashing, embedding-based similarity, behavioural signatures) and their privacy and robustness trade-offs (a minimal illustration follows this list).
- Excellent writing and communication for technical and policy audiences; ability to translate evidence into practical governance guidance.
- High agency, ethical judgment, and safe-working practices for sensitive topics.
- Willingness to work from our London office in Whitehall for part of the week, with flexibility for remote work. We’re looking for full-time commitment but are open to part-time arrangements.
Desirable:
- Experience collaborating with hosting platforms, app stores, OS vendors, or regulators (e.g., Ofcom) on safety-by-design initiatives.
- Familiarity with Online Safety Act requirements and platform trust & safety operations; prior work with NGOs such as IWF, Thorn, or STOPNCII.org.
- Expertise in diffusion models and adaptation techniques (LoRA), model evaluation, and secure tooling for sensitive domains.
- Experience with privacy-preserving computation, metadata-poor detection, and standardisation efforts (RFCs, protocols).
- Open-source contributions (tools, libraries) and evidence of leading cross-sector technical projects.
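As a minimal illustration of the perceptual-hashing idea named in the criteria above (emphatically not AISI's production tooling), the sketch below uses the open-source `imagehash` library; the 8-bit threshold is an assumption that real deployments would tune against the robustness and privacy trade-offs mentioned.

```python
# Minimal perceptual-hashing sketch (pip install pillow imagehash).
# The max_bits threshold is an illustrative assumption.
from PIL import Image
import imagehash

def is_near_duplicate(path_a: str, path_b: str, max_bits: int = 8) -> bool:
    """pHash survives re-encoding, resizing, and small edits, so a small
    Hamming distance between hashes suggests the same underlying image."""
    ha = imagehash.phash(Image.open(path_a))
    hb = imagehash.phash(Image.open(path_b))
    return (ha - hb) <= max_bits  # `-` gives Hamming distance in imagehash
```

Embedding-based similarity plays the complementary role of catching semantic variants that bit-level hashes miss.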
What We Offer:
- Impact you couldn't have anywhere else.
- Incredibly talented, mission-driven and supportive colleagues.
- Direct influence on how frontier AI is governed and deployed globally.
- Work with the Prime Minister’s AI Advisor and leading AI companies.
- Opportunity to shape the first and best-resourced public-interest research team focused on AI security.
- Resources & access: Pre-release access to multiple frontier models and ample compute.
- Extensive operational support so you can focus on research and ship quickly.
- Work with experts across national security, policy, AI research, and adjacent sciences.
- Growth & autonomy: If you’re talented and driven, you’ll own important problems early. 5 development days per year, an annual L&D budget, and travel support for conferences and external collaborations.
- Freedom to pursue research bets without product pressure.
- Opportunities to publish and collaborate externally.
Life & family:
- Modern central London office (cafes, food court, gym) or option to work in similar government offices in Birmingham, Cardiff, Darlington, Edinburgh, Salford, or Bristol.
- Hybrid working with opportunities for occasional remote work abroad.
- At least 25 days’ annual leave, 8 public holidays, and extra team-wide breaks.
- Generous paid parental leave (36 weeks of UK statutory leave shared between parents + 3 extra paid weeks + option for additional unpaid time).
- Plus: 27% government-funded pension contribution on top of salary, work-from-home equipment, and dental insurance.
Annual salary is benchmarked to role scope and relevant experience. Most offers land between £65,000 and £145,000 (base plus technical allowance), with 27% employer pension and other benefits on top.
This role sits outside of the DDaT pay framework, given that its scope requires in-depth technical expertise in frontier AI safety, robustness, and advanced AI architectures.
The full range of salaries is available below:
- Level 3: total package £65,000–£75,000, comprising a base salary of £35,720 plus a technical talent allowance of £29,280–£39,280.
- Level 4: total package £85,000–£95,000, comprising a base salary of £42,495 plus a technical talent allowance of £42,505–£52,505.
- Level 5: total package £105,000–£115,000, comprising a base salary of £55,805 plus a technical talent allowance of £49,195–£59,195.
- Level 6: total package £125,000–£135,000, comprising a base salary of £68,770 plus a technical talent allowance of £56,230–£66,230.
- Level 7: total package £145,000, comprising a base salary of £68,770 plus a technical talent allowance of £76,230.
In accordance with Civil Service Commission rules, the following list contains all selection criteria for the interview process. The process may vary from candidate to candidate; however, a typical process will include technical proficiency tests, discussions with a cross-section of our team at AISI (including non-technical staff), and conversations with your team lead. The process will culminate in a conversation with members of AISI's senior team.
Candidates should expect to go through some or all of the following stages once an application has been submitted:
- Initial interview.
- Technical take home test.
- Second interview and review of take home test.
- Third interview.
- Final interview with members of the senior team.
Additional Information:
Successful candidates must undergo a criminal record check and obtain Baseline Personnel Security Standard (BPSS) clearance before they can be appointed. Additionally, there is a strong preference for eligibility for Counter-Terrorist Check (CTC) clearance. Some roles may require higher levels of clearance; we will state this by exception in the job advertisement.
The Civil Service embraces diversity and promotes equal opportunities. As such, we run a Disability Confident Scheme (DCS) for candidates with disabilities who meet the minimum selection criteria. The Civil Service also offers a Redeployment Interview Scheme to civil servants who are at risk of redundancy, and who meet the minimum requirements for the advertised vacancy.
Employer: AI Security Institute
Contact: AI Security Institute Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Research Scientist, Open Source Technical Safeguards role in London
✨Network Like a Pro
Get out there and connect with people in the AI and tech safety space! Attend meetups, conferences, or even online webinars. The more you engage with others, the better your chances of landing that dream role at AISI.
✨Show Off Your Skills
When you get the chance to chat with potential employers, don’t hold back! Share your projects, especially those related to open-weight models or AI safety. We want to see your passion and expertise shine through!
✨Prepare for Technical Challenges
Brush up on your technical skills and be ready for some hands-on tests during interviews. Familiarise yourself with Python, ML stacks, and detection methods. We’re looking for candidates who can hit the ground running!
✨Apply Through Our Website
Don’t forget to apply directly through our website! It’s the best way to ensure your application gets the attention it deserves. Plus, we love seeing candidates who take the initiative to reach out directly.
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter to highlight your relevant experience in applied ML and technical safeguards. We want to see how your skills align with the role, so don’t hold back on showcasing your expertise!
Showcase Your Projects: If you've worked on any projects related to AI safety or open-weight models, be sure to mention them! We love seeing real-world applications of your skills, especially if they relate to mitigating risks like CSAM and NCII.
Be Clear and Concise: When writing your application, keep it straightforward and to the point. Use clear language to explain your experience and how it relates to the role. We appreciate a well-structured application that’s easy to read!
Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. We can’t wait to see what you bring to the table!
How to prepare for a job interview at AI Security Institute
✨Know Your Stuff
Make sure you brush up on your knowledge of open-weight models and the specific risks associated with AI-generated content. Familiarise yourself with the latest research and methodologies in this area, as well as the tools and technologies mentioned in the job description. This will show that you're not just interested in the role but are also passionate about the subject matter.
✨Showcase Your Experience
Prepare to discuss your previous work in applied ML, trust & safety tooling, or any relevant projects you've been involved in. Be ready to share specific examples of how you've tackled similar challenges in the past, especially those related to content moderation or security engineering. This will help demonstrate your capability and fit for the role.
✨Communicate Clearly
Since you'll be working with both technical and non-technical teams, practice explaining complex concepts in simple terms. Think about how you can translate your technical expertise into practical governance guidance. Clear communication is key, so consider doing mock interviews with friends or colleagues to refine your delivery.
✨Engage with the Team
During the interview, don't hesitate to ask questions about the team dynamics and ongoing projects. Show genuine interest in how AISI collaborates with external partners like NGOs and government bodies. This not only demonstrates your enthusiasm for the role but also helps you gauge if the team culture aligns with your values.