Research Engineer, Frontier Safety Mitigations, DeepMind

Research Engineer, Frontier Safety Mitigations, DeepMind

Full-Time 70000 - 90000 £ / year (est.) No working from home possible
Google

At a Glance

  • Tasks: Develop advanced classifiers and data pipelines to enhance AI safety.
  • Company: Join DeepMind, a pioneering AI lab focused on solving global challenges.
  • Benefits: Collaborative culture, diverse learning opportunities, and commitment to public benefit.
  • Other info: Dynamic team environment with excellent career growth potential.
  • Why this job: Make a real impact in AI safety and ethics while working with cutting-edge technology.
  • Qualifications: 5 years of software development experience and strong knowledge in AI and cybersecurity.

The predicted salary is between 70000 - 90000 £ per year.

Qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.

Preferred qualifications:

  • PhD in Computer Science, Machine Learning, or equivalent practical experience, or publications at venues (e.g., NeurIPS, ICLR, ICML, or EMNLP).
  • Experience with cybersecurity detection and response, building classifiers and anomaly detection systems at scale, taking safety defenses or mitigations from research concepts to scalable production systems.
  • Experience in adversarial machine learning, automated red-teaming, or model interpretability and probes.
  • Experience collaborating on or leading applied ML projects, including LLM training, inference, and fine-tuning.
  • Experience using AI coding agents with strong architectural judgment and with TPUs and JAX.
  • Knowledge of AI control, chain-of-thought monitoring, monitorability, and related frontier safety research.

About the job:

In this role, you will de-risk model launches by defending against misuse domains (e.g., Cybersecurity, Chemical, Biological, Radiological, Nuclear, and Conventional Explosive [CBRNE], and Harmful Manipulation). You will build evaluations, conduct red-teaming, research and deploy mitigations (both in-model and out-of-model), and monitor emerging risks to enable the beneficial use of technology. DeepMind is a dedicated scientific community, committed to ‘solving intelligence’ and ensuring technology is used for widespread public benefit. The Frontier Safety Mitigation team operates in a collaborative environment with a culture of support, dedication, and teamwork. The team takes the possibility of dangerous model capabilities seriously as AI advances. Proactively researching and implementing defense-in-depth mitigations is a critical part of the overall strategy for building safe AI. You will join the Frontier Safety Mitigation team within the Gemini Safety team to build safety mitigations for frontier models. You will focus on building defenses against risks, contributing to DeepMind's Frontier Safety Framework commitments.

Responsibilities:

  • Build advanced classifiers and data pipelines to detect misuse, owning the end-to-end process from automated evaluation to rapid model iteration.
  • Build cross-context monitoring systems to detect coordinated harms, developing novel signal aggregation methods across disparate user sessions to identify large-scale attack vectors.
  • Implement data-driven, semi-automated account-level response systems to detect, track, and apply strikes against persistent malicious actors using rich signals from production traffic.
  • Evaluate and secure agentic AI systems by developing threat models, creating testing environments, and deploying robust mitigations against frontier-level agentic hacking and long-horizon attacks.
  • Advance research in automated red-teaming and adversarial robustness, leveraging multi-turn/agentic attacks to systematically test for and uncover misuse vulnerabilities.

Research Engineer, Frontier Safety Mitigations, DeepMind employer: Google

DeepMind is an exceptional employer located in London, UK, offering a collaborative and supportive work culture that prioritises safety and ethics in AI development. Employees benefit from diverse learning opportunities and career pathways, while contributing to groundbreaking research that aims to solve complex global challenges. With a commitment to public benefit and a focus on teamwork, DeepMind fosters an environment where innovative ideas can thrive, making it an attractive place for those seeking meaningful and rewarding employment.

Google

Contact Details:

Google Recruitment Team

We think you need these skills to ace Research Engineer, Frontier Safety Mitigations, DeepMind

Software Development
Programming Languages
Software Testing
Software Maintenance
Software Design
Software Architecture
Cybersecurity Detection and Response