Research Engineer, RSP Evaluations (Autonomy)

Full-Time | No working from home possible
Anthropic

At a Glance

  • Tasks: Design and run evaluations to measure AI risks and collaborate with top experts.
  • Company: Leading tech firm focused on AI safety and innovation.
  • Benefits: Competitive salary, equity options, unlimited PTO, and comprehensive health coverage.
  • Other info: Hybrid work policy with excellent relocation support and career growth opportunities.
  • Why this job: Make a real impact on AI safety while working with world-class professionals.
  • Qualifications: ML background, Python skills, and experience in collaborative research environments.

We are looking for Research Engineers to build “gold standard” evaluations for catastrophic risks, in order to understand what AI Safety Level (ASL) to assign to models. Research leads on this team collaborate with engineers in one of our focus areas: CBRN, Cyber, and Autonomy (this list may expand over time). These evaluations will have major implications for the way we train, deploy, and secure our models, as detailed in our Responsible Scaling Policy (RSP). The policy defines a series of capability thresholds, the AI Safety Levels (ASLs), that represent increasing risks. Crossing an ASL threshold would trigger a commitment to more stringent safety, security, and operational measures, intended to handle the increased level of risk.

Please note: We are currently only hiring for the Autonomous Replication and Adaptation (Autonomy) threats workstream. We will also be prioritising candidates who can start ASAP and can be based in either our San Francisco or London office.

Responsibilities

  • Design and run the evaluations needed to measure dangerous capabilities in models, and determine when we cross an ASL threshold.
  • Lead projects with world‑class experts in fields such as biosecurity, autonomous replication, cybersecurity, and national security, and experiment with new evals to measure how risky AI systems are.
  • Inform decisions at the highest levels of the company.

Qualifications

  • ML‑focused background with engineering and research skills (e.g. experience in Python).
  • Experience managing research programs comprising dozens of technical and non‑technical experts.
  • Ability to find solutions to ambiguously scoped problems.
  • Ability to design and run experiments and iterate quickly to solve machine‑learning problems.
  • Ability to thrive in a collaborative environment (pair programming is preferred).
  • Experience training, working with, and prompting large language models.

Sample Projects

  • ARA (Autonomous Replication and Adaptation) risks – build infrastructure and tooling for testing these capabilities, iterating with external ARA experts to scope possible tasks; build custom “testing environments” and new infrastructure.
  • CBRN risks – work with external experts in biosecurity to design clear and repeatable CBRN evaluations, using post‑training infrastructure to prepare new generations of models for routine evaluations.
  • Cyber risks – co‑design a set of clear and repeatable cyber evaluations with external cyber experts; build custom environments or extensions to existing tooling, or locate specialized datasets.

Logistics

Location‑based hybrid policy: We expect all staff to be in one of our offices at least 25% of the time.

Visa Sponsorship

We sponsor visas for eligible candidates, and will make every effort to help you relocate to the United States, retaining an immigration lawyer to assist throughout the process.

Compensation and Benefits

Annual Salary: £260,000–£420,000 GBP. We offer a competitive compensation package that includes salary, equity, and benefits that collectively meet or exceed market rates.

Benefits

US Benefits

  • Optional equity donation matching.
  • Comprehensive health, dental, and vision insurance for you and all your dependents.
  • 401(k) plan with 4% matching.
  • 22 weeks of paid parental leave.
  • Unlimited PTO.
  • Stipends for education, home office improvements, commuting, and wellness.
  • Fertility benefits via Carrot.
  • Daily lunches and snacks in our office.
  • Relocation support for those moving to the Bay Area.

UK Benefits

  • Optional equity donation matching.
  • Private health, dental, and vision insurance for you and all your dependents.
  • Pension contribution matching 4% of your salary.
  • 21 weeks of paid parental leave.
  • Unlimited PTO.
  • Health cash plan.
  • Life insurance and income protection.
  • Daily lunches and snacks in our office.

Research Engineer, RSP Evaluations (Autonomy) employer: Anthropic

Join a forward-thinking company that prioritises innovation and collaboration, offering Research Engineers the opportunity to work on groundbreaking evaluations for catastrophic risks in AI. With competitive compensation, comprehensive benefits, and a supportive work culture that encourages professional growth, our London and San Francisco offices provide a dynamic environment where your contributions will directly influence the future of AI safety. Experience the unique advantage of working alongside world-class experts while enjoying flexible work arrangements and generous parental leave policies.

Contact Details: Anthropic Recruitment Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Research Engineer, RSP Evaluations (Autonomy) role

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those already working at our company. A friendly chat can open doors and give you insider info that could make your application stand out.

Tip Number 2

Show off your skills! Prepare a portfolio or a project that highlights your experience with machine learning and research. This is your chance to demonstrate how you tackle complex problems and design experiments.

Tip Number 3

Be ready for a technical interview! Brush up on your Python skills and be prepared to discuss your past projects. We love candidates who can think on their feet and solve problems in real-time.

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team and contributing to our mission.

We think you need these skills to ace the Research Engineer, RSP Evaluations (Autonomy) role

Machine Learning
Python
Research Skills
Project Management
Collaboration
Problem-Solving
Experiment Design

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter for the Research Engineer role. Highlight your experience with machine learning, Python, and any relevant projects that showcase your ability to handle ambiguous problems. We want to see how your skills align with our mission!

Showcase Collaboration Skills: Since we thrive in a collaborative environment, it’s essential to demonstrate your teamwork abilities. Share examples of how you've worked with diverse teams or led projects involving multiple experts. This will show us you can fit right in with our crew!

Be Clear and Concise: When writing your application, clarity is key! Use straightforward language and avoid jargon unless necessary. We appreciate well-structured applications that get straight to the point, making it easy for us to see your qualifications.

Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!

How to prepare for a job interview at Anthropic

Know Your Stuff

Make sure you brush up on your machine learning concepts and Python skills. Be ready to discuss your experience with AI Safety Levels and how they relate to the role. Familiarise yourself with the Responsible Scaling Policy and be prepared to share your thoughts on its implications.

Show Your Collaborative Spirit

This role thrives in a collaborative environment, so be ready to talk about your experiences working in teams. Share examples of pair programming or projects where you’ve collaborated with experts from different fields. Highlight how you contribute to a team dynamic.

Prepare for Ambiguity

Expect questions that test your ability to navigate ambiguously scoped problems. Think of examples from your past work where you successfully tackled unclear challenges. Show how you approach problem-solving and iterate quickly to find solutions.

Ask Insightful Questions

Prepare thoughtful questions about the team’s current projects and future directions, especially regarding autonomy threats. This shows your genuine interest in the role and helps you understand how you can contribute effectively.