Anthropic Fellows Program — Reinforcement Learning in London
Anthropic Fellows Program — Reinforcement Learning

Anthropic Fellows Program — Reinforcement Learning in London

London Internship 92400 - 124800 £ / year (est.) Home office (partial)
N

At a Glance

  • Tasks: Conduct cutting-edge research in reinforcement learning and collaborate with top AI experts.
  • Company: Join Anthropic, a leader in creating safe and beneficial AI systems.
  • Benefits: Receive a competitive stipend, mentorship, and funding for research expenses.
  • Other info: Work in vibrant locations like London or Berkeley, with opportunities for remote work.
  • Why this job: Make a real impact on AI safety while developing your skills in a dynamic environment.
  • Qualifications: Fluency in Python and a strong technical background in relevant fields.

The predicted salary is between 92400 - 124800 £ per year.

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

We are accepting applications on a rolling basis for the next cohort of Anthropic Fellows, which is expected to start in late September. In some circumstances, we can accommodate fellows starting outside the usual cohort timelines – please note in your application if the September start date doesn't work for you.

The Anthropic Fellows Program is designed to foster AI research and engineering talent. We provide funding and mentorship to promising technical talent – regardless of previous experience. Fellows will primarily use external infrastructure (e.g. open-source models, public APIs) to work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission). In one of our earlier cohorts, over 80% of fellows produced papers.

We run multiple cohorts of Fellows each year and review applications on a rolling basis. This application is for cohorts starting in July 2026 and beyond.

What to Expect

  • 4 months of full-time research
  • Direct mentorship from Anthropic researchers
  • Access to a shared workspace (in either Berkeley, California or London, UK)
  • Connection to the broader AI safety and security research community
  • Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (these vary by country)
  • Funding for compute (~15k USD/month) and other research expenses

Interview Process

The interview process will include an initial application & reference check, technical assessments & interviews, and a research discussion. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work.

Compensation

The expected base stipend for this role is 3,850 USD / 2,310 GBP / 4,300 CAD per week, with an expectation of 40 hours per week for 4 months (with possible extension).

Fellows Workstreams

Due to the success of the Anthropic Fellows for AI Safety Research program, we are now expanding it across teams at Anthropic. We expect there to be significant overlap in the types of skills and responsibilities across the roles and will by default consider candidates for all the workstreams.

  • AI Safety Fellows
  • AI Security Fellows
  • ML Systems & Performance Fellows
  • Reinforcement Learning Fellows
  • Economics & Societal Impacts Fellows

Across the Workstreams, you May Be a Good Fit If You:

  • Are motivated by making sure AI is safe and beneficial for society as a whole
  • Are excited to transition into empirical AI research and would be interested in a full-time role at Anthropic
  • Have a strong technical background in computer science, mathematics, or physics
  • Thrive in fast-paced, collaborative environments
  • Can implement ideas quickly and communicate clearly

Strong Candidates May Also Have:

  • Strong background in a discipline relevant to a specific Fellows workstream (e.g. economics, social sciences, or cybersecurity)
  • Experience in areas of research or engineering related to their workstream

Candidates Must Be:

  • Fluent in Python programming
  • Available to work full-time on the Fellows program

Reinforcement Learning Fellows Mentors, Research Areas, & Past Projects

Fellows will undergo a project selection & mentor matching process. Potential research areas and mentors include:

  • Ruhua Jiang
  • Kaidi Cao
  • Sunny Duan
  • David Brandfonbrener
  • Colt Steele
  • Dino Distefano
  • Will Williams

Projects in this workstream may include:

  • Building model-based tools to better understand AI training data and improve training data quality
  • Research project to better understand generalization
  • Creating RL environments to improve Claude models at capabilities that are within your domain of expertise
  • Building RL environments for safety-related tasks
  • Conducting research and implementing solutions in areas such as RL algorithms

Unique Candidate Criteria

You might be a particularly great fit for this workstream if you:

  • Have strong software engineering skills with experience building complex ML systems
  • Can balance research exploration with engineering rigor and operational reliability
  • Enjoy collaborating across research and engineering disciplines
  • Are comfortable working with large-scale distributed systems and high-performance computing
  • Have experience with training, fine-tuning, or evaluating large language models
  • Are adept at analyzing and debugging model training processes

Logistics

Logistics Requirements: To participate in the Fellows program, you must have work authorization in the US, UK, or Canada and be located in that country during the program.

Workspace Locations: We have designated shared workspaces in London and Berkeley where fellows will work from and mentors will visit. We are also open to remote fellows in the UK, US, or Canada. We will ask you about your availability to work from Berkeley or London (full- or part-time) during the program.

Visa Sponsorship: We are not currently able to sponsor visas for fellows. To participate in the Fellows program, you need to have or independently obtain full-time work authorization in the UK, the US, or Canada.

Program Duration: The program runs for 4 months, full-time. If you can't commit to the full duration, please still apply and note your constraints in the application. We review these requests on a case-by-case basis.

Please note: We do not guarantee that we will make any full-time offers to fellows. However, strong performance during the program may indicate that a Fellow would be a good fit for full-time roles at Anthropic. In previous cohorts, 25‑50% of fellows received a full-time offer, and we’ve supported many more to go on to do great work on AI safety and security at other organizations.

Anthropic Fellows Program — Reinforcement Learning in London employer: Nerdleveltech

Anthropic is an exceptional employer that prioritises the growth and development of its team members through the Anthropic Fellows Program, which offers direct mentorship from leading researchers and access to a collaborative workspace in vibrant locations like London. With a strong commitment to fostering a diverse and inclusive work culture, fellows are encouraged to explore their potential in AI research while receiving competitive stipends and funding for research expenses, making it an ideal environment for those passionate about creating safe and beneficial AI systems.
N

Contact Detail:

Nerdleveltech Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Anthropic Fellows Program — Reinforcement Learning in London

Tip Number 1

Network like a pro! Reach out to current or past fellows and mentors from the Anthropic program. A friendly chat can give you insider info and maybe even a referral!

Tip Number 2

Prepare for those interviews! Brush up on your technical skills and be ready to discuss your projects in detail. Show us how your experience aligns with Anthropic's mission of safe and beneficial AI.

Tip Number 3

Don’t just apply; engage! Follow Anthropic on social media, join discussions, and attend relevant events. This shows your genuine interest and helps you stand out from the crowd.

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it gives you a chance to tailor your submission to what we’re really looking for in fellows.

We think you need these skills to ace Anthropic Fellows Program — Reinforcement Learning in London

Fluency in Python programming
Strong technical background in computer science
Strong technical background in mathematics
Strong technical background in physics
Software engineering skills
Experience building complex ML systems
Ability to balance research exploration with engineering rigor
Operational reliability
Collaboration across research and engineering disciplines
Comfort with large-scale distributed systems
Experience with high-performance computing
Experience with training, fine-tuning, or evaluating large language models
Analytical skills for debugging model training processes
Empirical research skills

Some tips for your application 🫡

Be Yourself: When you're writing your application, let your personality shine through! We want to get to know the real you, so don’t be afraid to share your unique experiences and perspectives.

Tailor Your Application: Make sure to customise your application for the Anthropic Fellows Program. Highlight your relevant skills and experiences that align with our mission of creating safe and beneficial AI systems.

Show Your Passion: Express your enthusiasm for AI research and safety in your application. We love candidates who are genuinely excited about making a positive impact in the field!

Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way to ensure we receive all your details and can consider you for the program.

How to prepare for a job interview at Nerdleveltech

Know Your Stuff

Make sure you brush up on reinforcement learning concepts and any relevant projects you've worked on. Be ready to discuss your technical background in detail, especially your experience with Python and ML systems.

Show Your Passion for AI Safety

Anthropic is all about creating safe and beneficial AI. During the interview, express your motivation for working in this field and how your values align with their mission. Share any personal projects or research that demonstrate your commitment.

Prepare for Technical Assessments

Expect technical assessments as part of the interview process. Practice coding problems related to reinforcement learning and be prepared to solve them on the spot. Familiarise yourself with common algorithms and frameworks used in the industry.

Ask Thoughtful Questions

Interviews are a two-way street! Prepare insightful questions about the fellowship, the team, and ongoing projects at Anthropic. This shows your genuine interest and helps you determine if it's the right fit for you.

Anthropic Fellows Program — Reinforcement Learning in London
Nerdleveltech
Location: London

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>