Anthropic Fellows Program — Reinforcement Learning in London

London | Internship | £92,400 - £124,800 / year (est.) | Home office (partial)

At a Glance

Tasks: Conduct cutting-edge research in reinforcement learning and collaborate with top AI experts.
Company: Join Anthropic, a leader in creating safe and beneficial AI systems.
Benefits: Receive a competitive stipend, mentorship, and funding for research expenses.
Other info: Work in vibrant locations like London or Berkeley, with opportunities for remote work.
Why this job: Make a real impact on AI safety while developing your skills in a dynamic environment.
Qualifications: Fluency in Python and a strong technical background in relevant fields.

The predicted salary is between £92,400 and £124,800 per year.

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

We are accepting applications on a rolling basis for the next cohort of Anthropic Fellows, which is expected to start in late September. In some circumstances, we can accommodate fellows starting outside the usual cohort timelines – please note in your application if the September start date doesn't work for you.

The Anthropic Fellows Program is designed to foster AI research and engineering talent. We provide funding and mentorship to promising technical talent – regardless of previous experience. Fellows will primarily use external infrastructure (e.g. open-source models, public APIs) to work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission). In one of our earlier cohorts, over 80% of fellows produced papers.

We run multiple cohorts of Fellows each year and review applications on a rolling basis. This application is for cohorts starting in July 2026 and beyond.

What to Expect

4 months of full-time research
Direct mentorship from Anthropic researchers
Access to a shared workspace (in either Berkeley, California or London, UK)
Connection to the broader AI safety and security research community
Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (these vary by country)
Funding for compute (~15k USD/month) and other research expenses

Interview Process

The interview process will include an initial application & reference check, technical assessments & interviews, and a research discussion. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work.

Compensation

The expected base stipend for this role is 3,850 USD / 2,310 GBP / 4,300 CAD per week, with an expectation of 40 hours per week for 4 months (with possible extension).

Fellows Workstreams

Due to the success of the Anthropic Fellows for AI Safety Research program, we are now expanding it across teams at Anthropic. We expect there to be significant overlap in the types of skills and responsibilities across the roles and will by default consider candidates for all the workstreams.

AI Safety Fellows
AI Security Fellows
ML Systems & Performance Fellows
Reinforcement Learning Fellows
Economics & Societal Impacts Fellows

Across the Workstreams, You May Be a Good Fit If You:

Are motivated by making sure AI is safe and beneficial for society as a whole
Are excited to transition into empirical AI research and would be interested in a full-time role at Anthropic
Have a strong technical background in computer science, mathematics, or physics
Thrive in fast-paced, collaborative environments
Can implement ideas quickly and communicate clearly

Strong Candidates May Also Have:

Strong background in a discipline relevant to a specific Fellows workstream (e.g. economics, social sciences, or cybersecurity)
Experience in areas of research or engineering related to their workstream

Candidates Must Be:

Fluent in Python programming
Available to work full-time on the Fellows program

Reinforcement Learning Fellows Mentors, Research Areas, & Past Projects

Fellows will undergo a project selection & mentor matching process. Potential mentors include:

Ruhua Jiang
Kaidi Cao
Sunny Duan
David Brandfonbrener
Colt Steele
Dino Distefano
Will Williams

Projects in this workstream may include:

Building model-based tools to better understand AI training data and improve training data quality
Conducting research to better understand generalization
Creating RL environments to improve Claude models at capabilities that are within your domain of expertise
Building RL environments for safety-related tasks
Conducting research and implementing solutions in areas such as RL algorithms

Unique Candidate Criteria

You might be a particularly great fit for this workstream if you:

Have strong software engineering skills with experience building complex ML systems
Can balance research exploration with engineering rigor and operational reliability
Enjoy collaborating across research and engineering disciplines
Are comfortable working with large-scale distributed systems and high-performance computing
Have experience with training, fine-tuning, or evaluating large language models
Are adept at analyzing and debugging model training processes

Logistics

Logistics Requirements: To participate in the Fellows program, you must have work authorization in the US, UK, or Canada and be located in that country during the program.

Workspace Locations: We have designated shared workspaces in London and Berkeley where fellows will work from and mentors will visit. We are also open to remote fellows in the UK, US, or Canada. We will ask you about your availability to work from Berkeley or London (full- or part-time) during the program.

Visa Sponsorship: We are not currently able to sponsor visas for fellows. To participate in the Fellows program, you need to have or independently obtain full-time work authorization in the UK, the US, or Canada.

Program Duration: The program runs for 4 months, full-time. If you can't commit to the full duration, please still apply and note your constraints in the application. We review these requests on a case-by-case basis.

Please note: We do not guarantee that we will make any full-time offers to fellows. However, strong performance during the program may indicate that a Fellow would be a good fit for full-time roles at Anthropic. In previous cohorts, 25‑50% of fellows received a full-time offer, and we’ve supported many more to go on to do great work on AI safety and security at other organizations.

Anthropic Fellows Program — Reinforcement Learning in London employer: Nerdleveltech

Anthropic is an exceptional employer that prioritises the growth and development of its team members through the Anthropic Fellows Program, which offers direct mentorship from leading researchers and access to a collaborative workspace in vibrant locations like London. With a strong commitment to fostering a diverse and inclusive work culture, fellows are encouraged to explore their potential in AI research while receiving competitive stipends and funding for research expenses, making it an ideal environment for those passionate about creating safe and beneficial AI systems.

Contact Detail:

Nerdleveltech Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Anthropic Fellows Program — Reinforcement Learning in London

✨Tip Number 1

Network like a pro! Reach out to current or past fellows and mentors from the Anthropic program. A friendly chat can give you insider info and maybe even a referral!

✨Tip Number 2

Prepare for those interviews! Brush up on your technical skills and be ready to discuss your projects in detail. Show us how your experience aligns with Anthropic's mission of safe and beneficial AI.

✨Tip Number 3

Don’t just apply; engage! Follow Anthropic on social media, join discussions, and attend relevant events. This shows your genuine interest and helps you stand out from the crowd.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it gives you a chance to tailor your submission to what we’re really looking for in fellows.

We think you need these skills to ace Anthropic Fellows Program — Reinforcement Learning in London

Fluency in Python programming

Strong technical background in computer science

Strong technical background in mathematics

Strong technical background in physics

Software engineering skills

Experience building complex ML systems

Ability to balance research exploration with engineering rigor

Operational reliability

Collaboration across research and engineering disciplines

Comfort with large-scale distributed systems

Experience with high-performance computing

Experience with training, fine-tuning, or evaluating large language models

Analytical skills for debugging model training processes

Empirical research skills

Some tips for your application 🫡

Be Yourself: When you're writing your application, let your personality shine through! We want to get to know the real you, so don’t be afraid to share your unique experiences and perspectives.

Tailor Your Application: Make sure to customise your application for the Anthropic Fellows Program. Highlight your relevant skills and experiences that align with our mission of creating safe and beneficial AI systems.

Show Your Passion: Express your enthusiasm for AI research and safety in your application. We love candidates who are genuinely excited about making a positive impact in the field!

Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way to ensure we receive all your details and can consider you for the program.

How to prepare for a job interview at Nerdleveltech

✨Know Your Stuff

Make sure you brush up on reinforcement learning concepts and any relevant projects you've worked on. Be ready to discuss your technical background in detail, especially your experience with Python and ML systems.

✨Show Your Passion for AI Safety

Anthropic is all about creating safe and beneficial AI. During the interview, express your motivation for working in this field and how your values align with their mission. Share any personal projects or research that demonstrate your commitment.

✨Prepare for Technical Assessments

Expect technical assessments as part of the interview process. Practice coding problems related to reinforcement learning and be prepared to solve them on the spot. Familiarise yourself with common algorithms and frameworks used in the industry.

✨Ask Thoughtful Questions

Interviews are a two-way street! Prepare insightful questions about the fellowship, the team, and ongoing projects at Anthropic. This shows your genuine interest and helps you determine if it's the right fit for you.
