At a Glance
- Tasks: Conduct cutting-edge research in reinforcement learning and collaborate with top AI experts.
- Company: Join Anthropic, a leader in creating safe and beneficial AI systems.
- Benefits: Receive a competitive stipend, mentorship, and funding for research expenses.
- Other info: Work in vibrant locations like London or Berkeley, with opportunities for remote work.
- Why this job: Make a real impact on AI safety while developing your skills in a dynamic environment.
- Qualifications: Fluency in Python and a strong technical background in relevant fields.
The predicted salary is between 92400 - 124800 £ per year.
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
We are accepting applications on a rolling basis for the next cohort of Anthropic Fellows, which is expected to start in late September. In some circumstances, we can accommodate fellows starting outside the usual cohort timelines – please note in your application if the September start date doesn't work for you.
The Anthropic Fellows Program is designed to foster AI research and engineering talent. We provide funding and mentorship to promising technical talent – regardless of previous experience. Fellows will primarily use external infrastructure (e.g. open-source models, public APIs) to work on an empirical project aligned with our research priorities, with the goal of producing a public output (e.g. a paper submission). In one of our earlier cohorts, over 80% of fellows produced papers.
We run multiple cohorts of Fellows each year and review applications on a rolling basis. This application is for cohorts starting in July 2026 and beyond.
What to Expect
- 4 months of full-time research
- Direct mentorship from Anthropic researchers
- Access to a shared workspace (in either Berkeley, California or London, UK)
- Connection to the broader AI safety and security research community
- Weekly stipend of 3,850 USD / 2,310 GBP / 4,300 CAD + benefits (these vary by country)
- Funding for compute (~15k USD/month) and other research expenses
Interview Process
The interview process will include an initial application & reference check, technical assessments & interviews, and a research discussion. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work.
Compensation
The expected base stipend for this role is 3,850 USD / 2,310 GBP / 4,300 CAD per week, with an expectation of 40 hours per week for 4 months (with possible extension).
Fellows Workstreams
Due to the success of the Anthropic Fellows for AI Safety Research program, we are now expanding it across teams at Anthropic. We expect there to be significant overlap in the types of skills and responsibilities across the roles and will by default consider candidates for all the workstreams.
- AI Safety Fellows
- AI Security Fellows
- ML Systems & Performance Fellows
- Reinforcement Learning Fellows
- Economics & Societal Impacts Fellows
Across the Workstreams, you May Be a Good Fit If You:
- Are motivated by making sure AI is safe and beneficial for society as a whole
- Are excited to transition into empirical AI research and would be interested in a full-time role at Anthropic
- Have a strong technical background in computer science, mathematics, or physics
- Thrive in fast-paced, collaborative environments
- Can implement ideas quickly and communicate clearly
Strong Candidates May Also Have:
- Strong background in a discipline relevant to a specific Fellows workstream (e.g. economics, social sciences, or cybersecurity)
- Experience in areas of research or engineering related to their workstream
Candidates Must Be:
- Fluent in Python programming
- Available to work full-time on the Fellows program
Reinforcement Learning Fellows Mentors, Research Areas, & Past Projects
Fellows will undergo a project selection & mentor matching process. Potential research areas and mentors include:
- Ruhua Jiang
- Kaidi Cao
- Sunny Duan
- David Brandfonbrener
- Colt Steele
- Dino Distefano
- Will Williams
Projects in this workstream may include:
- Building model-based tools to better understand AI training data and improve training data quality
- Research project to better understand generalization
- Creating RL environments to improve Claude models at capabilities that are within your domain of expertise
- Building RL environments for safety-related tasks
- Conducting research and implementing solutions in areas such as RL algorithms
Unique Candidate Criteria
You might be a particularly great fit for this workstream if you:
- Have strong software engineering skills with experience building complex ML systems
- Can balance research exploration with engineering rigor and operational reliability
- Enjoy collaborating across research and engineering disciplines
- Are comfortable working with large-scale distributed systems and high-performance computing
- Have experience with training, fine-tuning, or evaluating large language models
- Are adept at analyzing and debugging model training processes
Logistics
Logistics Requirements: To participate in the Fellows program, you must have work authorization in the US, UK, or Canada and be located in that country during the program.
Workspace Locations: We have designated shared workspaces in London and Berkeley where fellows will work from and mentors will visit. We are also open to remote fellows in the UK, US, or Canada. We will ask you about your availability to work from Berkeley or London (full- or part-time) during the program.
Visa Sponsorship: We are not currently able to sponsor visas for fellows. To participate in the Fellows program, you need to have or independently obtain full-time work authorization in the UK, the US, or Canada.
Program Duration: The program runs for 4 months, full-time. If you can't commit to the full duration, please still apply and note your constraints in the application. We review these requests on a case-by-case basis.
Please note: We do not guarantee that we will make any full-time offers to fellows. However, strong performance during the program may indicate that a Fellow would be a good fit for full-time roles at Anthropic. In previous cohorts, 25‑50% of fellows received a full-time offer, and we’ve supported many more to go on to do great work on AI safety and security at other organizations.
Anthropic Fellows Program — Reinforcement Learning in London employer: Nerdleveltech
Contact Detail:
Nerdleveltech Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Anthropic Fellows Program — Reinforcement Learning in London
✨Tip Number 1
Network like a pro! Reach out to current or past fellows and mentors from the Anthropic program. A friendly chat can give you insider info and maybe even a referral!
✨Tip Number 2
Prepare for those interviews! Brush up on your technical skills and be ready to discuss your projects in detail. Show us how your experience aligns with Anthropic's mission of safe and beneficial AI.
✨Tip Number 3
Don’t just apply; engage! Follow Anthropic on social media, join discussions, and attend relevant events. This shows your genuine interest and helps you stand out from the crowd.
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen. Plus, it gives you a chance to tailor your submission to what we’re really looking for in fellows.
We think you need these skills to ace Anthropic Fellows Program — Reinforcement Learning in London
Some tips for your application 🫡
Be Yourself: When you're writing your application, let your personality shine through! We want to get to know the real you, so don’t be afraid to share your unique experiences and perspectives.
Tailor Your Application: Make sure to customise your application for the Anthropic Fellows Program. Highlight your relevant skills and experiences that align with our mission of creating safe and beneficial AI systems.
Show Your Passion: Express your enthusiasm for AI research and safety in your application. We love candidates who are genuinely excited about making a positive impact in the field!
Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way to ensure we receive all your details and can consider you for the program.
How to prepare for a job interview at Nerdleveltech
✨Know Your Stuff
Make sure you brush up on reinforcement learning concepts and any relevant projects you've worked on. Be ready to discuss your technical background in detail, especially your experience with Python and ML systems.
✨Show Your Passion for AI Safety
Anthropic is all about creating safe and beneficial AI. During the interview, express your motivation for working in this field and how your values align with their mission. Share any personal projects or research that demonstrate your commitment.
✨Prepare for Technical Assessments
Expect technical assessments as part of the interview process. Practice coding problems related to reinforcement learning and be prepared to solve them on the spot. Familiarise yourself with common algorithms and frameworks used in the industry.
✨Ask Thoughtful Questions
Interviews are a two-way street! Prepare insightful questions about the fellowship, the team, and ongoing projects at Anthropic. This shows your genuine interest and helps you determine if it's the right fit for you.