Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London

Job Board

Companies

Deepstreamtech

Research Engineer (Machine Learning, Reinforcement Learning Velocity)

Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London

London Full-Time 60000 - 80000 € / year (est.) No home office possible

Apply Now

At a Glance

Tasks: Build and enhance the RL training infrastructure for faster research iterations.
Company: Join a leading AI company focused on innovative machine learning solutions.
Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
Other info: Encouraging diverse applicants; apply even if you don't meet every qualification.
Why this job: Make a real impact by improving research efficiency and reliability in machine learning.
Qualifications: Strong software engineering skills and experience with ML infrastructure preferred.

The predicted salary is between 60000 - 80000 € per year.

Requirements

Have strong software engineering fundamentals and a track record of building performant, reliable systems
Have worked on ML infrastructure, distributed systems, or research tooling
Care about enabling other people's work and find leverage through platforms rather than individual experiments
Are comfortable operating across the stack, from low-level performance work to RL algorithms
Have a bias toward shipping and iterating quickly, with a mix of high agency and low ego
(Desirable) Experience with large-scale distributed training (RL, pre-training, or post-training)
(Desirable) Familiarity with JAX, PyTorch, or similar ML frameworks
(Desirable) A track record of operating at the edge of research and infra in a fast-moving environment
Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work.

What the job involves

The RL Velocity team owns the efficiency and reliability of our RL Science stack - the infrastructure, tooling, and systems that let researchers iterate quickly on training runs.
As a Research Engineer on the team, you'll build and improve the core platform that underpins how we do RL at Anthropic, removing bottlenecks that slow down research and making it easier for the broader org to ship better models faster.
This is high-leverage work: small improvements to velocity compound across every researcher and every run.
Build and improve the RL training infrastructure that researchers depend on day-to-day.
Identify and remove bottlenecks across the RL stack: debugging, profiling, and rearchitecting where needed.
Partner closely with researchers and with adjacent engineering teams (inference, sandboxing, and many more) to understand pain points and ship tooling that makes them faster.
Own the reliability and performance of research runs end-to-end.
Contribute to design decisions that shape how Anthropic does RL at scale.

Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London employer: Deepstreamtech

At Anthropic, we pride ourselves on being an exceptional employer, particularly for those in the Research Engineer role focused on Machine Learning and Reinforcement Learning. Our collaborative work culture fosters innovation and encourages personal growth, allowing you to make impactful contributions while working alongside talented researchers and engineers. With a commitment to employee development and a supportive environment that values diverse perspectives, you'll find unique opportunities to advance your career in a fast-paced, cutting-edge field.

Contact Detail:

Deepstreamtech Recruiting Team

View Deepstreamtech Profile

StudySmarter Expert Advice🤫

We think this is how you could land Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with researchers and engineers on LinkedIn. Building relationships can open doors that a CV just can't.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to ML and RL. This gives potential employers a taste of what you can do beyond the written application.

✨Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice coding challenges and be ready to discuss your past projects in detail. We want to see how you think and approach problems!

✨Tip Number 4

Don’t hesitate to apply through our website! Even if you don’t tick every box, we value diverse experiences and perspectives. If you're passionate about the role, go for it – you might just surprise yourself!

We think you need these skills to ace Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London

Software Engineering Fundamentals

Machine Learning Infrastructure

Distributed Systems

Research Tooling

Reinforcement Learning Algorithms

Performance Optimisation

JAX

PyTorch

Debugging Skills

Profiling Skills

System Rearchitecting

Collaboration with Researchers

Tooling Development

Reliability Engineering

End-to-End Performance Management

Some tips for your application 🫡

Show Off Your Skills:Make sure to highlight your software engineering fundamentals and any experience you've got with ML infrastructure or distributed systems. We want to see how you've built reliable systems in the past, so don’t hold back!

Be a Team Player:We care about enabling others, so share examples of how you've collaborated with teams or improved processes. If you've worked on platforms that help others succeed, let us know – it’s a big plus!

Keep It Relevant:Tailor your application to the role by mentioning your experience with RL algorithms or frameworks like JAX or PyTorch. Even if you don’t tick every box, we’re keen to see your passion for the field!

Apply Through Our Website:Don’t forget to submit your application through our website! It’s the best way for us to get your details and ensure you’re considered for this exciting opportunity. We can’t wait to hear from you!

How to prepare for a job interview at Deepstreamtech

✨Know Your Tech Inside Out

Make sure you brush up on your software engineering fundamentals and be ready to discuss your experience with ML infrastructure and distributed systems. Be prepared to share specific examples of how you've built performant systems and the impact they had on your previous projects.

✨Showcase Your Collaborative Spirit

This role is all about enabling others, so highlight your experiences working closely with researchers and engineering teams. Talk about how you've identified pain points in past projects and the steps you took to address them, demonstrating your ability to partner effectively across different teams.

✨Demonstrate Your Quick Iteration Mindset

The company values a bias towards shipping and iterating quickly. Prepare to discuss instances where you’ve rapidly prototyped solutions or made improvements to existing systems. Emphasise your ability to balance high agency with low ego, showing that you're focused on the team's success rather than just your own.

✨Be Ready for Technical Challenges

Expect to face some technical questions or challenges during the interview. Brush up on your knowledge of RL algorithms and frameworks like JAX or PyTorch. Practising coding problems related to performance optimisation can also give you an edge, as it shows your readiness to tackle real-world issues.

Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London

Deepstreamtech

Location: London

Apply Now

Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London

At a Glance

Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London employer: Deepstreamtech

StudySmarter Expert Advice🤫

We think you need these skills to ace Research Engineer (Machine Learning, Reinforcement Learning Velocity) in London

Some tips for your application 🫡

How to prepare for a job interview at Deepstreamtech

Company

Product

Help