Research Engineer / Research Scientist, Finetuning

Job Board

Companies

Anthropic

Research Engineer / Research Scientist, Finetuning

Full-Time Home office (partial)

Apply Now

At a Glance

Tasks: Join us in shaping AI behaviour through innovative machine learning experiments.
Company: Cutting-edge AI research company focused on ethical and impactful technology.
Benefits: Competitive salary, equity options, unlimited PTO, and comprehensive health benefits.
Other info: Collaborative environment with opportunities for personal research and career growth.
Why this job: Make a real difference in AI safety and ethics while working on groundbreaking projects.
Qualifications: Experience in Python, machine learning, and a passion for AI alignment.

You want to help construct and rapidly iterate on machine learning experiments to help us improve the behavior of powerful AI systems through finetuning. You care about making AI helpful, honest, and harmless, and are interested in shaping model behavior to be more aligned with human values and goals. You could describe yourself as both a scientist and an engineer. As a Research Scientist or Research Engineer on the Finetuning team, you'll contribute to research on improving language models through techniques like constitutional AI. You will have the opportunity to do creative, cutting-edge research on frontier models, and to see your work result in concrete improvements in performance and safety.

We generally expect research scientists to be able to iterate on their own experiments. We also provide opportunities for engineers to pursue their own research projects. Therefore this role can be more research oriented or more engineering oriented, depending on the experience and interests of the candidate.

Representative projects

Help develop novel finetuning techniques to improve language model behavior and make models more helpful, honest, and harmless.
Test out techniques like constitutional AI at scale and measure their impacts on model behavior.
Build tooling and infrastructure to enable efficient fine-tuning experiments on large language models.
Develop novel prompts and prompting strategies to improve and test model behaviours.
Run experiments that feed into key AI research and safety efforts at Anthropic.

You may be a good fit if you

Have significant Python, machine learning, research engineering, or research experience.
Prefer fast-moving collaborative projects with concrete goals that involve improving model behaviours.
Are results-oriented, with a bias towards flexibility and impact.
Pick up slack, even if it goes outside your job description.
Care about the impact of AI and of your work.

Strong candidates may also

Have prior experience with large language model finetuning techniques such as RLHF.
Have experience with complex shared codebases and RL infrastructure.
Have experience authoring research papers in machine learning, NLP, or AI alignment or similar industry experience.

Annual Salary (USD)

The expected salary range for this position is $280k - $600k USD.

Logistics

Location-based hybrid policy: Currently, we expect all staff to be in our office at least 25% of the time.
Deadline to apply: None. Applications will be reviewed on a rolling basis.
US visa sponsorship: We do sponsor visas! However, we aren’t able to successfully sponsor visas for every role and every candidate; operations roles are especially difficult to support. But if we make you an offer, we will make every effort to get you into the United States, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Compensation and Benefits

Anthropic’s compensation package consists of three elements: salary, equity, and benefits. We are committed to pay fairness and aim for these three elements collectively to be highly competitive with market rates.

Equity - On top of this position's salary (listed above), equity will be a major component of the total compensation. We aim to offer higher-than-average equity compensation for a company of our size, and communicate equity amounts at the time of offer issuance.

US Benefits

Optional equity donation matching at a 3:1 ratio, up to 50% of your equity grant.
Comprehensive health, dental, and vision insurance for you and all your dependents.
401(k) plan with 4% matching.
22 weeks of paid parental leave.
Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
Stipends for education, home office improvements, commuting, and wellness.
Fertility benefits via Carrot.
Daily lunches and snacks in our office.
Relocation support for those moving to the Bay Area.

UK Benefits

Optional equity donation matching at a 3:1 ratio, up to 50% of your equity grant.
Private health, dental, and vision insurance for you and your dependents.
Pension contribution (matching 4% of your salary).
22 weeks of paid parental leave.
Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
Health cash plan.
Life insurance and income protection.
Daily lunches and snacks in our office.

This compensation and benefits information is based on Anthropic’s good faith estimate for this position as of the date of publication and may be modified in the future. Employees based outside of the UK or US will receive a different benefits package. The level of pay within the range will depend on a variety of job-related factors, including where you place on our internal performance ladders, which is based on factors including past work experience, relevant education, and performance on our interviews or in a work trial.

How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Research Engineer / Research Scientist, Finetuning employer: Anthropic

At Anthropic, we pride ourselves on being an exceptional employer, particularly for those passionate about advancing AI in a meaningful way. Our Bay Area location fosters a vibrant work culture that encourages collaboration and innovation, while our commitment to employee growth is reflected in our unlimited PTO, comprehensive benefits, and opportunities for creative research projects. Join us to be part of a team that values diverse perspectives and aims to make AI systems more helpful, honest, and harmless.

Contact Details:

Anthropic Recruitment Team

View Anthropic profile

StudySmarter Expert Advice🤫

We think this is how you could land Research Engineer / Research Scientist, Finetuning

✨Tip Number 1

Network like a pro! Reach out to folks in the AI and machine learning community, attend meetups, and connect on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to finetuning and AI alignment. This will give potential employers a taste of what you can do and how you think.

✨Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice explaining your past projects and how they relate to the role you're applying for. We want to see your thought process!

✨Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love candidates who take the initiative to reach out directly.

We think you need these skills to ace Research Engineer / Research Scientist, Finetuning

Python

Machine Learning

Research Engineering

Finetuning Techniques

Constitutional AI

Experimentation

NLP

AI Alignment

Collaboration

Results-Oriented

Flexibility

Impact Measurement

Research Paper Authoring

Large Language Models

Some tips for your application 🫡

Show Your Passion for AI:When writing your application, let your enthusiasm for AI and its impact shine through. We want to see how much you care about making AI helpful, honest, and harmless, so share your thoughts on these values!

Tailor Your Experience:Make sure to highlight your relevant experience in machine learning, Python, or research engineering. We love seeing how your background aligns with the role, so don’t hold back on showcasing your skills and projects!

Be Creative and Clear:Your application is a chance to show off your creativity! Use clear language and structure your thoughts well. We appreciate innovative ideas, especially when it comes to finetuning techniques and improving model behaviour.

Apply Through Our Website:Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, we’re excited to see what you bring to the table!

How to prepare for a job interview at Anthropic

✨Know Your Stuff

Make sure you brush up on your Python and machine learning concepts. Be ready to discuss your past projects, especially those involving finetuning techniques or large language models. This shows you’re not just a theorist but someone who can apply their knowledge practically.

✨Show Your Passion for AI Ethics

Since the role focuses on making AI helpful, honest, and harmless, be prepared to talk about your views on AI ethics. Share any experiences where you've considered the social implications of your work. This will demonstrate that you align with the company's values.

✨Prepare for Technical Questions

Expect to dive deep into technical discussions. Practice explaining complex concepts clearly and concisely. You might be asked to solve problems on the spot, so think through your approach to experimentation and iteration in machine learning.

✨Ask Insightful Questions

At the end of the interview, have some thoughtful questions ready. Inquire about the team’s current projects or the challenges they face in finetuning models. This shows your genuine interest in the role and helps you assess if it’s the right fit for you.

Research Engineer / Research Scientist, Finetuning

Anthropic

Apply Now

Research Engineer / Research Scientist, Finetuning

At a Glance

Research Engineer / Research Scientist, Finetuning employer: Anthropic

StudySmarter Expert Advice🤫

We think you need these skills to ace Research Engineer / Research Scientist, Finetuning

Some tips for your application 🫡

How to prepare for a job interview at Anthropic

Company

Product

Help