Production AI Finetuning Research Engineer

Job Board

Companies

Anthropic

Production AI Finetuning Research Engineer

Full-Time Home office (partial)

Apply Now

At a Glance

Tasks: Train cutting-edge AI models and design innovative finetuning techniques.
Company: Join a pioneering AI company focused on impactful research.
Benefits: Competitive salary, equity options, unlimited PTO, and comprehensive health benefits.
Other info: Diverse team culture with opportunities for growth and learning.
Why this job: Make a real impact in AI while collaborating with top researchers.
Qualifications: Strong Python skills and machine learning experience preferred.

As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the models we launch to the public via Claude.AI and our API. In this role, you will design and iterate on state-of-the-art finetuning techniques, such as Constitutional AI and RLHF, to train our production Claude models. You will implement new algorithms, run experiments on data mixes, design evaluations, and improve our production model training pipeline. This role offers the opportunity to contribute to cutting-edge research while also having a direct and measurable impact on the company’s success.

Responsibilities:

Implement and optimize finetuning pipelines to efficiently train production-scale language models with techniques like Constitutional AI.
Develop novel prompts and prompting strategies to improve and test model behaviours.
Collaborate with other research teams to translate novel finetuning techniques into our production model training process, ensuring models are helpful, honest, and harmless.
Design and run a new evaluation that tests Claude’s reasoning capabilities.
Collaborate with a research team to develop a robust evaluation for a new model capability they are developing.
Stay current with state-of-the-art research in AI and machine learning, and propose ways to apply these advancements to production systems.

You may be a good fit if you:

Have significant Python programming experience and machine learning experience.
Are results-oriented, with a bias towards flexibility and impact.
Pick up slack, even if it goes outside your job description.
Enjoy pair programming (we love to pair!).
Want to learn more about machine learning research.
Care about the societal impacts of your work.
Have clear written and verbal communication.

Strong candidates may also have experience with:

Fine-tuning large language models with supervised learning or reinforcement learning.
Developing evaluations for language models.
Complex shared codebases and RL infrastructure.
Authoring research papers in machine learning, NLP, or AI alignment or similar industry experience.

Logistics:

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
US visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate; operations roles are especially difficult to support. But if we make you an offer, we will make every effort to get you into the United States, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Compensation and Benefits:

Anthropic’s compensation package consists of three elements: salary, equity, and benefits. We are committed to pay fairness and aim for these three elements collectively to be highly competitive with market rates.

Equity - For eligible roles, equity will be a major component of the total compensation. We aim to offer higher-than-average equity compensation for a company of our size, and communicate equity amounts at the time of offer issuance.

US Benefits:

Optional equity donation matching.
Comprehensive health, dental, and vision insurance for you and all your dependents.
401(k) plan with 4% matching.
22 weeks of paid parental leave.
Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
Stipends for education, home office improvements, commuting, and wellness.
Fertility benefits via Carrot.
Daily lunches and snacks in our office.
Relocation support for those moving to the Bay Area.

UK Benefits:

Optional equity donation matching.
Private health, dental, and vision insurance for you and your dependents.
Pension contribution (matching 4% of your salary).
21 weeks of paid parental leave.
Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
Health cash plan.
Life insurance and income protection.
Daily lunches and snacks in our office.

Production AI Finetuning Research Engineer employer: Anthropic

Anthropic is an exceptional employer for those passionate about AI and machine learning, offering a collaborative work culture that values innovation and diversity. With competitive compensation packages, including equity and comprehensive benefits, employees enjoy unlimited PTO, generous parental leave, and opportunities for professional growth in a cutting-edge research environment located in the vibrant Bay Area. Join us to make a meaningful impact on the future of AI while working alongside talented individuals who share your commitment to ethical technology.

Contact Details:

Anthropic Recruitment Team

View Anthropic profile

StudySmarter Expert Advice🤫

We think this is how you could land Production AI Finetuning Research Engineer

✨Tip Number 1

Network like a pro! Reach out to folks in the AI and machine learning community, attend meetups, and engage on platforms like LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to finetuning and model training. This gives potential employers a taste of what you can do and sets you apart from the crowd.

✨Tip Number 3

Prepare for interviews by brushing up on the latest trends in AI and machine learning. Be ready to discuss how you can apply cutting-edge techniques like Constitutional AI and RLHF in real-world scenarios. We love candidates who are passionate and informed!

✨Tip Number 4

Don’t hesitate to apply through our website! Even if you don’t tick every box, we value diverse perspectives and experiences. Your unique background could be just what we need to enhance our team!

We think you need these skills to ace Production AI Finetuning Research Engineer

Python Programming

Machine Learning

Finetuning Techniques

Constitutional AI

Reinforcement Learning (RLHF)

Algorithm Implementation

Data Experimentation

Model Evaluation Design

Collaboration Skills

NLP (Natural Language Processing)

Research Paper Authoring

Communication Skills

Adaptability

Results-Oriented Mindset

Some tips for your application 🫡

Show Off Your Skills:Make sure to highlight your Python programming and machine learning experience in your application. We want to see how your skills align with the role, so don’t hold back on showcasing your past projects or any relevant work!

Tailor Your Application:Take a moment to customise your application for this specific role. Mention how your experience with finetuning techniques or model evaluations can contribute to our team. We love seeing candidates who take the time to connect their background with what we do!

Be Yourself:Don’t worry if you don’t meet every single qualification listed. We encourage you to apply even if you think you might not be the perfect fit. Just be genuine about your experiences and interests, and let us see the real you!

Apply Through Our Website:We recommend applying directly through our website for the best chance of getting noticed. It’s super easy, and you’ll be able to keep track of your application status. Plus, we love seeing applications come in through our own platform!

How to prepare for a job interview at Anthropic

✨Know Your Finetuning Techniques

Make sure you brush up on the latest finetuning techniques like Constitutional AI and RLHF. Be ready to discuss how you've implemented these in past projects or how you would approach them in this role. Showing that you’re up-to-date with state-of-the-art research will impress the interviewers.

✨Showcase Your Python Skills

Since significant Python programming experience is a must, prepare to demonstrate your coding skills. You might be asked to solve a problem on the spot, so practice coding challenges related to machine learning. Highlight any projects where you've optimised finetuning pipelines or developed novel prompts.

✨Collaborate and Communicate

This role involves collaboration with other research teams, so be prepared to discuss your teamwork experiences. Share examples of how you’ve worked with others to translate complex ideas into practical applications. Good communication skills are key, so practice articulating your thoughts clearly.

✨Be Ready for Evaluation Design

You’ll need to design and run evaluations for model capabilities, so think about how you would approach this task. Prepare to discuss any previous experience you have with developing evaluations for language models. If you can, bring examples of your work to showcase your thought process.

Production AI Finetuning Research Engineer

Anthropic

Apply Now

Production AI Finetuning Research Engineer

At a Glance

Production AI Finetuning Research Engineer employer: Anthropic

StudySmarter Expert Advice🤫

We think you need these skills to ace Production AI Finetuning Research Engineer

Some tips for your application 🫡

How to prepare for a job interview at Anthropic

Company

Product

Help