AI Evaluation TPM — Cross-Functional Impact

AI Evaluation TPM — Cross-Functional Impact

Full-Time Home office (partial)
Anthropic

At a Glance

  • Tasks: Lead AI model evaluation initiatives and drive high-priority projects across teams.
  • Company: Join a pioneering tech company committed to responsible AI development.
  • Benefits: Competitive salary, equity options, unlimited PTO, and comprehensive health benefits.
  • Other info: Dynamic work environment with opportunities for growth and collaboration.
  • Why this job: Shape the future of AI while making a real impact on society.
  • Qualifications: Experience in technical program management and strong communication skills required.

About the role: We are seeking a Technical Program Manager to lead our AI model evaluation initiatives across multiple workstreams. This role will be crucial in assessing the performance, capabilities, limitations, and potential risks of our AI models. Working closely with our Research, Trust & Safety, Frontier Redteaming, and Policy teams, you will drive high-priority evaluation projects to build new processes, align metrics with policy, and track measurable progress. You will help build and adapt the model evaluation program to ensure model deployments are rigorous and aligned with our commitment to responsible AI development.

The ideal candidate will have a strong technical background and experience managing cross-functional programs in AI development, ML engineering, or related fields. You’ll be joining a team of Technical Program Managers who own and drive cross-functional programs that align to the company’s top priorities. In this role, you’ll have the opportunity to make a foundational impact as you contribute to the scaling of a centralized TPM function for the company.

Extremely strong soft skills are paramount, as our team is front and center in driving lots of company-wide changes and top priority initiatives that require generating buy-in, balancing various opinions, and competing for attention in our rapidly scaling environment. This role is a great fit for someone who has both seen excellence at scale and operated in rapidly scaling, high-ambiguity teams and scope. We are seeking candidates with deep TPM expertise but who are comfortable acting as adaptable generalists who add value fast.

We excel at maintaining a broad view of our work but diving deep into the details when necessary. We understand business goals, translate and organize them into technical programs and projects, and drive execution. We are adept at engaging with both non-technical and technical stakeholders at all levels of the company, including executive leadership.

In this role, you will have the opportunity to shape the development of advanced AI systems and contribute to Anthropic's mission of ensuring that AI benefits all of humanity. If you are passionate about responsible AI development, have a strong technical background, and thrive in a fast-paced, collaborative environment, we'd love to hear from you.

Responsibilities:

  • Partner with teams like Frontier Risk Evaluations, Security, and Trust & Safety to develop and implement comprehensive evaluation protocols for our latest frontier AI models.
  • Build a single source of truth for tracking all types of model evaluations as required by our Responsible Scaling Policy, AI safety institutes, the White House, and others.
  • Develop and maintain procedures for conducting evaluations, including designing test suites, coordinating red team exercises, and analyzing results.
  • Create and manage dashboards and reporting systems to track model performance, safety metrics, and evaluation outcomes across different AI systems and versions.
  • Lead cross-functional workshops to identify potential risks and edge cases for evaluation, ensuring thorough coverage of AI capabilities and limitations.
  • Coordinate with external partners and industry standards bodies to align our evaluation practices with emerging best practices in responsible AI development.
  • Provide detailed status reports, identifying technical risks, dependencies, and areas requiring additional support.
  • Facilitate communication and coordination between technical workstreams and stakeholders.
  • Continuously identify opportunities for technical process improvements and implement changes as needed.
  • Stay up-to-date with the latest developments in AI safety, ML engineering, and related fields to ensure the program remains at the forefront of responsible AI development.

You might be a good fit if you:

  • Have several years of experience in technical program management, with a track record of successfully delivering complex technical programs, preferably in AI development, ML engineering, or related fields.
  • Have experience executing technical programs that require systems and engineering-level knowledge.
  • Have exceptionally strong interpersonal and communication skills that enable you to influence without authority, build cross-organizational support, cooperation and action around initiatives and process adoption.
  • Have experience prompt engineering on language models.
  • Have experience designing and/or running evaluations on Large Language Models.
  • Have knowledge of emerging AI governance frameworks and best practices.
  • Have a high threshold for navigating ambiguity and are able to balance setting strategic priorities with rapid, high-quality execution.
  • Thrive in unstructured environments, and have a knack for bringing order to chaos.

The expected salary range for this position is: Annual Salary: $300,000—$320,000 USD.

Logistics

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

US visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate; operations roles are especially difficult to support. But if we make you an offer, we will make every effort to get you into the United States, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Compensation and Benefits

Anthropic’s compensation package consists of three elements: salary, equity, and benefits. We are committed to pay fairness and aim for these three elements collectively to be highly competitive with market rates.

Equity - For eligible roles, equity will be a major component of the total compensation. We aim to offer higher-than-average equity compensation for a company of our size, and communicate equity amounts at the time of offer issuance.

US Benefits - The following benefits are for our US-based employees: Optional equity donation matching. Comprehensive health, dental, and vision insurance for you and all your dependents. 401(k) plan with 4% matching. 22 weeks of paid parental leave. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more! Stipends for education, home office improvements, commuting, and wellness. Fertility benefits via Carrot. Daily lunches and snacks in our office. Relocation support for those moving to the Bay Area.

UK Benefits - The following benefits are for our UK-based employees: Optional equity donation matching. Private health, dental, and vision insurance for you and your dependents. Pension contribution (matching 4% of your salary). 21 weeks of paid parental leave. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more! Health cash plan. Life insurance and income protection. Daily lunches and snacks in our office.

AI Evaluation TPM — Cross-Functional Impact employer: Anthropic

At Anthropic, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters collaboration and innovation in the rapidly evolving field of AI. Our commitment to employee growth is evident through comprehensive benefits, including unlimited PTO, generous parental leave, and equity opportunities, all designed to support a healthy work-life balance. Join us in our Bay Area office, where you'll have the chance to make a meaningful impact on responsible AI development while working alongside a diverse team of experts dedicated to shaping the future of technology.

Anthropic

Contact Details:

Anthropic Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation TPM — Cross-Functional Impact

Tip Number 1

Network like a pro! Reach out to folks in your industry, especially those already working at Anthropic. A friendly chat can open doors and give you insider info on the role.

Tip Number 2

Prepare for interviews by diving deep into AI evaluation topics. Brush up on your technical knowledge and be ready to discuss how you can contribute to responsible AI development.

Tip Number 3

Show off your soft skills! During interviews, highlight your ability to communicate and collaborate across teams. We love candidates who can balance opinions and drive initiatives.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we’re always looking for passionate individuals who want to make an impact.

We think you need these skills to ace AI Evaluation TPM — Cross-Functional Impact

Technical Program Management
AI Model Evaluation
Cross-Functional Collaboration
Interpersonal Skills
Communication Skills
Prompt Engineering
Evaluation Design for Large Language Models

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter for the AI Evaluation TPM role. Highlight your experience in technical program management and any relevant projects you've led. We want to see how your skills align with our mission of responsible AI development!

Showcase Your Soft Skills:Since this role requires strong interpersonal skills, don’t shy away from sharing examples of how you've influenced teams or navigated complex situations. We love seeing candidates who can balance technical expertise with excellent communication!

Be Clear and Concise:When writing your application, keep it straightforward and to the point. Use bullet points where possible to make your achievements stand out. We appreciate clarity and want to quickly understand how you can contribute to our team.

Apply Through Our Website:We encourage you to submit your application directly through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it shows you’re keen on joining our awesome team!

How to prepare for a job interview at Anthropic

Know Your AI Stuff

Make sure you brush up on the latest trends and developments in AI and ML. Understand the technical aspects of model evaluation, as well as the ethical implications. This will help you engage confidently with both technical and non-technical stakeholders during the interview.

Showcase Your Soft Skills

Since this role requires strong interpersonal skills, be ready to demonstrate how you've influenced teams and built cross-organisational support in the past. Prepare examples that highlight your ability to navigate ambiguity and drive initiatives forward, even when faced with competing priorities.

Prepare for Cross-Functional Scenarios

Think about how you would approach working with different teams like Research, Trust & Safety, and Policy. Be ready to discuss how you would develop comprehensive evaluation protocols and manage workshops to identify risks. This shows you can think strategically and work collaboratively.

Ask Insightful Questions

Prepare thoughtful questions that reflect your understanding of the company's mission and the role's responsibilities. Inquire about their current challenges in AI evaluation or how they measure success in their projects. This not only shows your interest but also your proactive mindset.