Member of Technical Staff - Evaluations
Member of Technical Staff - Evaluations

Member of Technical Staff - Evaluations

Full-Time 36000 - 60000 £ / year (est.) Home office (partial)
R

At a Glance

  • Tasks: Conduct analysis to enhance AI model capabilities and develop evaluation frameworks.
  • Company: Join a cutting-edge AI team with experts from top tech companies.
  • Benefits: Top-tier salary, comprehensive health benefits, and generous parental leave.
  • Why this job: Make a real impact in the AI frontier and shape the future of technology.
  • Qualifications: Strong skills in statistical analysis and familiarity with LLM evaluation methods.
  • Other info: Dynamic startup environment with opportunities for personal and professional growth.

The predicted salary is between 36000 - 60000 £ per year.

Reflection’s mission is to build open superintelligence and make it accessible to all. We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.

About The Role

  • Conduct critical comparative analysis to advance our understanding of model capabilities.
  • Build and refine evaluation systems and processes that create tight feedback loops between data, evals, and model behaviour.
  • Develop generalizable evaluation frameworks that capture what matters for reasoning, alignment, and usefulness.
  • Collaborate closely with pre-training, post-training, and applied teams to translate insights into model improvements.
  • Push the boundaries of what’s measurable, from synthetic evals to human feedback and real-world interaction data.

About You

  • Strong statistical analysis and experimental design skills to rigorously measure model improvements.
  • Familiarity with LLM evaluation methodologies: static benchmarks, human preference evals, and/or agentic tasks.
  • High agency and thrive in a fast-paced startup environment; bias for impact over process.
  • Excited to work in a new frontier lab, defining how we measure and accelerate progress toward more capable models.
  • Collaborative, detail-oriented, and motivated by building the feedback loops that make models truly improve.

What We Offer

  • Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
  • Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.
  • Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.
  • Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time.
  • Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Member of Technical Staff - Evaluations employer: Reflection AI

Reflection is an exceptional employer, offering a unique opportunity to work at the forefront of AI research in a collaborative and innovative environment. With top-tier compensation, comprehensive health benefits, and a strong emphasis on work-life balance, employees are empowered to make impactful contributions while enjoying a supportive culture that values personal and professional growth. Join us in shaping the future of open superintelligence alongside a talented team from leading tech companies.
R

Contact Detail:

Reflection AI Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Member of Technical Staff - Evaluations

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those at Reflection or similar companies. A friendly chat can open doors that applications alone can't.

✨Tip Number 2

Show off your skills in real-time! If you get the chance, participate in hackathons or workshops related to AI evaluations. It’s a great way to demonstrate your expertise and passion.

✨Tip Number 3

Prepare for interviews by diving deep into LLM evaluation methodologies. Brush up on your statistical analysis skills and be ready to discuss how you can contribute to building effective feedback loops.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets noticed. Plus, we love seeing candidates who are proactive about their job search.

We think you need these skills to ace Member of Technical Staff - Evaluations

Statistical Analysis
Experimental Design
LLM Evaluation Methodologies
Static Benchmarks
Human Preference Evaluations
Agentic Tasks
Collaboration
Attention to Detail
Feedback Loop Development
Model Improvement
Adaptability
High Agency
Impact Orientation

Some tips for your application 🫡

Show Your Passion: When writing your application, let your enthusiasm for AI and model evaluation shine through. We want to see that you’re genuinely excited about the work we do at Reflection and how you can contribute to our mission of building open superintelligence.

Tailor Your Experience: Make sure to highlight your relevant skills and experiences that align with the role. Whether it’s your statistical analysis prowess or familiarity with LLM evaluation methodologies, we want to see how your background fits into our team’s goals.

Be Clear and Concise: Keep your application straightforward and to the point. We appreciate clarity, so avoid jargon and focus on communicating your ideas effectively. This will help us understand your thought process and how you approach problem-solving.

Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it shows you’re serious about joining our team!

How to prepare for a job interview at Reflection AI

✨Know Your Stats

Brush up on your statistical analysis and experimental design skills. Be ready to discuss how you've measured model improvements in the past, as this role requires a strong foundation in these areas.

✨Familiarise with LLM Evaluation

Make sure you understand various LLM evaluation methodologies, such as static benchmarks and human preference evaluations. Prepare examples of how you've applied these methods in previous projects to showcase your expertise.

✨Embrace the Startup Vibe

This is a fast-paced environment, so demonstrate your high agency and ability to thrive under pressure. Share experiences where you've made impactful decisions quickly, showing that you prioritise results over processes.

✨Collaboration is Key

Highlight your collaborative spirit by discussing past experiences where you've worked closely with different teams. Emphasise your detail-oriented approach and how you've built effective feedback loops to improve models.

Member of Technical Staff - Evaluations
Reflection AI

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

R
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>