At a Glance
- Tasks: Create innovative evaluation benchmarks and tools for AI model performance.
- Company: Join a cutting-edge AI research team at Cohere.
- Benefits: Enjoy 6 weeks vacation, health benefits, and a flexible remote work environment.
- Why this job: Make a real impact in the future of AI evaluation methods.
- Qualifications: Strong software engineering skills and experience with LLMs.
- Other info: Diverse and inclusive culture with excellent career growth opportunities.
The predicted salary is between £36,000 and £60,000 per year.
Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers.
Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.
Why this role?
Evaluation is critical to making progress in scaling intelligence. As models continue to become superhuman in many real-world use cases, we must continue to develop new evaluation techniques that accurately reflect what models are already capable of, as well as set the agenda for what future models should be capable of. In this role, you are responsible for creating these next-generation evaluation methods and infrastructure to measure LLM progress.
As a Senior Research Scientist, Model Evaluation, You Will:
- Create ambitious new evaluation benchmarks that push the limits of what our models can accomplish.
- Work on highly cross-functional teams to translate model feedback into trustworthy, repeatable evaluations.
- Conduct research to advance the state-of-the-art in LLM evaluation methods, including training LLM judges; refining LLM-based data synthesis pipelines; and improving evaluation efficiency.
- Build scalable and reusable tools for digging into model performance.
You May Be a Good Fit If:
- You enjoy rapidly building prototypes that demonstrate the boundaries of what LLMs are capable of, and you have developed resources to measure those capabilities.
- You have spent dozens of hours reviewing complex data and LLM outputs to ensure high data quality.
- You are obsessive about rigorously measuring AI capabilities, and also about making sure your measurements actually align with the capabilities you care about.
- You have strong software engineering skills.
If some of the above doesn't line up perfectly with your experience, we still encourage you to apply!
Full-Time Employees At Cohere Enjoy These Perks:
- An open and inclusive culture and work environment
- Work closely with a team on the cutting edge of AI research
- Weekly lunch stipend, in-office lunches & snacks
- Full health and dental benefits, including a separate budget to take care of your mental health
- 100% Parental Leave top-up for up to 6 months
- Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
- Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
- 6 weeks of vacation (30 working days!)
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.
Senior Research Scientist, Model Evaluation in London employer: Cohere
Contact Detail:
Cohere Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Senior Research Scientist, Model Evaluation role in London
✨ Tip Number 1
Network like a pro! Reach out to folks in the AI and research community, especially those who work at Cohere. A friendly chat can open doors and give you insights that a job description just can't.
✨ Tip Number 2
Show off your skills! If you've got prototypes or projects that demonstrate your evaluation methods or model capabilities, share them. A portfolio can speak volumes about your expertise and passion.
✨ Tip Number 3
Prepare for the interview by diving deep into Cohere's work. Understand their models and evaluation techniques. This will not only impress them but also help you ask insightful questions during your chat.
✨ Tip Number 4
Don't forget to apply through our website! It's the best way to ensure your application gets the attention it deserves. Plus, we love seeing candidates who take that extra step.
Some tips for your application 🫡
Show Your Passion for AI: When you're writing your application, let your enthusiasm for AI and model evaluation shine through. We want to see how excited you are about pushing the boundaries of what's possible with LLMs!
Be Specific About Your Experience: Don't just list your previous roles; dive into the details! Share specific projects where you've developed evaluation methods or worked with LLMs. This helps us understand your hands-on experience and how it aligns with our needs.
Tailor Your Application: Make sure to customise your application for this role. Highlight relevant skills and experiences that match the job description. We love seeing candidates who take the time to connect their background with what we're looking for!
Apply Through Our Website: We encourage you to apply directly through our website. It's the best way for us to receive your application and ensures you don't miss out on any important updates during the process!
How to prepare for a job interview at Cohere
✨ Know Your Models Inside Out
Make sure you're well-versed in the latest advancements in LLMs and evaluation techniques. Brush up on your knowledge of model capabilities and be ready to discuss how you've applied these in your previous work.
✨ Showcase Your Prototyping Skills
Prepare to talk about any prototypes you've built that demonstrate the limits of LLMs. Bring examples of your work that highlight your ability to create ambitious evaluation benchmarks and tools.
✨ Emphasise Cross-Functional Collaboration
Since this role involves working with diverse teams, be ready to share experiences where you successfully collaborated across different functions. Highlight how you translated feedback into actionable evaluations.
✨ Demonstrate Your Rigor
Be prepared to discuss your approach to ensuring high data quality and rigorous measurement of AI capabilities. Share specific examples of how you've maintained standards in your evaluations and what methods you used.