Machine Learning Evaluation Engineer in London
Machine Learning Evaluation Engineer

Machine Learning Evaluation Engineer in London

London Full-Time 60000 - 80000 £ / year (est.) No home office possible
M

At a Glance

  • Tasks: Evaluate and improve AI systems to enhance writing experiences for users.
  • Company: Join Marker, an innovative AI-native word processor transforming the future of writing.
  • Benefits: Competitive salary, equity options, and a supportive work environment.
  • Why this job: Be at the forefront of AI technology and creativity, shaping the future of writing.
  • Qualifications: Experience in AI evaluation, Python programming, and a passion for creative writing.
  • Other info: Collaborate directly with leadership in a dynamic, growth-oriented team.

The predicted salary is between 60000 - 80000 £ per year.

AI Evaluation, Research Methods, Python, LLM Observability

Salary range: £60,000-£80,000 p.a. + equity, depending on experience (up to £100,000 for candidates with exceptional relevant experience)

What is Marker? Marker is an AI-native Word Processor – a reimagining of Google Docs and Microsoft Word. Join us in building the next generation of agentic AI assistants supporting serious writers in their work. We are a small, ambitious company using cutting-edge technology to give everybody writing superpowers.

What you'll do at Marker:

  • Design and implement evaluation frameworks for complex, subjective AI outputs (like writing feedback that's meant to inspire rather than just correct).
  • Build flexible evaluation pipelines that can assess quality across multiple dimensions - from human preference to actual writing improvement.
  • Research and prototype new evaluation methodologies for creative and subjective AI tasks.
  • Collaborate with our engineering team to integrate evaluation insights into our development process.
  • Help define what "quality" means for different AI outputs and create metrics that actually matter for our users.
  • Work on challenging problems like: "How do we automatically evaluate whether an AI comment successfully encourages thoughtful revision?"

What we can offer:

  • A calm, human-friendly work environment among kind and experienced professionals.
  • Fun, creative, novel, and interesting technical work at the intersection of AI research and product development.
  • An opportunity to work with and learn about the latest advancements in AI evaluation and language models.
  • Direct collaboration with leadership to shape how we understand and improve our AI systems.
  • As much responsibility and growth opportunities as you want to take on.

Are you a good fit for this role? In order to be successful in this role, you will recognise yourself in the following:

  • You have experience with AI/ML evaluation methodologies and can speak the language of AI research.
  • You've worked hands-on with language models and understand the challenges of evaluating subjective, creative outputs.
  • You are a self-starter willing to work independently and at speed - we imagine a 2-week experiment cadence at most.
  • You are familiar with and have worked on related technical systems (evaluation pipelines, data collection tools) but don't need to be a full-stack engineer.
  • You think critically about what metrics actually matter and aren't satisfied with vanity metrics.
  • You're comfortable working with ambiguous problems where the "right answer" isn't obvious.
  • You have some programming experience (Python preferred) and can work independently on technical projects.
  • You're interested in the intersection of AI capabilities and human creativity.

An exceptional candidate for this role would be able to demonstrate some of the following:

  • Experience building evaluation systems for generative AI in production environments.
  • Knowledge of TypeScript and ability to integrate with our existing systems.
  • Background in human-computer interaction, computational creativity, or writing research.
  • Experience with A/B testing, statistical analysis, and experimental design.
  • Familiarity with modern AI observability and monitoring tools.
  • Published research or deep interest in AI evaluation methodologies.
  • Interest in writing (fiction, non-fiction, essays).

However, you are NOT expected to:

  • Be a senior software engineer - we're looking for someone who can build evaluation systems, not architect our entire backend.
  • Have solved every evaluation problem before - this is cutting-edge work and we're figuring it out together.
  • Be experienced with every library in our stack from day one - you'll work closely with Ryan and our engineering team.
  • Have a specific degree - we value practical experience and research ability over credentials.

Our stack:

  • Our AI engine uses a range of models, including self-hosted and fine-tuned open source models, as well as latest reasoning models from Anthropic and OpenAI.
  • Evaluation and research tools built primarily in Python, with integration into our TypeScript infrastructure.
  • Our agentic AI execution platform is written in TypeScript, hosted on Cloudflare Workers.
  • Standard ML tooling: various evaluation frameworks, data analysis tools, and monitoring systems.
  • Our text editor frontend is a web application built with React, TypeScript and ProseMirror.

Apply now! Interested? Email us at work@writewithmarker.com with your CV (or a link to your CV site). Tell us a little bit about yourself and why you'd like to work at Marker!

Please note that this role is currently only available based in our London hub, and at this time we are not able to sponsor work visas in the UK.

Machine Learning Evaluation Engineer in London employer: Marker

At Marker, we pride ourselves on fostering a calm and human-friendly work environment that encourages creativity and innovation. As a Machine Learning Evaluation Engineer, you'll have the unique opportunity to collaborate directly with leadership, engage in cutting-edge AI research, and take on as much responsibility as you desire, all while being part of a small, ambitious team dedicated to empowering writers. With competitive salaries, equity options, and a focus on employee growth, Marker is an excellent employer for those looking to make a meaningful impact in the world of AI.
M

Contact Detail:

Marker Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Machine Learning Evaluation Engineer in London

✨Tip Number 1

Get to know the company inside out! Research Marker, their products, and their mission. This way, when you chat with them, you can show off your enthusiasm and how you fit into their vision.

✨Tip Number 2

Network like a pro! Connect with current employees on LinkedIn or attend industry events. A friendly face can make all the difference when it comes to landing that interview.

✨Tip Number 3

Prepare for the interview by practising common questions related to AI evaluation and research methods. We want you to feel confident and ready to showcase your skills in Python and LLM observability.

✨Tip Number 4

Don’t forget to follow up after your interview! A quick thank-you email can leave a lasting impression and remind them why you’re the perfect fit for the role.

We think you need these skills to ace Machine Learning Evaluation Engineer in London

AI Evaluation Methodologies
Research Methods
Python Programming
Language Models
Evaluation Frameworks
Evaluation Pipelines
Data Collection Tools
Critical Thinking
Metrics Development
A/B Testing
Statistical Analysis
Experimental Design
Human-Computer Interaction
Computational Creativity
AI Observability

Some tips for your application 🫡

Show Your Passion for AI and Writing: When you write to us, let your enthusiasm for AI and writing shine through! Share what excites you about the future of writing and how you see AI playing a role in it. This is your chance to connect with us on a personal level.

Tailor Your CV to Highlight Relevant Experience: Make sure your CV reflects your experience with AI evaluation methodologies and any hands-on work with language models. We want to see how your background aligns with the role, so don’t hold back on showcasing your skills!

Be Clear and Concise: Keep your application straightforward and to the point. We appreciate clarity, so avoid jargon unless it’s relevant. A well-structured email will make it easier for us to see why you’re a great fit for the team.

Apply Through Our Website: While emailing us is great, we encourage you to apply through our website as well. It helps us keep track of applications better and ensures you don’t miss out on any important updates from us!

How to prepare for a job interview at Marker

✨Know Your AI Evaluation Inside Out

Make sure you brush up on the latest AI evaluation methodologies. Be ready to discuss your hands-on experience with language models and how you've tackled the challenges of evaluating subjective outputs. This will show that you’re not just familiar with the theory but have practical insights to share.

✨Showcase Your Problem-Solving Skills

Prepare to discuss specific examples where you've worked on ambiguous problems. Think about how you approached these challenges and what metrics you deemed important. This will demonstrate your critical thinking and ability to navigate complex issues, which is key for this role.

✨Familiarise Yourself with Their Tech Stack

Get to know the technologies mentioned in the job description, especially Python and TypeScript. If you can, try to work on a small project or two using these languages. Being able to speak their language will help you connect with the team and show your genuine interest in the role.

✨Express Your Passion for Writing and AI

Since Marker is all about enhancing writing through AI, be prepared to share your thoughts on the intersection of AI capabilities and human creativity. Whether it’s your own writing experiences or your views on AI's role in creative processes, showing your enthusiasm will resonate well with the interviewers.

Machine Learning Evaluation Engineer in London
Marker
Location: London

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

M
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>