AI Evaluation Engineer (Remote) in York

York | Full-Time | 50000 - 60000 € / year (est.) | Home office possible
Outlier AI

At a Glance

  • Tasks: Shape the future of AI by providing human feedback on innovative projects.
  • Company: Join a cutting-edge company focused on improving AI agents.
  • Benefits: Enjoy remote work, competitive salary, and opportunities for professional growth.
  • Other info: Collaborate with top AI organisations in a dynamic and innovative environment.
  • Why this job: Make a real impact on the development of advanced generative systems.
  • Qualifications: 2+ years in backend engineering or AI automation with strong coding skills.

The predicted salary is between 50000 and 60000 € per year.

About the Project

Outlier helps the world’s most innovative companies improve their AI agents by providing human feedback. We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows. Whether you are a passionate orchestration guru or an experienced software developer, we want you to help us train the world's most advanced generative systems.

Ideal Qualifications

  • 2+ years of experience in backend engineering, AI automation, or complex systems integration.
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
  • Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.

Nice to have

  • Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output.
  • Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
  • High level of comfort implementing persistent state and session discovery using MEMORY.md to track agent progress.
  • Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

AI Evaluation Engineer (Remote) in York employer: Outlier AI

At Outlier, we pride ourselves on being an exceptional employer that fosters a collaborative and innovative work culture. Our remote environment allows for flexibility while providing ample opportunities for professional growth in the rapidly evolving field of AI. Join us to not only contribute to groundbreaking projects but also to be part of a team that values your expertise and encourages continuous learning.

Contact Details:

Outlier AI Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the AI Evaluation Engineer (Remote) role in York

Tip Number 1

Network like a pro! Reach out to folks in the AI and software development communities. Attend meetups, webinars, or even online forums. You never know who might have the inside scoop on job openings or can refer you directly.

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving backend engineering or AI automation. This is your chance to demonstrate your expertise in building production-grade software and handling complex systems.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge. Be ready to discuss your experience with languages like Python or JavaScript, and how you've tackled real-world problems using SQL databases. Practice explaining your thought process clearly!

Tip Number 4

Don't forget to apply through our website! Employers love seeing candidates who are genuinely interested in the role. Tailor your application to highlight your relevant experience and passion for shaping the future of AI agents.

We think you need these skills to ace the AI Evaluation Engineer (Remote) role in York

Backend Engineering
AI Automation
Complex Systems Integration
Production-Grade Software Development
Modular Software Architecture
Python
JavaScript

Some tips for your application 🫡

Show Off Your Skills: Make sure to highlight your experience in backend engineering and AI automation. We want to see how you've built production-grade software and tackled complex systems integration, so don’t hold back!

Be Specific: When detailing your technical skills, be specific about the languages you know and the projects you've worked on. Mention any experience with SQL databases and multi-turn system interactions to catch our eye.

Attention to Detail is Key: We love candidates who pay attention to detail. Make sure your application is free from typos and clearly outlines your qualifications. This shows us you can provide high-density technical feedback, which is crucial for the role.

Apply Through Our Website: Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. We can’t wait to see what you bring to the table!

How to prepare for a job interview at Outlier AI

Know Your Tech Inside Out

Make sure you’re well-versed in the programming languages mentioned in the job description, like Python or JavaScript. Brush up on your SQL skills too, as you'll likely need to demonstrate your ability to work with databases during the interview.

Showcase Your Problem-Solving Skills

Prepare to discuss specific examples where you've tackled complex systems integration or AI automation challenges. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your contributions effectively.

Demonstrate Attention to Detail

Since the role requires providing clear technical feedback, be ready to discuss how you ensure accuracy in your work. Bring examples of how your attention to detail has positively impacted past projects, especially in live environments.

Familiarise Yourself with Real-World Applications

Research how AI agents are currently being used in various industries. Be prepared to discuss how you can contribute to projects like OpenClaw and share your thoughts on potential improvements or innovations in AI workflows.