AI Evaluation Engineer (Remote) (Cardiff) in London

AI Evaluation Engineer (Remote) (Cardiff) in London

London Full-Time 45000 - 55000 € / year (est.) No home office possible
Outlier AI

At a Glance

  • Tasks: Shape the future of AI by providing human feedback to improve autonomous agents.
  • Company: Outlier, a leader in AI innovation with a focus on collaboration.
  • Benefits: Remote work options, competitive salary, and opportunities for professional growth.
  • Other info: Dynamic remote environment with exciting projects and career advancement opportunities.
  • Why this job: Join us to train advanced generative systems and make a real impact.
  • Qualifications: 2+ years in backend engineering or AI automation; strong coding skills required.

The predicted salary is between 45000 - 55000 € per year.

About the Project

Outlier helps the world's most innovative companies improve their AI agents by providing human feedback. We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows.

Whether you are a passionate orchestration guru or experienced software developer — we want you to help us train the world's most advanced generative systems.

Ideal Qualifications

  • 2+ years of experience in backend engineering, AI automation, or complex systems integration.
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
  • Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.

Nice to have

  • Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output.
  • Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
  • High level of comfort implementing persistent state and session discovery to track agent progress.
  • Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

Remote working/work at home options are available for this role.

AI Evaluation Engineer (Remote) (Cardiff) in London employer: Outlier AI

Outlier is an exceptional employer that fosters a collaborative and innovative work culture, allowing AI Evaluation Engineers to shape the future of autonomous agents from the comfort of their own homes in Cardiff. With a strong emphasis on employee growth, we provide opportunities for continuous learning and development while working on cutting-edge projects with leading AI organisations. Our commitment to flexibility and remote work ensures a healthy work-life balance, making Outlier a rewarding place to advance your career in AI.

Outlier AI

Contact Detail:

Outlier AI Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation Engineer (Remote) (Cardiff) in London

Tip Number 1

Network like a pro! Reach out to folks in the AI and tech space on LinkedIn or at meetups. You never know who might have the inside scoop on job openings or can put in a good word for you.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to AI and backend engineering. This gives potential employers a taste of what you can do beyond your CV.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice common coding challenges and be ready to discuss your past projects in detail — we want to see how you think!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Outlier.

We think you need these skills to ace AI Evaluation Engineer (Remote) (Cardiff) in London

Backend Engineering
AI Automation
Complex Systems Integration
Production-Grade Software Development
Modular Software Architecture
Python
JavaScript

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the AI Evaluation Engineer role. Highlight your experience in backend engineering and any relevant projects you've worked on that showcase your skills in AI automation and complex systems integration.

Showcase Your Skills:Don’t just list your skills; demonstrate them! Use specific examples from your past work to show how you’ve built production-grade software or handled multi-turn system interactions. This will help us see your practical experience in action.

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Explain why you're passionate about shaping the future of autonomous agents and how your background aligns with our mission at Outlier. Keep it engaging and personal, so we get a sense of who you are.

Apply Through Our Website:We encourage you to apply through our website for a smoother application process. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates regarding your application status.

How to prepare for a job interview at Outlier AI

Know Your Tech Inside Out

Make sure you brush up on your programming languages, especially Python and JavaScript, as well as SQL databases. Be ready to discuss your past projects in detail, focusing on how you built and maintained production-grade software.

Showcase Your Problem-Solving Skills

Prepare examples of how you've tackled complex system interactions or multi-stage coordination tasks. Highlight any real-world problems you've solved using AI agents or integrations with tools like Supabase or Gmail.

Attention to Detail is Key

Demonstrate your ability to provide clear, high-density technical feedback. Bring examples of how your attention to detail has helped identify subtle failures in systems, such as privacy leaks or authority escalation.

Be Ready for Technical Questions

Expect questions that dive deep into your experience with backend engineering and AI automation. Practice articulating your thought process clearly, as this will show your understanding of complex systems and your ability to communicate effectively.