AI Evaluation Engineer (Remote) (Sheffield)

AI Evaluation Engineer (Remote) (Sheffield)

Full-Time 45000 - 55000 € / year (est.) Home office possible
Outlier AI

At a Glance

  • Tasks: Shape the future of AI by providing human feedback to improve autonomous agents.
  • Company: Outlier, a leader in AI collaboration with innovative companies.
  • Benefits: Remote work options, competitive salary, and opportunities for professional growth.
  • Other info: Dynamic remote environment with exciting projects and career advancement.
  • Why this job: Join us to train advanced generative systems and make a real impact.
  • Qualifications: 2+ years in backend engineering or AI automation; strong coding skills required.

The predicted salary is between 45000 - 55000 € per year.

Outlier helps the world's most innovative companies improve their AI agents by providing human feedback. Do you want to shape the future of autonomous agents like OpenClaw? We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows.

Whether you are a passionate orchestration guru or experienced software developer — we want you to help us train the world's most advanced generative systems.

Ideal Qualifications
  • 2+ years of experience in backend engineering, AI automation, or complex systems integration.
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
  • Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.
Nice to have
  • Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output.
  • Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
  • High level of comfort implementing persistent state and session discovery using to track agent progress.
  • Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

Remote working/work at home options are available for this role.

AI Evaluation Engineer (Remote) (Sheffield) employer: Outlier AI

Outlier is an exceptional employer that fosters a collaborative and innovative work culture, allowing AI Evaluation Engineers to contribute meaningfully to cutting-edge projects from the comfort of their homes in Sheffield. With a strong emphasis on employee growth, Outlier offers opportunities for professional development and the chance to work alongside leading AI organisations, making it an ideal place for those passionate about shaping the future of autonomous agents.

Outlier AI

Contact Detail:

Outlier AI Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation Engineer (Remote) (Sheffield)

Tip Number 1

Network like a pro! Reach out to folks in the AI and software development communities. Join relevant forums, attend meetups, or even slide into LinkedIn DMs. You never know who might have the inside scoop on job openings!

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving AI automation or backend engineering. This is your chance to demonstrate your expertise in building production-grade software and handling complex systems.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge. Be ready to discuss your experience with languages like Python or JavaScript, and how you've tackled real-world problems. Practice explaining your thought process clearly and concisely.

Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are proactive about their job search. So, get that application in and let’s shape the future of AI together!

We think you need these skills to ace AI Evaluation Engineer (Remote) (Sheffield)

Backend Engineering
AI Automation
Complex Systems Integration
Production-Grade Software Development
Modular Software Architecture
Python
JavaScript

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the AI Evaluation Engineer role. Highlight your experience in backend engineering and any relevant projects you've worked on that showcase your skills in AI automation and complex systems integration.

Showcase Your Skills:Don’t just list your skills; demonstrate them! Use specific examples from your past work to show how you’ve built production-grade software and handled multi-turn system interactions. This will help us see your practical experience in action.

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Explain why you're passionate about shaping the future of autonomous agents and how your background makes you a perfect fit for our team. Keep it engaging and personal!

Apply Through Our Website:We encourage you to apply through our website for a smoother application process. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates from our team!

How to prepare for a job interview at Outlier AI

Know Your Tech Inside Out

Make sure you brush up on your backend engineering skills and the programming languages mentioned in the job description. Be ready to discuss your experience with Python, JavaScript, or any other relevant language, and how you've used them in real-world projects.

Showcase Your Problem-Solving Skills

Prepare examples of how you've tackled complex systems integration or AI automation challenges. Think about specific instances where you built production-grade software and how you approached multi-turn system interactions.

Be Detail-Oriented

Since attention to detail is crucial for this role, practice articulating your thought process when providing technical feedback. Highlight any experiences where your meticulousness led to significant improvements in a project.

Familiarise Yourself with Real-World Applications

Research how AI agents are integrated with tools like Supabase and Gmail. Be prepared to discuss how you would approach building multi-stage coordination tasks and solving real-world problems using these technologies.