AI Evaluation Engineer (Remote) (Leeds) in Aberford

AI Evaluation Engineer (Remote) (Leeds) in Aberford

Aberford Full-Time 50000 - 60000 € / year (est.) No home office possible
Outlier AI

At a Glance

  • Tasks: Shape the future of AI by providing human feedback to improve autonomous agents.
  • Company: Outlier, a leader in AI innovation and collaboration.
  • Benefits: Remote work options, competitive salary, and opportunities for professional growth.
  • Other info: Dynamic remote environment with exciting projects and career advancement.
  • Why this job: Join us to train advanced generative systems and make a real impact.
  • Qualifications: 2+ years in backend engineering or AI automation with strong coding skills.

The predicted salary is between 50000 - 60000 € per year.

About the Project

Outlier helps the world's most innovative companies improve their AI agents by providing human feedback. Do you want to shape the future of autonomous agents like OpenClaw? We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows.

Whether you are a passionate orchestration guru or experienced software developer — we want you to help us train the world's most advanced generative systems.

Ideal Qualifications

  • 2+ years of experience in backend engineering, AI automation, or complex systems integration.
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
  • Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.

Nice to have

  • Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output.
  • Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
  • High level of comfort implementing persistent state and session discovery using to track agent progress.
  • Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

Remote working/work at home options are available for this role.

AI Evaluation Engineer (Remote) (Leeds) in Aberford employer: Outlier AI

Outlier is an exceptional employer that fosters a collaborative and innovative work culture, allowing AI Evaluation Engineers to shape the future of autonomous agents from the comfort of their own homes. With a strong emphasis on employee growth, Outlier offers opportunities to work on cutting-edge projects with leading AI organisations, ensuring that team members are at the forefront of technology while enjoying the flexibility of remote work. Join us in making a meaningful impact in the world of AI, where your contributions will be valued and recognised.

Outlier AI

Contact Detail:

Outlier AI Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation Engineer (Remote) (Leeds) in Aberford

Tip Number 1

Network like a pro! Reach out to people in the AI and tech space, especially those who work at companies you're interested in. A friendly chat can open doors that a CV just can't.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to AI and backend engineering. This gives potential employers a taste of what you can do.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice common coding challenges and be ready to discuss your past experiences in detail.

Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are proactive about their job search.

We think you need these skills to ace AI Evaluation Engineer (Remote) (Leeds) in Aberford

Backend Engineering
AI Automation
Complex Systems Integration
Production-Grade Software Development
Modular Software Architecture
Python
JavaScript

Some tips for your application 🫡

Show Your Passion for AI:When you're writing your application, let your enthusiasm for AI shine through! We want to see how excited you are about shaping the future of autonomous agents. Share any relevant projects or experiences that highlight your passion.

Tailor Your Experience:Make sure to customise your application to reflect the qualifications we’re looking for. Highlight your experience in backend engineering and any specific projects where you've worked with AI automation or complex systems integration. This helps us see how you fit into our team!

Be Clear and Concise:We appreciate clarity! When detailing your skills and experiences, keep it straightforward and to the point. Use bullet points if necessary to make it easy for us to read through your application quickly.

Apply Through Our Website:Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re serious about joining our team at StudySmarter!

How to prepare for a job interview at Outlier AI

Know Your Tech Inside Out

Make sure you brush up on your backend engineering skills and the programming languages mentioned in the job description. Be ready to discuss your experience with Python, JavaScript, or any other relevant language, and how you've used them in real-world projects.

Showcase Your Problem-Solving Skills

Prepare examples of how you've tackled complex systems integration or AI automation challenges. Think about specific instances where you built production-grade software and how you approached multi-turn system interactions.

Be Ready for Technical Feedback

Since the role involves providing high-density technical feedback, practice articulating your thoughts clearly. You might be asked to evaluate a system's behaviour, so think about how you would communicate your insights effectively.

Familiarise Yourself with Real-World Applications

Research how AI agents are integrated with tools like Supabase and Gmail. Being able to discuss practical applications and potential pitfalls, such as privacy leaks or authority escalation, will show that you understand the complexities of the role.