AI Evaluation Engineer (Remote) (London)

AI Evaluation Engineer (Remote) (London)

London Full-Time 50000 - 70000 € / year (est.) No home office possible
E

At a Glance

  • Tasks: Shape the future of AI by providing human feedback to improve autonomous agents.
  • Company: Join Outlier, a leader in AI innovation and collaboration.
  • Benefits: Enjoy remote work flexibility, competitive salary, and opportunities for professional growth.
  • Other info: Dynamic remote work environment with exciting projects and career advancement potential.
  • Why this job: Be at the forefront of AI technology and make a real impact on advanced generative systems.
  • Qualifications: 2+ years in backend engineering or AI automation; strong coding skills in major languages.

The predicted salary is between 50000 - 70000 € per year.

About the Project

Outlier helps the world’s most innovative companies improve their AI agents by providing human feedback. We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows. Whether you are a passionate orchestration guru or experienced software developer, we want you to help us train the world's most advanced generative systems.

Ideal Qualifications

  • 2+ years of experience in backend engineering, AI automation, or complex systems integration.
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
  • Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.

Nice to have

  • Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output.
  • Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
  • High level of comfort implementing persistent state and session discovery to track agent progress.
  • Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

Remote working/work at home options are available for this role.

AI Evaluation Engineer (Remote) (London) employer: Employer near you

Outlier is an exceptional employer that fosters a collaborative and innovative work culture, allowing AI Evaluation Engineers to contribute meaningfully to the future of autonomous agents from the comfort of their own homes in London. With a strong emphasis on employee growth, Outlier offers opportunities to work alongside leading AI organisations, enhancing your skills in backend engineering and complex systems integration while shaping cutting-edge technology. The flexibility of remote work combined with the chance to engage in impactful projects makes Outlier a truly rewarding place to advance your career.

E

Contact Detail:

Employer near you Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation Engineer (Remote) (London)

Tip Number 1

Network like a pro! Reach out to folks in the AI and tech space on LinkedIn or at meetups. We can’t stress enough how personal connections can open doors that applications alone can’t.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those related to AI and backend engineering. This gives us a tangible way to see what you can do beyond your CV.

Tip Number 3

Prepare for interviews by practising common technical questions and scenarios relevant to AI evaluation. We recommend doing mock interviews with friends or using online platforms to get comfortable.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take that extra step!

We think you need these skills to ace AI Evaluation Engineer (Remote) (London)

Backend Engineering
AI Automation
Complex Systems Integration
Production-Grade Software Development
Modular Software Architecture
Python
JavaScript

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the AI Evaluation Engineer role. Highlight your experience in backend engineering and any relevant projects you've worked on that showcase your skills in AI automation and complex systems integration.

Showcase Your Skills:Don’t just list your skills; demonstrate them! Use specific examples from your past work to show how you’ve built production-grade software and handled multi-turn system interactions. This will help us see your practical experience in action.

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Explain why you're passionate about shaping the future of autonomous agents and how your background aligns with our mission at Outlier. Keep it engaging and personal, so we get a sense of who you are.

Apply Through Our Website:We encourage you to apply through our website for the best chance of being considered. It’s straightforward and ensures your application goes directly to us, making it easier for you to join our innovative team!

How to prepare for a job interview at Employer near you

Know Your Tech Inside Out

Make sure you’re well-versed in the programming languages mentioned in the job description, like Python or JavaScript. Brush up on your SQL skills too, as you might be asked to solve problems on the spot.

Showcase Your Experience

Prepare specific examples from your past work that demonstrate your ability to build and maintain production-grade software. Highlight any projects where you’ve worked with complex systems integration or AI automation.

Understand the Project's Goals

Familiarise yourself with Outlier’s mission and the types of AI agents they work with. Being able to discuss how your skills can contribute to shaping autonomous agents will show your genuine interest in the role.

Prepare for Technical Feedback Scenarios

Since the role involves providing clear technical feedback, think about how you would approach giving constructive criticism on system behaviours. Be ready to discuss how you’ve handled similar situations in the past.