AI Evaluation Engineer (Remote) (City of Edinburgh) in Livingston

Livingston | Full-Time | 50000 - 60000 € / year (est.) | Home office possible
Outlier AI

At a Glance

  • Tasks: Help shape the future of AI by providing human feedback on innovative projects.
  • Company: Join Outlier, a leader in AI collaboration and innovation.
  • Benefits: Enjoy remote work flexibility, competitive salary, and opportunities for professional growth.
  • Other info: Dynamic remote environment with exciting challenges and career advancement.
  • Why this job: Make a real impact on cutting-edge AI systems and their development.
  • Qualifications: 2+ years in backend engineering or AI automation; strong coding skills required.

The predicted salary is between 50000 and 60000 € per year.

About the Project

Outlier helps the world’s most innovative companies improve their AI agents by providing human feedback. We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows. Whether you are a passionate orchestration guru or an experienced software developer, we want you to help us train the world's most advanced generative systems.

Ideal Qualifications

  • 2+ years of experience in backend engineering, AI automation, or complex systems integration.
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
  • Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.

Nice to have

  • Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output.
  • Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
  • High level of comfort implementing persistent state and session discovery to track agent progress.
  • Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

Remote working/work at home options are available for this role.

AI Evaluation Engineer (Remote) (City of Edinburgh) in Livingston employer: Outlier AI

Outlier is an exceptional employer that fosters a collaborative and innovative work culture, allowing AI Evaluation Engineers to shape the future of autonomous agents from the comfort of their own homes in Edinburgh. With a strong emphasis on employee growth, we provide opportunities for continuous learning and development while working on cutting-edge projects with leading AI organisations. Our commitment to flexibility and a supportive environment makes Outlier a rewarding place to advance your career in AI technology.

Contact Details:

Outlier AI Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land the AI Evaluation Engineer (Remote) (City of Edinburgh) role in Livingston

Tip Number 1

Network like a pro! Reach out to folks in the AI and tech space, especially those who work with autonomous agents. A friendly chat can open doors that a CV just can't.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repo showcasing your projects related to backend engineering or AI automation. This gives us a tangible way to see what you can do.

Tip Number 3

Prepare for the interview by brushing up on your technical knowledge. Be ready to discuss your experience with multi-turn system interactions and how you've tackled complex problems in the past.

Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets the attention it deserves. Plus, we love seeing candidates who take that extra step.

We think you need these skills to ace the AI Evaluation Engineer (Remote) (City of Edinburgh) role in Livingston

Backend Engineering
AI Automation
Complex Systems Integration
Production-Grade Software Development
Modular Software Architecture
Python
JavaScript

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the AI Evaluation Engineer role. Highlight your experience in backend engineering and any relevant projects you've worked on that align with our focus on AI automation and complex systems integration.

Showcase Your Skills: We want to see your technical skills shine! Be sure to mention your proficiency in at least two major programming languages and your experience with SQL databases. Specific examples of your work will help us understand your capabilities better.

Detail Your Experience: When describing your past roles, focus on your hands-on experience with live environments and multi-turn system interactions. We love seeing how you've tackled real-world problems, so don't hold back on the details!

Apply Through Our Website: To make sure your application gets the attention it deserves, apply directly through our website. It’s the best way for us to keep track of your application and ensure you’re considered for this exciting opportunity!

How to prepare for a job interview at Outlier AI

Know Your Tech Inside Out

Make sure you brush up on your backend engineering skills and the programming languages mentioned in the job description. Be ready to discuss your experience with Python, JavaScript, or any other relevant language, and how you've used them in real-world projects.

Showcase Your Problem-Solving Skills

Prepare examples of how you've tackled complex systems integration or AI automation challenges. Think about specific instances where you built production-grade software and how you approached multi-turn system interactions.

Be Ready for Technical Feedback

Since the role involves providing high-density technical feedback, practice articulating your thoughts clearly. You might be asked to evaluate a system's behaviour, so think about how you would communicate your insights effectively.

Familiarise Yourself with Real-World Applications

Research how AI agents are integrated with live tools like Supabase or Gmail. Being able to discuss practical applications and potential pitfalls, such as privacy leaks or authority escalation, will show that you understand the complexities of the role.