AI Evaluation Engineer (Remote) in Swansea

AI Evaluation Engineer (Remote) in Swansea

Swansea Full-Time 60000 - 80000 € / year (est.) Home office possible
Outlier AI

At a Glance

  • Tasks: Shape the future of AI by providing human feedback to improve autonomous agents.
  • Company: Join a cutting-edge company at the forefront of AI innovation.
  • Benefits: Enjoy remote work flexibility, competitive pay, and opportunities for professional growth.
  • Other info: Dynamic role with opportunities to tackle real-world challenges and enhance your tech skills.
  • Why this job: Make a real impact on advanced generative systems and collaborate with top AI organisations.
  • Qualifications: 2+ years in backend engineering or AI automation; strong coding skills in major languages.

The predicted salary is between 60000 - 80000 € per year.

About the Project

Outlier helps the world’s most innovative companies improve their AI agents by providing human feedback. We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows.

Ideal Qualifications

  • 2+ years of experience in backend engineering, AI automation, or complex systems integration.
  • Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
  • Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
  • Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
  • Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.

Nice to have

  • Expertise building multi-stage coordination tasks where data acquisition leads to reasoned output.
  • Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
  • High level of comfort implementing persistent state and session discovery using MEMORY.md to track agent progress.
  • Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.

AI Evaluation Engineer (Remote) in Swansea employer: Outlier AI

At Outlier, we pride ourselves on being an exceptional employer that fosters a collaborative and innovative work culture. Our remote environment allows for flexibility while providing ample opportunities for professional growth in the rapidly evolving field of AI. Join us to not only contribute to groundbreaking projects but also to be part of a team that values your insights and encourages continuous learning.

Outlier AI

Contact Detail:

Outlier AI Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation Engineer (Remote) in Swansea

Tip Number 1

Network like a pro! Reach out to folks in the AI and software development communities. Join relevant forums, attend meetups, or even slide into LinkedIn DMs. You never know who might have the inside scoop on job openings!

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving backend engineering or AI automation. This is your chance to demonstrate your expertise in building production-grade software and handling complex systems.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge. Be ready to discuss your experience with languages like Python or JavaScript, and how you've tackled real-world problems with multi-turn system interactions. Practice makes perfect!

Tip Number 4

Don't forget to apply through our website! We love seeing candidates who are genuinely interested in joining us at StudySmarter. Tailor your application to highlight your experience with AI agents and complex workflows to stand out from the crowd.

We think you need these skills to ace AI Evaluation Engineer (Remote) in Swansea

Backend Engineering
AI Automation
Complex Systems Integration
Production-Grade Software Development
Modular Software Architecture
Python
JavaScript

Some tips for your application 🫡

Show Off Your Skills:Make sure to highlight your experience in backend engineering and AI automation. We want to see how you've built and maintained production-grade software, so don’t hold back on the details!

Be Specific About Your Experience:When you describe your past projects, focus on the specifics. Mention the languages you've used, like Python or JavaScript, and any SQL databases you've worked with. This helps us understand your technical background better.

Attention to Detail is Key:We love candidates who pay attention to detail. Make sure your application is free from typos and clearly structured. This reflects your ability to provide high-density technical feedback, which is crucial for the role.

Apply Through Our Website:Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. We can’t wait to see what you bring to the table!

How to prepare for a job interview at Outlier AI

Know Your Tech Inside Out

Make sure you’re well-versed in the programming languages mentioned in the job description, like Python or JavaScript. Brush up on your SQL skills too, as you'll likely need to demonstrate your ability to work with databases during the interview.

Showcase Your Problem-Solving Skills

Prepare examples of how you've tackled complex systems integration or AI automation challenges in the past. Be ready to discuss specific projects where you built production-grade software and how you approached multi-turn system interactions.

Demonstrate Attention to Detail

Since the role requires providing clear technical feedback, practice articulating your thoughts on complex system behaviours. You might be asked to analyse a scenario or a piece of code, so being precise and thorough will set you apart.

Familiarise Yourself with Real-World Applications

Research how AI agents are integrated with tools like Supabase or Gmail. Being able to discuss real-world applications and potential pitfalls, such as privacy leaks or authority escalation, will show that you understand the practical implications of your work.