At a Glance
- Tasks: Shape the future of AI by providing human feedback to improve autonomous agents.
- Company: Outlier, a leader in AI innovation and collaboration.
- Benefits: Remote work options, competitive salary, and opportunities for professional growth.
- Other info: Dynamic remote environment with exciting projects and career advancement potential.
- Why this job: Join us to train advanced generative systems and make a real impact.
- Qualifications: 2+ years in backend engineering or AI automation; strong coding skills required.
The predicted salary is between €50,000 and €60,000 per year.
About the Project
Outlier helps the world's most innovative companies improve their AI agents by providing human feedback. We collaborate with leading AI organizations to train Large Language Models (LLMs) to function as proactive, multi-step agents. Our projects focus on teaching these systems how to design, coordinate, and optimize complex, real-world architectural workflows.
Whether you are a passionate orchestration guru or an experienced software developer, we want you to help us train the world's most advanced generative systems.
Ideal Qualifications
- 2+ years of experience in backend engineering, AI automation, or complex systems integration.
- Proven ability to build and maintain production-grade software with modular separation (e.g., distinct services for data parsing, logic processing, and reporting).
- Strong command of at least two major languages (e.g., Python, JavaScript, Go, or Java) and experience working with SQL databases.
- Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
- Outstanding attention to detail and the ability to provide clear, high-density technical feedback on complex system behaviors.
Nice to have
- Expertise in building multi-stage coordination tasks where data acquisition leads to reasoned output.
- Hands-on experience integrating agents with live tools such as Supabase, Gmail, and various APIs to solve real-world problems.
- High level of comfort implementing persistent state and session discovery to track agent progress.
- Experience identifying subtle failures like privacy leaks, authority escalation, or indirect prompt injections.
Remote working/work at home options are available for this role.
AI Evaluation Engineer (Remote) (City of Edinburgh) in London. Employer: Outlier AI
Outlier is an exceptional employer that fosters a collaborative and innovative work culture, allowing AI Evaluation Engineers to contribute meaningfully to cutting-edge projects from the comfort of their own homes in Edinburgh. With a strong emphasis on employee growth, Outlier offers opportunities to work alongside leading AI organisations, enhancing your skills while shaping the future of autonomous agents. The flexibility of remote work combined with the chance to engage in complex, real-world architectural workflows makes this role both rewarding and impactful.
StudySmarter Expert Advice🤫
We think this is how you could land the AI Evaluation Engineer (Remote) (City of Edinburgh) role in London
✨Tip Number 1
Network like a pro! Reach out to folks in the AI and tech space, especially those who work at companies you're interested in. A friendly chat can open doors that a CV just can't.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those related to AI and backend engineering. This gives potential employers a taste of what you can do beyond the written application.
✨Tip Number 3
Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice common coding challenges and be ready to discuss your past projects in detail — we want to see how you think!
✨Tip Number 4
Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in joining our team at Outlier.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the AI Evaluation Engineer role. Highlight your experience in backend engineering and any relevant projects you've worked on that showcase your skills in AI automation and complex systems integration.
Showcase Your Skills: Don’t just list your skills; demonstrate them! Use specific examples from your past work to show how you’ve built production-grade software and handled multi-turn system interactions. This will help us see your practical experience in action.
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Explain why you're passionate about shaping the future of autonomous agents and how your background aligns with our mission at Outlier. Keep it engaging and personal!
Apply Through Our Website: We encourage you to apply through our website for a smoother application process. It helps us keep everything organised and ensures your application gets the attention it deserves. Plus, it’s super easy!
How to prepare for a job interview at Outlier AI
✨Know Your Tech Inside Out
Make sure you brush up on your programming languages, especially Python and JavaScript. Be ready to discuss your experience with SQL databases and how you've built production-grade software. They’ll want to see that you can talk the talk and walk the walk!
✨Showcase Your Problem-Solving Skills
Prepare examples of how you've tackled complex systems integration or AI automation challenges in the past. Think about specific projects where you had to design workflows or optimise processes, and be ready to explain your thought process.
✨Demonstrate Attention to Detail
Since this role requires high-density technical feedback, practice articulating your thoughts clearly. Bring examples of your work that highlight your attention to detail, especially in identifying subtle failures or issues in system behaviours.
✨Familiarise Yourself with Real-World Applications
Research how AI agents are integrated with tools like Supabase and Gmail. Be prepared to discuss how you would approach real-world problems using these technologies, as they’re looking for someone who can think practically about implementation.