At a Glance
- Tasks: Design and evaluate test scenarios for AI agents, simulating real-world tasks.
- Company: Mindrift, a leader in ethical AI innovation and collaboration.
- Benefits: Flexible freelance role, work from anywhere, enhance your portfolio.
- Why this job: Shape the future of AI while working on exciting projects that matter.
- Qualifications: IT background with experience in QA, software testing, or data analysis.
- Other info: Fully remote position, perfect for balancing studies or other commitments.
The predicted salary is between 30000 - 50000 £ per year.
At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.
The Mindrift platform connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.
We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behaviour to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.
Responsibilities:- Designing structured test scenarios based on real-world tasks.
- Defining the golden path and acceptable agent behaviour.
- Annotating task steps, expected outputs, and edge cases.
- Working with developers to test your scenarios and improve clarity.
- Reviewing agent outputs and adapting tests accordingly.
Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.
Requirements:- Bachelor's and/or Master’s Degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields.
- Background in QA, software testing, data analysis, or NLP annotation.
- Good understanding of test design principles (e.g., reproducibility, coverage, edge cases).
- Strong written communication skills in English.
- Comfortable with structured formats like JSON/YAML for scenario description.
- Can define expected agent behaviours (gold paths) and scoring logic.
- Basic experience with Python and JavaScript.
- Curious and open to working with AI-generated content, agent logs, and prompt-based behaviour.
- You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.
Our freelance role is fully remote so you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.
Nice to Have:- Experience in writing manual or automated test cases.
- Familiarity with LLM capabilities and typical failure modes.
- Understanding of scoring metrics (precision, recall, coverage, reward functions).
Contribute on your own schedule, from anywhere in the world. This opportunity allows you to:
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
- Influence how future AI models understand and communicate in your field of expertise.
AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London employer: Braintrust
Contact Detail:
Braintrust Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London
✨Tip Number 1
Network like a pro! Reach out to folks in the AI and tech community, whether it's through LinkedIn or local meetups. You never know who might have the inside scoop on freelance gigs that aren't even advertised yet.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your test scenarios and any relevant projects you've worked on. This will give potential clients a taste of what you can do and set you apart from the crowd.
✨Tip Number 3
Stay updated with the latest trends in AI and testing. Follow industry blogs, join forums, and participate in discussions. This not only boosts your knowledge but also shows potential clients that you're passionate and engaged.
✨Tip Number 4
Apply through our website! We make it super easy for you to find roles that match your skills. Plus, you'll be part of a community that's all about shaping the future of AI together.
We think you need these skills to ace AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London
Some tips for your application 🫡
Show Off Your Skills: Make sure to highlight your IT background and any relevant experience in QA, software testing, or NLP annotation. We want to see how your skills align with the role, so don’t hold back!
Be Clear and Concise: When writing your application, keep it straightforward. Use structured formats like JSON or YAML if you can, as it shows you’re comfortable with the tools we use. Clarity is key!
Tailor Your Application: Don’t just send a generic application. Take the time to tailor your responses to our job description. Mention specific responsibilities and requirements that resonate with your experience.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity!
How to prepare for a job interview at Braintrust
✨Know Your Tech Inside Out
Make sure you brush up on your IT background, especially in areas like QA, software testing, and data analysis. Be ready to discuss how these skills apply to designing structured test scenarios for AI agents.
✨Showcase Your Analytical Skills
Prepare examples that highlight your analytical mindset and attention to detail. Think of specific instances where you've defined expected behaviours or created test cases, as this will resonate well with the role's requirements.
✨Familiarise Yourself with Test Design Principles
Understand key concepts like reproducibility, coverage, and edge cases. Be prepared to discuss how you would apply these principles when creating evaluation scenarios for LLM-based agents.
✨Communicate Clearly and Confidently
Since strong written communication skills are essential, practice explaining complex ideas simply. You might be asked to describe your approach to annotating task steps or defining scoring logic, so clarity is key!