AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London
AI Agent Testing Specialist - Must have IT background (Freelance, Remote)

AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London

London Freelance 30000 - 50000 £ / year (est.) No home office possible
Go Premium
Braintrust

At a Glance

  • Tasks: Design and evaluate test scenarios for AI agents, simulating real-world tasks.
  • Company: Mindrift, a leader in ethical AI innovation and collaboration.
  • Benefits: Flexible freelance role, work from anywhere, enhance your portfolio.
  • Why this job: Shape the future of AI while working on exciting projects that matter.
  • Qualifications: IT background with experience in QA, software testing, or data analysis.
  • Other info: Fully remote position, perfect for balancing studies or other commitments.

The predicted salary is between 30000 - 50000 £ per year.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

The Mindrift platform connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behaviour to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Responsibilities:
  • Designing structured test scenarios based on real-world tasks.
  • Defining the golden path and acceptable agent behaviour.
  • Annotating task steps, expected outputs, and edge cases.
  • Working with developers to test your scenarios and improve clarity.
  • Reviewing agent outputs and adapting tests accordingly.
How To Get Started:

Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone.

Requirements:
  • Bachelor's and/or Master’s Degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields.
  • Background in QA, software testing, data analysis, or NLP annotation.
  • Good understanding of test design principles (e.g., reproducibility, coverage, edge cases).
  • Strong written communication skills in English.
  • Comfortable with structured formats like JSON/YAML for scenario description.
  • Can define expected agent behaviours (gold paths) and scoring logic.
  • Basic experience with Python and JavaScript.
  • Curious and open to working with AI-generated content, agent logs, and prompt-based behaviour.
  • You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.

Our freelance role is fully remote so you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.

Nice to Have:
  • Experience in writing manual or automated test cases.
  • Familiarity with LLM capabilities and typical failure modes.
  • Understanding of scoring metrics (precision, recall, coverage, reward functions).
Benefits:

Contribute on your own schedule, from anywhere in the world. This opportunity allows you to:

  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.

AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London employer: Braintrust

At Mindrift, we pride ourselves on fostering a culture of innovation and collaboration, where your contributions directly shape the future of AI. As a remote freelance AI Agent Testing Specialist, you will enjoy the flexibility to work on your own schedule while engaging in cutting-edge projects that enhance your professional portfolio. With opportunities for growth and the chance to influence AI development, Mindrift is an excellent employer for those looking to make a meaningful impact in the tech industry.
Braintrust

Contact Detail:

Braintrust Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London

✨Tip Number 1

Network like a pro! Reach out to folks in the AI and tech community, whether it's through LinkedIn or local meetups. You never know who might have the inside scoop on freelance gigs that aren't even advertised yet.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your test scenarios and any relevant projects you've worked on. This will give potential clients a taste of what you can do and set you apart from the crowd.

✨Tip Number 3

Stay updated with the latest trends in AI and testing. Follow industry blogs, join forums, and participate in discussions. This not only boosts your knowledge but also shows potential clients that you're passionate and engaged.

✨Tip Number 4

Apply through our website! We make it super easy for you to find roles that match your skills. Plus, you'll be part of a community that's all about shaping the future of AI together.

We think you need these skills to ace AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London

Analytical Mindset
Attention to Detail
Test Design Principles
Data Analysis
NLP Annotation
Written Communication Skills
JSON/YAML Proficiency
Expected Agent Behaviour Definition
Scoring Logic
Python
JavaScript
Curiosity in AI-generated Content
Adaptability
Experience in QA or Software Testing

Some tips for your application 🫡

Show Off Your Skills: Make sure to highlight your IT background and any relevant experience in QA, software testing, or NLP annotation. We want to see how your skills align with the role, so don’t hold back!

Be Clear and Concise: When writing your application, keep it straightforward. Use structured formats like JSON or YAML if you can, as it shows you’re comfortable with the tools we use. Clarity is key!

Tailor Your Application: Don’t just send a generic application. Take the time to tailor your responses to our job description. Mention specific responsibilities and requirements that resonate with your experience.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity!

How to prepare for a job interview at Braintrust

✨Know Your Tech Inside Out

Make sure you brush up on your IT background, especially in areas like QA, software testing, and data analysis. Be ready to discuss how these skills apply to designing structured test scenarios for AI agents.

✨Showcase Your Analytical Skills

Prepare examples that highlight your analytical mindset and attention to detail. Think of specific instances where you've defined expected behaviours or created test cases, as this will resonate well with the role's requirements.

✨Familiarise Yourself with Test Design Principles

Understand key concepts like reproducibility, coverage, and edge cases. Be prepared to discuss how you would apply these principles when creating evaluation scenarios for LLM-based agents.

✨Communicate Clearly and Confidently

Since strong written communication skills are essential, practice explaining complex ideas simply. You might be asked to describe your approach to annotating task steps or defining scoring logic, so clarity is key!

AI Agent Testing Specialist - Must have IT background (Freelance, Remote) in London
Braintrust
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>