AI Agent Testing Specialist - Must have IT background (Freelance, Remote)

Freelance | £36,000 - £60,000 / year (est.) | Fully remote
Braintrust

At a Glance

  • Tasks: Design and evaluate test scenarios for AI agents using real-world tasks.
  • Company: Join Mindrift, a leader in ethical AI innovation.
  • Benefits: Flexible freelance role, work from anywhere, enhance your portfolio.
  • Other info: Fully remote position with opportunities for learning and growth.
  • Why this job: Shape the future of AI while working on exciting projects.
  • Qualifications: IT background with skills in QA, software testing, or data analysis.

The predicted salary is between £36,000 and £60,000 per year.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

The Mindrift platform connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behaviour to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Responsibilities:
  • Designing structured test scenarios based on real-world tasks.
  • Defining the golden path and acceptable agent behaviour.
  • Annotating task steps, expected outputs, and edge cases.
  • Working with developers to test your scenarios and improve clarity.
  • Reviewing agent outputs and adapting tests accordingly.

Requirements:

  • Bachelor's and/or Master’s Degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields.
  • Background in QA, software testing, data analysis, or NLP annotation.
  • Good understanding of test design principles (e.g., reproducibility, coverage, edge cases).
  • Strong written communication skills in English.
  • Comfortable with structured formats like JSON/YAML for scenario description.
  • Able to define expected agent behaviours (gold paths) and scoring logic.
  • Basic experience with Python and JS.
  • Curious and open to working with AI-generated content, agent logs, and prompt-based behaviour.
  • Ready to learn new methods, switch between tasks and topics quickly, and sometimes work with challenging, complex guidelines.
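
To give a feel for the scenario-description and gold-path requirements above, here is a minimal Python sketch. It is an illustration only, not Mindrift's actual format: the JSON field names (id, task, gold_path, edge_cases), the action names, and the helper functions are all hypothetical. It parses a scenario and checks whether an agent's logged actions contain the gold-path steps in order.

```python
import json

# Hypothetical scenario description; every field and action name is illustrative.
SCENARIO_JSON = """
{
  "id": "reset-password-001",
  "task": "Reset the password for a locked user account",
  "gold_path": ["open_admin_panel", "find_user", "reset_password", "notify_user"],
  "edge_cases": ["user does not exist", "account already unlocked"]
}
"""


def load_scenario(text):
    """Parse a scenario and check that the required fields are present."""
    scenario = json.loads(text)
    missing = {"id", "task", "gold_path"} - scenario.keys()
    if missing:
        raise ValueError(f"scenario is missing fields: {sorted(missing)}")
    return scenario


def follows_gold_path(gold_path, agent_actions):
    """Return True if the gold steps appear, in order, within the agent's actions.

    Extra agent actions between gold steps are tolerated in this sketch.
    """
    remaining = iter(agent_actions)
    # `step in remaining` advances the iterator, so order is enforced.
    return all(step in remaining for step in gold_path)


scenario = load_scenario(SCENARIO_JSON)
agent_log = ["open_admin_panel", "search_help", "find_user",
             "reset_password", "notify_user"]
print(follows_gold_path(scenario["gold_path"], agent_log))
```

Whether extra actions should be tolerated is a scoring decision in its own right; a stricter scenario could require an exact match instead.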

Our freelance role is fully remote, so you just need a laptop, an internet connection, available time, and enthusiasm to take on a challenge.

Nice to Have:

  • Experience in writing manual or automated test cases.
  • Familiarity with LLM capabilities and typical failure modes.
  • Understanding of scoring metrics (precision, recall, coverage, reward functions).
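
As a rough illustration of the scoring metrics mentioned above, the sketch below computes precision and recall for an agent's actions against a gold path. It assumes steps are compared as unordered sets of action names, which is a simplification; the function name and the example actions are hypothetical.

```python
def step_precision_recall(gold_steps, agent_steps):
    """Compare the agent's steps with the gold steps as unordered sets.

    precision: share of the agent's distinct steps that are in the gold path.
    recall:    share of the gold steps that the agent actually performed.
    """
    gold, agent = set(gold_steps), set(agent_steps)
    matched = gold & agent
    precision = len(matched) / len(agent) if agent else 0.0
    recall = len(matched) / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical example: the agent skips one gold step and adds two extra ones.
gold = ["open_admin_panel", "find_user", "reset_password", "notify_user"]
agent = ["open_admin_panel", "search_help", "find_user",
         "reset_password", "delete_user"]

precision, recall = step_precision_recall(gold, agent)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Here three of the agent's five steps are in the gold path (precision 0.60) and three of the four gold steps were performed (recall 0.75). A set-based comparison ignores ordering and repeated steps, so a real scoring scheme would likely combine it with other checks.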

Benefits:

  • Contribute on your own schedule, from anywhere in the world.
  • This opportunity allows you to take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments.
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio.
  • Influence how future AI models understand and communicate in your field of expertise.

AI Agent Testing Specialist - Must have IT background (Freelance, Remote) employer: Braintrust

At Mindrift, we pride ourselves on fostering a culture of innovation and collaboration, where your contributions directly shape the future of AI. As a remote freelance AI Agent Testing Specialist, you will enjoy the flexibility to work on your own schedule while engaging in cutting-edge projects that enhance your professional portfolio. With opportunities for growth and the chance to influence AI development, Mindrift is an excellent employer for those looking to make a meaningful impact in the tech industry.

Braintrust

Contact Details:

Braintrust Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Agent Testing Specialist - Must have IT background (Freelance, Remote)

Make Your Portfolio Shine

As a freelancer in software development, your portfolio is your bread and butter. Showcase not only your completed projects but also your coding skills on platforms like GitHub or GitLab. This visibility can lead to clients reaching out to you directly!

Join Developer Communities

Get involved in developer communities like Stack Overflow or Reddit's r/programming. Contributing to discussions, sharing your knowledge, or even helping others can expand your network and lead to freelance opportunities. Plus, it keeps you in the loop about what’s trending in software development.

Freelance Platforms Are Your Friend

Sign up for popular freelance platforms such as Upwork or Freelancer. They’re filled with people looking for talent like yours! Don’t forget to personalise your pitch to each potential client, highlighting how your unique skills can solve their specific problems.

Utilise Your Network

Don’t hesitate to reach out to your existing contacts - you never know who might need your services or can refer you to someone who does. Personal connections can lead to more reliable gigs than cold applications. And we're here to help too! If you're looking for freelance work, check out the opportunities listed on our website.

We think you need these skills to ace AI Agent Testing Specialist - Must have IT background (Freelance, Remote)

Analytical Mindset
Attention to Detail
Test Design Principles
NLP Annotation
Data Analysis
Structured Formats (JSON/YAML)
Defining Expected Agent Behaviours

Some tips for your application 🫡

Showcase Your GitHub Projects: When applying for a freelance software engineering gig at Braintrust, make sure to include your GitHub link in your application. Highlighting your code repositories, contributions to Open Source, and any personal projects can really set you apart and give us insight into your skills and coding style.

Tailor Your CV with Relevant Skills: In software development, the specifics matter! Make sure your CV lists the programming languages and technologies you excel at. Focus on the ones that align with the projects Braintrust is working on. Listing your tech stack clearly will help us understand how you fit into our team.

Include a Portfolio of Your Work: A solid portfolio is a must when applying for freelance roles. Include links to any apps, websites, or software you've developed. Highlight any projects that reflect a strong user experience, efficiency, or innovative solutions. This is your chance to shine and show us what you can bring to the table!

Mention Your Availability and Rates: Since this is a freelance role, we want to know when you’re available and what your rates are like! Be upfront about your typical work hours and project timelines. This transparency will help us see if we can sync up for future projects at Braintrust.

How to prepare for a job interview at Braintrust

Showcase Your Code Wizardry

Since you're going for a freelance role in software engineering, have a solid portfolio ready to flaunt your best work. Include projects that highlight your coding skills, frameworks you excel in, and any problem-solving feats you've pulled off. This is your chance to shine, so choose pieces that reflect your unique style and expertise!

Prepare for Technical Challenges

Freelance gigs often involve tech assessments or coding challenges, so be ready to tackle some hands-on problems. Brush up on common algorithms, data structures, and any languages/frameworks relevant to the role at Braintrust. Being comfortable with platforms like HackerRank or LeetCode can give you an edge and showcase your skills under pressure!

Be Clear About Your Rates and Flexibility

As a freelancer, be prepared to discuss your rates upfront. It's crucial to communicate your pricing structure clearly and whether you're open to negotiation. Do your homework on industry standards to ensure you pitch a fair and competitive rate that reflects your skills and experience!

Understand Their Tech Stack

Before the interview, get familiar with the tools and technologies used at Braintrust. Whether it's a particular framework or a specific coding methodology, being knowledgeable about their tech stack not only shows your interest but also helps you present how you could fit seamlessly into their existing projects. Demonstrating you’ve done your homework can set you apart from other candidates!