Remote AI Evaluation Scenario Architect & Testing Specialist

Full-Time

A leading AI company in the United Kingdom is seeking a candidate to design realistic evaluation scenarios for LLM-based agents. The role involves creating test cases, defining gold-standard behaviors, and analyzing agent logs. Candidates should have a degree in a relevant field and skills in Python and JavaScript. Flexibility in working hours and location is offered, with compensation up to $50/hour based on experience. This is a unique opportunity to enhance your portfolio while influencing future AI models.

Contact Detail:

Mindrift Recruiting Team
