A leading AI company in the United Kingdom is seeking a candidate to design realistic evaluation scenarios for LLM-based agents. The role involves creating test cases, defining gold-standard behaviors, and analyzing agent logs. Candidates should have a degree in a relevant field and skills in Python and JavaScript. Flexibility in working hours and location is offered, with compensation up to $50/hour based on experience. This is a unique opportunity to enhance your portfolio while influencing future AI models.
Contact Details:
Mindrift Recruiting Team