At a Glance
- Tasks: Evaluate AI outputs across text, image, and video to ensure quality and accuracy.
- Company: Join iMerit, a global leader in AI evaluation with a supportive culture.
- Benefits: Flexible remote work, competitive pay, and opportunities for continuous learning.
- Other info: Minimum commitment of 20 hours per week with flexible scheduling options.
- Why this job: Shape the future of AI while working on innovative projects from anywhere.
- Qualifications: Experience in data annotation or AI evaluation; strong attention to detail required.
The predicted salary is between £18 and £25 per hour.
iMerit seeks detail-oriented and analytically minded Multimodal GenAI Evaluation Analysts to perform highly nuanced evaluations of AI system outputs across different modalities: text, image, video, and multimodal interactions. Analysts will assess the accuracy, appropriateness, quality, clarity, and cultural alignment of model outputs against complex guidelines, ensuring that results align with project standards and real-world use cases. These evaluations will directly inform the development and fine-tuning of advanced large language models (LLMs), large vision models (LVMs), and multimodal AI systems.
Role Responsibilities
- Evaluate outputs generated by LLMs across multiple modalities (text, image captions, video descriptions, and multimodal prompts).
- Assess quality against project-specific criteria such as correctness, coherence, completeness, style, cultural appropriateness, and safety.
- Identify subtle errors, hallucinations, or biases in AI responses.
- Apply domain expertise and logical reasoning to resolve ambiguous or unclear outputs.
- Provide detailed written feedback, tagging, and scoring of outputs to ensure consistency across the evaluation team.
- Escalate unclear cases and contribute to refining evaluation guidelines.
- Collaborate with Project Managers and Quality Leads to meet accuracy, reliability, and turnaround benchmarks.
Skills & Competencies
- Strong critical reading, observational, and evaluative skills across different modalities.
- Ability to articulate nuanced judgments with precision and clarity.
- Excellent English comprehension (CEFR B2 or above); additional languages a plus.
- Familiarity with LLMs, generative AI, and multimodal systems.
- Strong attention to detail and ability to apply guidelines consistently.
- Awareness of cultural and linguistic nuances, including potential bias and harm in AI outputs.
- Comfort with evolving workflows, rapid feedback cycles, and complex quality frameworks.
Requirements
- Bachelor's degree/diploma or equivalent educational qualification.
- 1+ years of experience in data annotation, LLM evaluation, content moderation, or related AI/ML domains.
- Demonstrated experience working with data annotation tools and software platforms.
- Strong understanding of language and multimodal communication (instruction following in image generation, fact-checking, narrative coherence in video, etc.).
- Ability to adapt quickly to changing project directions and fast-paced work environments.
- Previous experience creating or annotating complex data specifically for Large Language Model (LLM) training.
- Prior exposure to generative AI, prompt engineering, or LLM fine-tuning workflows is a plus.
- While moderation of high-harm/high-risk material is not part of this role, candidates should be aware that occasional exposure to NSFW or otherwise sensitive content may occur due to imperfections in client‑provided datasets. Applicants should indicate that they are comfortable working in environments where such incidental exposure is a possibility.
What We Offer
- Opportunities to shape the evaluation standards for next-generation multimodal AI systems.
- Innovative and supportive global working environment.
- Competitive compensation and flexible remote working arrangements.
- Continuous learning and growth in applied AI evaluation.
Please acknowledge that you agree to the selection process below.
- You will receive an iMerit platform assessment (15–30 minutes).
- If successfully completed, you’ll be invited to join the first project.
- After onboarding, once you’ve completed 10 hours of work, a quality test will be conducted.
- If you pass the quality test, you’ll continue on a 3‑month project and will be invited to participate in upcoming projects.
Note
- You will complete a quick 15–30 minute assessment. This requires downloading a browser extension, which can be removed once the assessment is completed.
- ID verification and background check are required.
- Onboarding will be completed through iMerit’s platform.
Commitment
- Minimum 20 hours per week (flexible schedule). You may work more hours if desired.
Hourly Rates
- Malaysia – $5/hr
- Mexico, Colombia, Brazil, Costa Rica – $8.50/hr
- Argentina, Poland, Bulgaria, Romania, Malta, Latvia, Lithuania, UAE – $13/hr
- Portugal, Italy, Greece, Spain – $15.50/hr
- Canada, Australia, New Zealand, United Kingdom, Ireland, US, Finland, France, Sweden, Belgium, Austria, Denmark, Germany, Luxembourg, Estonia – $22/hr
For Digital Nomads: If you are currently traveling, please let us know. This ensures any discrepancies between your current location and your work authorization location do not affect your application.
Writer / AI Annotator / (Remote- freelance, 100+ openings) in London
Employer: Braintrust
Contact Detail:
Braintrust Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Writer / AI Annotator / (Remote- freelance, 100+ openings) in London
✨Tip Number 1
Get your research game on! Before you apply, dive deep into iMerit and understand their projects. Knowing the ins and outs of what they do will help you tailor your approach and show them you're genuinely interested.
✨Tip Number 2
Practice makes perfect! If you're looking to evaluate AI outputs, brush up on your critical reading and evaluative skills. Try analysing some sample outputs or even create your own evaluations to get a feel for the process.
✨Tip Number 3
Network like a pro! Connect with current or former iMerit employees on LinkedIn. They can provide insider tips and might even give you a heads-up about upcoming opportunities. Plus, it shows you're proactive!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen. Plus, we often have exclusive openings listed there that you won’t find anywhere else. Don’t miss out!
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your application for the Writer / AI Annotator role. Highlight your relevant experience with LLMs and data annotation, and show us how your skills align with the job description.
Showcase Your Attention to Detail: Since this role requires a keen eye for detail, include examples in your application that demonstrate your ability to evaluate and provide feedback on complex outputs. We want to see how you spot nuances!
Be Clear and Concise: When writing your application, keep it clear and to the point. Use straightforward language to articulate your thoughts, as clarity is key in this role. We appreciate well-structured applications!
Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and get you started on the right foot. We can’t wait to hear from you!
How to prepare for a job interview at Braintrust
✨Know Your AI Stuff
Make sure you brush up on your knowledge of large language models and multimodal systems. Familiarise yourself with the latest trends in generative AI, as well as any specific tools or platforms mentioned in the job description. This will help you answer questions confidently and show that you're genuinely interested in the field.
✨Showcase Your Attention to Detail
Since the role requires a keen eye for detail, prepare examples from your past work where you identified subtle errors or biases. Be ready to discuss how you applied guidelines consistently and how your evaluations contributed to project success. This will demonstrate your analytical skills and ability to follow complex criteria.
✨Practice Articulating Nuanced Judgments
In this role, you'll need to provide detailed feedback and articulate your evaluations clearly. Practice explaining your thought process on sample outputs, focusing on aspects like correctness, coherence, and cultural appropriateness. This will help you communicate effectively during the interview and showcase your evaluative skills.
✨Be Ready for Rapid Changes
The job mentions evolving workflows and fast-paced environments, so be prepared to discuss how you've adapted to changes in previous roles. Share specific examples of how you handled shifting priorities or new guidelines, which will show that you're flexible and can thrive under pressure.