Senior AI Model Evaluation Director in London

Senior AI Model Evaluation Director in London

London Full-Time 90000 - 120000 £ / year (est.) No working from home possible
London Stock Exchange

At a Glance

  • Tasks: Lead the evaluation and validation of advanced AI and ML models in a dynamic environment.
  • Company: Join a forward-thinking organisation at the forefront of AI innovation.
  • Benefits: Competitive salary, flexible working options, and opportunities for professional growth.
  • Other info: Collaborative culture with a focus on innovation and excellence.
  • Why this job: Shape the future of AI while ensuring model safety and compliance.
  • Qualifications: Extensive experience in AI/ML product management and model evaluation required.

The predicted salary is between 90000 - 120000 £ per year.

Requirements:

  • Bachelor's, Master’s, or PhD or equivalent experience in Computer Science, Machine Learning, Applied Mathematics, Statistics, Financial Engineering, or a related quantitative field.
  • Significant professional experience (typically 7–12+ years) in AI/ML product development/management, model validation, quantitative research, risk modelling, or related areas.
  • Demonstrated success working with technical teams or senior specialists in high-stakes modelling environments.
  • Deep understanding of AI/ML systems—including LLMs, agentic architectures, RAG pipelines, credit or pricing models, or risk modelling techniques.
  • Hands-on experience developing or validating models, performing statistical testing, and analysing model assumptions, limitations, and risks.
  • Familiarity with model evaluation tooling, experimentation frameworks, and modern ML infrastructure.
  • Excellent communication skills, with the ability to present complex findings clearly to both technical and non-technical audiences.
  • (Desirable) Experience in B2B data or RegTech environments.
  • (Desirable) Experience managing AI systems in production environments or high-scale data and ML platforms.
  • (Desirable) Experience in working with teams in MLOps, DevOps, or large-scale compute environments (e.g., GPU clusters, cloud orchestration, Kubernetes).
  • (Desirable) Experience with Generative AI evaluation, agent testing, or AI safety frameworks.
  • (Desirable) Track record of partnering with regulatory bodies or leading audit-readiness efforts.

What the job involves:

We are seeking an experienced AI & Model Evaluation Manager to lead the evaluation, validation, and governance of advanced AI, machine learning, and statistical models across our Active Data Layer programme of development. This role blends technical depth, strategic leadership, and strong stakeholder management, ensuring that our models are accurate, reliable, safe, and aligned with regulatory and organisational standards.

You will oversee the end-to-end lifecycle of model evaluation - ranging from large language models (LLMs), agentic systems, and machine learning models used across Risk Intelligence to source, determine, match, and resolve model-driven data tasks within our Financial Crime and Screening domains.

You will act as the product owner of the AILab and associated analytics infrastructure supporting our ADL models, working in close partnership with technology leads on the development and implementation of these new capabilities. As a senior member of the Data & Product team, you will partner closely with engineering and architecture teams, including Data Science, AIOps, as well as the wider business functions such as Risk & Compliance, Legal, and Content Operations to ensure our data capabilities are accurate, fair, robust, auditable, and business-effective.

In addition, you will use your experience to shape best practices, drive innovation, and influence broader AI and model governance frameworks.

AI & ML Model Evaluation:

  • Lead the design and execution of evaluation methodologies for LLMs, multimodal systems, AI agents, and traditional ML models.
  • Oversee scenario-based testing, regression suites, multiturn agent simulations, and automated evaluation systems such as LLM-as-Judge and hybrid scoring approaches.
  • Build, refine, and maintain frameworks that assess model quality, robustness, performance, safety, explainability, and reliability at scale.

Model Validation & Governance:

  • Direct the independent review and validation of models across teams, ensuring compliance with internal LSEG governance standards and processes, and relevant regulatory expectations.
  • Maintain a robust model inventory, validation documentation, and version controlled evidence supporting approval and audit requirements.
  • Serve as a subject matter expert on model risk & decision methodologies (as applicable), AI evaluation patterns, and modelling frameworks.

Experimentation & Monitoring:

  • Oversee the development and operation of online and offline experimentation platforms, including A/B testing, shadow deployments, canary releases, and continuous monitoring.
  • Embed evaluation and experimentation into CI/CD pipelines, enabling automated quality gates and reliable release processes for model-driven products.
  • Implement observability practices that track model drift, degradation, safety issues, and agent behaviour over time.

Leadership & Strategy:

  • Work collaboratively within a cross-functional high-performing team, fostering innovation, technical excellence, and a collaborative culture.
  • Define and execute strategic direction for AI evaluation, model risk management, and model governance frameworks.
  • Partner with product, engineering, research, risk, compliance, and senior leadership across the organisation to influence AI development practices and decision-making.
  • Represent the function in internal and external audits, regulatory engagements, and cross-functional governance forums.

Stakeholder Engagement:

  • Act as a trusted advisor to model owners, developers, and business leaders—translating complex technical findings into actionable insights.
  • Support change management across the organisation to drive consistency in evaluation standards, documentation quality, and responsible AI adoption.

What Success Looks Like:

  • Models across the organisation consistently meet high standards of quality, safety, performance, and compliance.
  • The business benefits from reliable model-driven decisions, underpinned by transparent, well-governed evaluation practices.
  • The team operates with excellence, delivering high-impact insights, innovative evaluation techniques, and robust model validation outcomes.
  • Stakeholders across engineering, risk, and product view the function as a strategic partner and trusted authority.

Senior AI Model Evaluation Director in London employer: London Stock Exchange

As a leading employer in the AI and machine learning sector, we offer a dynamic work environment that fosters innovation and collaboration. Our commitment to employee growth is evident through continuous learning opportunities and a culture that values diverse perspectives, ensuring that every team member can contribute meaningfully to our mission of advancing AI governance and model evaluation. Located in a vibrant tech hub, we provide access to cutting-edge resources and a network of industry experts, making it an exciting place for professionals looking to make a significant impact.

London Stock Exchange

Contact Details:

London Stock Exchange Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Senior AI Model Evaluation Director in London

Tip Number 1

Network like a pro! Get out there and connect with folks in the AI and ML space. Attend meetups, webinars, or industry conferences. You never know who might be looking for someone with your skills!

Tip Number 2

Show off your expertise! Create a portfolio showcasing your projects, especially those related to model evaluation and AI systems. This will give potential employers a taste of what you can bring to the table.

Tip Number 3

Prepare for interviews by brushing up on your communication skills. Practice explaining complex AI concepts in simple terms. Remember, you’ll need to impress both technical and non-technical audiences!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets noticed. Plus, we love seeing candidates who are proactive about their job search!

We think you need these skills to ace Senior AI Model Evaluation Director in London

AI/ML Product Development
Model Validation
Quantitative Research
Risk Modelling
Statistical Testing
Model Evaluation Tooling
Experimentation Frameworks

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the role of Senior AI Model Evaluation Director. Highlight your experience in AI/ML product development, model validation, and any relevant quantitative fields. We want to see how your background aligns with our needs!

Showcase Your Skills:In your application, don’t just list your skills—show us how you've used them! Provide examples of your hands-on experience with model evaluation tooling and statistical testing. This will help us understand your practical expertise.

Communicate Clearly:Since this role involves presenting complex findings, make sure your application reflects your communication skills. Use clear and concise language to explain your past experiences and how they relate to the job. We love clarity!

Apply Through Our Website:We encourage you to apply through our website for a smoother process. It’s the best way for us to receive your application and ensures you’re considered for the role. Don’t miss out on this opportunity!

How to prepare for a job interview at London Stock Exchange

Know Your Stuff

Make sure you brush up on your knowledge of AI/ML systems, especially LLMs and risk modelling techniques. Be ready to discuss your hands-on experience with model validation and statistical testing, as this will show that you’re not just familiar with the theory but have practical skills too.

Showcase Your Leadership Skills

Since this role involves strategic leadership, prepare examples of how you've successfully led teams in high-stakes environments. Think about times when you influenced decision-making or drove innovation in AI evaluation practices—these stories will highlight your capability to manage and inspire others.

Communicate Clearly

You’ll need to present complex findings to both technical and non-technical audiences, so practice explaining your work in simple terms. Use clear, concise language and avoid jargon where possible. This will demonstrate your ability to be a trusted advisor to stakeholders.

Be Ready for Scenario-Based Questions

Expect questions that assess your problem-solving skills in real-world scenarios. Prepare to discuss how you would approach model evaluation and governance challenges, including how you’d handle compliance with regulatory standards. This will show that you can think critically and strategically under pressure.