AI Evaluation Lead in London

AI Evaluation Lead in London

London Full-Time 72100 - 100000 € / year (est.) Home office (partial)
Financial Conduct Authority

At a Glance

  • Tasks: Lead AI evaluation projects, ensuring quality and performance of GenAI outputs.
  • Company: Join the FCA, a key player in regulating UK financial services.
  • Benefits: Enjoy 28 days leave, hybrid work, private healthcare, and a non-contributory pension.
  • Other info: Diverse and inclusive culture with excellent career growth opportunities.
  • Why this job: Make a real impact on UK finance while working with cutting-edge AI technology.
  • Qualifications: Experience in AI/ML initiatives and strong stakeholder management skills required.

The predicted salary is between 72100 - 100000 € per year.

Division: Data, Technology and Innovation

Department: AI Product Delivery

Salary: National (Edinburgh and Leeds) ranging from £72,100 to £100,000 and London from £79,300 to £110,000 per annum (salary offered will be based on skills and experience)

This role is graded as: Technical Specialist – Regulatory

About the FCA and team

We regulate financial services firms in the UK, to keep financial markets fair, thriving and effective. By joining us, you’ll play a key part in protecting consumers, driving economic growth, and shaping the future of UK finance services. The Data, Technology and Innovation (DTI) division enables the FCA to be a digital-first, data-led smart regulator by delivering a secure, agile, and cost-effective technology and data ecosystem that drives better decisions, transparency, and operational efficiency.

Working alongside the wider AI Programme (which will continue to oversee/coordinate AI activity across the FCA), the department will partner with business leads to shape and deliver work in priority areas — Authorisations, SPC, EMO and Anti‑Money Laundering.

Role responsibilities

  • Define and own evaluation frameworks for GenAI outputs covering quality measures such as accuracy, relevance, robustness and hallucination rates.
  • Design, curate and govern test datasets and benchmarks to ensure consistent model and solution performance assessment over time.
  • Support development of automated evaluation pipelines and operational reporting to embed assurance into delivery.
  • Identify, assess and mitigate risks in model behaviour, for example bias, errors, safety concerns and edge cases with clear escalation and control recommendations.
  • Manage delivery through well-defined work packages, setting priorities, operating standards and performance objectives including line management of Business Analysts.
  • Engage senior stakeholders to understand strategic priorities, build a pipeline of scoped and prioritised projects and translate needs into analytics led solutions with clear business value.
  • Work in the public interest protecting 40 million UK consumers who rely on financial services and supporting long term economic growth from an industry contributing 12% of UK economic output.
  • Manage digital and data-led change by encouraging innovative experiments and working with senior stakeholders while empowering a diverse team to collaborate openly.

Skills required

Minimum:

  • Experience delivering analytics, data science and AI/ML initiatives, including defining success measures, evaluating model/product performance, and applying innovative problem-solving approaches.
  • Demonstrated experience leading people (line management, coaching, and performance reviews), capable of overseeing a portfolio of projects and adjusting delivery as priorities shift.
  • Effective stakeholder management skills, including working with senior colleagues to translate priorities into well-scoped, prioritised work with clear outcomes and measurable value.

Essential:

  • Demonstrable experience designing and applying evaluation frameworks for GenAI ML solutions such as accuracy, relevance, robustness, consistency and hallucination or error rates including defining clear acceptance thresholds.
  • Experience curating, documenting and governing test datasets and benchmarks including version control to enable repeatable assessment and comparability over time.
  • Ability to identify, assess and mitigate model risks including bias, safety concerns, data leakage, harmful outputs and edge cases and to recommend appropriate controls and escalation routes.
  • Experience building or specifying automated evaluation and monitoring approaches such as scripted test runs, scoring and dashboards or management information to embed assurance into delivery and ongoing operations.
  • Demonstrated analytical skills with the ability to interpret evaluation results, communicate uncertainty and limitations, translate complex technical concepts for senior stakeholders and make evidence-based recommendations to improve solution performance.
  • Consistent delivery discipline including planning and managing evaluation work packages, prioritising across competing demands and ensuring outputs meet agreed standards and timelines.
  • Clear written documentation skills producing evaluation reports, test plans and assurance artefacts that are audit ready and suitable for governance forums.

Benefits

  • 28 days annual leave plus bank holidays.
  • Hybrid model where employees work a minimum of 40% in the office each month (expectation of 50% for senior leaders). Changing from September to a minimum of 50% in the office each month (expectation of 60% for Directors and Executive Directors).
  • Non-contributory pension (8–12% depending on age) and life assurance at eight times your salary.
  • Private healthcare with Bupa, income protection, and 24/7 Employee Assistance.
  • 35 hours of paid volunteering annually.
  • A flexible benefits scheme designed around your lifestyle.

Our values & culture

Our colleagues are the key to our success as a regulator. We are committed to fostering a diverse and inclusive culture: one that’s free from discrimination and bias, celebrates difference, and supports colleagues to deliver at their best. We believe that our differences and similarities enable us to be a better organisation – one that makes better decisions, drives innovation, and delivers better regulation.

If you require any adjustments due to a disability or condition, your recruiter is here to help - reach out for tailored support. We welcome diverse working styles and aim to find flexible solutions that suit both the role and individual needs, including options like part-time and job sharing where applicable.

Disability Confident: our hiring approach

We’re proud to be a Disability Confident Employer, and therefore, people or individuals with disabilities and long-term conditions who best meet the minimum criteria for a role will go through to the next stage of the recruitment process. In cases of high application volumes, we may progress applicants whose experience most closely matches the role’s key requirements.

Useful information and timeline

  • Advert Closing: 19th May
  • CV Review/Shortlist: 22nd May
  • First Stage Interviews W/C: 1st June
  • Second Stage Interviews W/C: 8th June

Your Recruiter will discuss the process in detail with you during screening for the role, therefore, please make them aware if you are going to be unavailable for any date during this time.

AI Evaluation Lead in London employer: Financial Conduct Authority

The FCA is an exceptional employer, offering a dynamic work environment in the heart of Edinburgh and Leeds, where you can make a meaningful impact on the financial services sector. With a strong commitment to employee growth, a diverse and inclusive culture, and generous benefits including a non-contributory pension and 28 days of annual leave, the FCA empowers its employees to thrive both personally and professionally while contributing to the protection of 40 million UK consumers.

Financial Conduct Authority

Contact Detail:

Financial Conduct Authority Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation Lead in London

Tip Number 1

Network like a pro! Reach out to people in the industry, especially those already working at the FCA. A friendly chat can give you insights and maybe even a referral. Don’t be shy; we all love a good conversation!

Tip Number 2

Prepare for the interview by diving deep into the role. Understand the evaluation frameworks and AI initiatives mentioned in the job description. We want you to show off your knowledge and passion for the field!

Tip Number 3

Practice makes perfect! Get a friend or mentor to do mock interviews with you. This will help you articulate your thoughts clearly and boost your confidence. Remember, we’re here to support you!

Tip Number 4

Don’t forget to apply through our website! It’s the only way to ensure your application gets seen. Plus, it shows you’re serious about joining the FCA team. Let’s get you that dream job!

We think you need these skills to ace AI Evaluation Lead in London

Analytics
Data Science
AI/ML Initiatives
Evaluation Frameworks
Model Performance Evaluation
Stakeholder Management
Risk Assessment

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the AI Evaluation Lead role. Highlight your experience with analytics, data science, and AI/ML initiatives. We want to see how your skills align with the job description!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about the role and how you can contribute to the FCA's mission. Keep it concise but impactful – we love a good story!

Showcase Your Achievements:When detailing your experience, focus on specific achievements that demonstrate your ability to lead projects and manage stakeholders. Numbers and outcomes speak volumes, so don’t hold back!

Follow Application Guidelines:Remember, applications must be submitted through our online portal. Avoid sending your application via email or social media, as those won’t be accepted. Stick to the guidelines, and you’ll be golden!

How to prepare for a job interview at Financial Conduct Authority

Know Your AI Stuff

Make sure you brush up on your knowledge of AI evaluation frameworks, especially around GenAI outputs. Be ready to discuss accuracy, relevance, and robustness in detail, as these are key areas for the role.

Showcase Your Leadership Skills

Since this role involves managing a team, be prepared to share examples of how you've successfully led projects or teams in the past. Highlight your experience with line management and coaching, as well as how you handle shifting priorities.

Engage with Stakeholders

Demonstrate your effective stakeholder management skills by discussing how you've previously translated strategic priorities into actionable work. Bring examples of how you’ve built relationships with senior colleagues to drive projects forward.

Prepare for Technical Questions

Expect technical questions about evaluating model risks and automated evaluation approaches. Brush up on your ability to communicate complex concepts clearly, as you'll need to explain these to non-technical stakeholders.