AI Evaluation Lead in Leeds

AI Evaluation Lead in Leeds

Leeds Full-Time 72100 - 100000 € / year (est.) Home office (partial)
Financial Conduct Authority

At a Glance

  • Tasks: Lead AI evaluation projects, ensuring quality and performance of GenAI outputs.
  • Company: Join the FCA, a key player in regulating UK financial services.
  • Benefits: Enjoy 28 days leave, hybrid work, private healthcare, and a flexible benefits scheme.
  • Other info: Diverse and inclusive culture with excellent career development opportunities.
  • Why this job: Make a real impact on consumer protection and economic growth in the UK.
  • Qualifications: Experience in AI/ML initiatives and strong stakeholder management skills required.

The predicted salary is between 72100 - 100000 € per year.

Division: Data, Technology and Innovation

Department: AI Product Delivery

Salary: National (Edinburgh and Leeds) ranging from £72,100 to £100,000 and London from £79,300 to £110,000 per annum (salary offered will be based on skills and experience)

This role is graded as: Technical Specialist – Regulatory

About The FCA And Team

We regulate financial services firms in the UK, to keep financial markets fair, thriving and effective. By joining us, you’ll play a key part in protecting consumers, driving economic growth, and shaping the future of UK finance services.

The Data, Technology and Innovation (DTI) division enables the FCA to be a digital‑first, data‑led smart regulator by delivering a secure, agile, and cost‑effective technology and data ecosystem that drives better decisions, transparency, and operational efficiency.

Working alongside the wider AI Programme (which will continue to oversee/coordinate AI activity across the FCA), the department will partner with business leads to shape and deliver work in priority areas — Authorisations, SPC, EMO and Anti‑Money Laundering.

Role Responsibilities

  • Define and own evaluation frameworks for GenAI outputs covering quality measures such as accuracy, relevance, robustness and hallucination rates.
  • Design, curate and govern test datasets and benchmarks to ensure consistent model and solution performance assessment over time.
  • Support development of automated evaluation pipelines and operational reporting to embed assurance into delivery.
  • Identify, assess and mitigate risks in model behaviour for example: bias, errors, safety concerns and edge cases with clear escalation and control recommendations.
  • Manage delivery through well‑defined work packages, setting priorities, operating standards and performance objectives, including line management of Business Analysts.
  • Engage senior stakeholders to understand strategic priorities, build a pipeline of scoped and prioritised projects and translate needs into analytics‑led solutions with clear business value.
  • Work in the public interest protecting 40 million UK consumers who rely on financial services and supporting long‑term economic growth from an industry contributing 12% of UK economic output.
  • Manage digital and data‑led change by encouraging innovative experiments and working with senior stakeholders while empowering a diverse team to collaborate openly.

Minimum Skills Required

  • Experience delivering analytics, data science and AI/ML initiatives, including defining success measures, evaluating model/product performance, and applying innovative problem‑solving approaches.
  • Demonstrated experience leading people (line management, coaching, and performance reviews), capable of overseeing a portfolio of projects and adjusting delivery as priorities shift.
  • Effective stakeholder management skills, including working with senior colleagues to translate priorities into well‑scoped, prioritised work with clear outcomes and measurable value.

Essential

  • Demonstrable experience designing and applying evaluation frameworks for GenAI ML solutions such as accuracy, relevance, robustness, consistency and hallucination or error rates, including defining clear acceptance thresholds.
  • Experience curating, documenting and governing test datasets and benchmarks, including version control to enable repeatable assessment and comparability over time.
  • Ability to identify, assess and mitigate model risks including bias, safety concerns, data leakage, harmful outputs and edge cases and to recommend appropriate controls and escalation routes.
  • Experience building or specifying automated evaluation and monitoring approaches such as scripted test runs, scoring and dashboards or management information to embed assurance into delivery and ongoing operations.
  • Demonstrated analytical skills with the ability to interpret evaluation results, communicate uncertainty and limitations, translate complex technical concepts for senior stakeholders and make evidence‑based recommendations to improve solution performance.
  • Consistent delivery discipline including planning and managing evaluation work packages, prioritising across competing demands and ensuring outputs meet agreed standards and timelines.
  • Clear written documentation skills producing evaluation reports, test plans and assurance artefacts that are audit ready and suitable for governance forums.

Benefits

  • 28 days annual leave plus bank holidays.
  • Hybrid model where employees work a minimum of 40% in the office each month (expectation of 50% for senior leaders); from September minimum of 50% in the office each month (expectation of 60% for Directors and Executive Directors).
  • Non‑contributory pension (8–12% depending on age) and life assurance at eight times your salary.
  • Private healthcare with Bupa, income protection, and 24/7 Employee Assistance.
  • 35 hours of paid volunteering annually.
  • A flexible benefits scheme designed around your lifestyle.

Our values & culture

Our colleagues are the key to our success as a regulator. We are committed to fostering a diverse and inclusive culture: one that’s free from discrimination and bias, celebrates difference, and supports colleagues to deliver at their best. We believe that our differences and similarities enable us to be a better organisation – one that makes better decisions, drives innovation, and delivers better regulation.

If you require any adjustments due to a disability or condition, your recruiter is here to help - reach out for tailored support. We welcome diverse working styles and aim to find flexible solutions that suit both the role and individual needs, including options like part‑time and job sharing where applicable.

Disability Confident: our hiring approach

We’re proud to be a Disability Confident Employer, and therefore, people or individuals with disabilities and long‑term conditions who best meet the minimum criteria for a role will go through to the next stage of the recruitment process. In cases of high application volumes, we may progress applicants whose experience most closely matches the role’s key requirements.

Useful information and timeline

  • Advert Closing: 19th May
  • CV Review/Shortlist: 22nd May
  • First Stage Interviews W/C: 1st June
  • Second Stage Interviews W/C: 8th June

Your Recruiter will discuss the process in detail with you during screening for the role, therefore, please make them aware if you are going to be unavailable for any date during this time.

AI Evaluation Lead in Leeds employer: Financial Conduct Authority

The FCA is an exceptional employer, offering a dynamic work environment in the heart of Edinburgh or Leeds, where you can contribute to shaping the future of UK financial services. With a strong commitment to employee growth, a diverse and inclusive culture, and generous benefits including a non-contributory pension and private healthcare, the FCA empowers its employees to thrive both personally and professionally while making a meaningful impact on the lives of 40 million consumers.

Financial Conduct Authority

Contact Detail:

Financial Conduct Authority Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Evaluation Lead in Leeds

Tip Number 1

Network like a pro! Reach out to people in the industry, especially those already working at the FCA. A friendly chat can give you insights and maybe even a referral. Don’t be shy; we all love a good conversation!

Tip Number 2

Prepare for your interviews by practising common questions related to AI evaluation and data science. We recommend doing mock interviews with friends or using online platforms. The more you practise, the more confident you'll feel when it’s showtime!

Tip Number 3

Showcase your skills! Create a portfolio that highlights your experience with AI/ML initiatives and evaluation frameworks. This is your chance to shine and demonstrate how you can add value to the FCA.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, keep an eye on the job postings regularly; new opportunities pop up all the time!

We think you need these skills to ace AI Evaluation Lead in Leeds

Evaluation Frameworks
GenAI Outputs Assessment
Test Dataset Governance
Automated Evaluation Pipelines
Risk Identification and Mitigation
Stakeholder Management
Analytics and Data Science

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the AI Evaluation Lead role. Highlight your experience with analytics, data science, and AI/ML initiatives. We want to see how your skills align with the responsibilities outlined in the job description.

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about the role and how your background makes you a perfect fit. Don’t forget to mention your experience with evaluation frameworks and stakeholder management.

Showcase Your Achievements:When detailing your experience, focus on specific achievements that demonstrate your ability to deliver results. Use metrics where possible to quantify your impact, especially in areas like model performance and risk mitigation.

Apply Through Our Website:We encourage you to apply through our website for a smoother application process. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen to join our team!

How to prepare for a job interview at Financial Conduct Authority

Know Your AI Inside Out

Make sure you’re well-versed in the latest trends and technologies in AI, especially around evaluation frameworks for GenAI outputs. Brush up on concepts like accuracy, relevance, and robustness, as these will likely come up during your interview.

Prepare Real-World Examples

Think of specific projects where you've designed or applied evaluation frameworks for AI solutions. Be ready to discuss how you defined success measures and tackled challenges like bias or safety concerns. Concrete examples will help demonstrate your expertise.

Engage with Stakeholders

Since stakeholder management is key for this role, prepare to discuss how you’ve successfully engaged with senior colleagues in the past. Think about how you translated their priorities into actionable work packages and the outcomes that resulted from those collaborations.

Showcase Your Analytical Skills

Be prepared to talk about how you interpret evaluation results and communicate findings. Practice explaining complex technical concepts in simple terms, as this will be crucial when discussing your recommendations with non-technical stakeholders.