Test & AI Evaluation Lead in Oxford

Test & AI Evaluation Lead in Oxford

Oxford Full-Time 50000 - 65000 £ / year (est.) No working from home possible
O

At a Glance

  • Tasks: Lead testing strategies for AI-driven systems in high-stakes environments.
  • Company: Join Oxford Dynamics, a fast-growing deep-tech company in AI and robotics.
  • Benefits: Competitive salary, flexible hours, hybrid work, and private healthcare.
  • Other info: Be part of an inclusive team making a real impact in technology.
  • Why this job: Shape the future of AI systems that protect lives and infrastructure.
  • Qualifications: Experience in testing complex software systems and automated testing.

The predicted salary is between 50000 - 65000 £ per year.

Salary: Competitive depending on experience

Location: 2-3 days on-site at our Harwell office with travel to client site when required

Contract type: Full-time permanent - 37.5 hours

A note from the Founders

Oxford Dynamics is at an inflection point. We operate in some of the most complex and high-stakes environments in the world - defence, national security, AI and robotics. The decisions we make now will define not just how fast we grow, but who we become. You will work closely with all the team. You will be trusted with judgment calls. You will influence the business. And you will see the impact of your work every day in the work we do. If you are excited by ownership, pace and purpose - and by building something that genuinely matters - we would love to hear from you.

Who We Are

Founded in 2020, Oxford Dynamics (OD) is a fast-growing UK deep-tech company developing AI and robotic systems designed to operate in mission-critical environments. Our flagship AVIS (A Very Intelligent System) AI framework fuses multi-modal data - text, imagery, telemetry and sensor feeds - enabling operators to interrogate complex information at speed and make better decisions under pressure. Our STRIDER robotic platform performs autonomous tasks in hazardous environments, protecting people while extending operational reach. Our ambition is simple but demanding: to converge AI and robotics so machines can sense, understand and act in complex, real-world environments. We work with defence and security organisations internationally to help protect nations, infrastructure and lives.

What you will be doing here/ why this role matters

Oxford Dynamics is a small team who rely on a collaborative and positive approach and so the right attitude for this role is equally as important as experience. We are at an important stage and time in our growth, and as a Senior AI Generative Robotics Engineer you will be an essential part of our success. You’ll work at the cutting edge of agentic and generative AI, building systems that move beyond lab demos and into real-world deployment at pace. At Oxford Dynamics, you’ll have the freedom to experiment in a fast-moving environment, the responsibility to deliver, and the opportunity to shape how multi-agent AI systems operate in complex, constrained, and high-trust environments. If you’re excited by agent orchestration, VLLMs, and deploying AI where it matters, this role is built for you!

Role Summary

We’re hiring a Test & AI Evaluation Lead to own how Oxford Dynamics validates its AI-driven, mission-critical systems - from multi-agent orchestration and LLM outputs through to cloud infrastructure and real-time user-facing applications. You’ll design and lead test approaches where correctness, resilience, and security matter as much as feature velocity. Working embedded with AI, Backend, Frontend, and DevOps, you’ll shape how we validate agent behaviours, data pipelines, and end-to-end operational workflows - from research prototypes through to production deployments for Defence and Security customers. Quality is built in from day one, not inspected at the end.

Key Responsibilities

  • Define and own the end-to-end test strategy across AI, backend, frontend, and infrastructure layers.
  • Establish testing standards appropriate for agentic AI systems, including non-deterministic behaviour and probabilistic outputs.
  • Ensure testing aligns with mission-critical, safety-conscious, and security-first delivery expectations.
  • Act as the primary quality authority across projects, advising engineering and product leadership on risk and readiness.

AI & Data-Focused Testing

  • Design approaches for testing multi-agent workflows, including orchestration logic, memory/state handling, and tool integrations.
  • Define validation strategies for LLM outputs, including groundedness, hallucination detection, task success rates, and regression testing.
  • Work with AI Engineers to embed evaluation metrics and pass/fail thresholds into pipelines.
  • Validate data ingestion, transformation, and inference pipelines across structured and unstructured data sources.

Automation & Tooling

  • Drive a test-automation-first mindset, integrating tests into CI/CD pipelines (GitHub Actions, Argo CD).
  • Oversee automated testing across API and service layers, UI (E2E and accessibility), and infrastructure and deployment workflows.
  • Select, implement, and evolve testing tools and frameworks appropriate to modern cloud-native and AI systems.

Non-Functional Testing

  • Own performance, scalability, reliability, and resilience testing for distributed systems.
  • Coordinate security testing activities in line with secure-by-design principles (e.g. IAM, secrets handling, data boundaries).
  • Validate backup, disaster recovery, and failover scenarios alongside DevOps and Backend teams.

Delivery & Collaboration

  • Embed with delivery teams to ensure testing is planned early and executed continuously.
  • Work closely with Product and Engineering to define clear acceptance criteria and definition of done.
  • Provide clear, decision-ready quality reporting to technical and non-technical stakeholders.
  • Support customer-facing demonstrations, trials, and operational readiness assessments.

Required Skills & Experience

  • Proven experience as a Test Manager, Senior Test Lead, or equivalent on complex software systems.
  • Strong track record of taking applications into production in regulated environments.
  • Strong background in automated testing across APIs, services, and UIs, integrated into CI/CD pipelines.
  • Experience testing distributed, cloud-native systems (AWS, GCP, or Kubernetes), including performance, reliability, and resilience.
  • Awareness of compliance frameworks (e.g. ISO 27001, NIST, OWASP).
  • ISTQB Advanced / Test Manager certification or equivalent practical experience.
  • SC Clearance or eligibility to obtain UK SC Clearance.

Preferred Experience

  • Experience in UK defence, public sector, or security environments.
  • Experience testing AI/ML/LLM-based systems, including non-deterministic outputs.
  • Exposure to agent-based or workflow-driven architectures.

Soft Skills

  • A pragmatic, delivery-focused mindset – able to balance speed with rigour.
  • Comfortable operating in fast-moving, ambiguous, R&D-heavy environments.
  • Confidence challenging assumptions and raising quality risks early.
  • Strong written and verbal communication, especially around complex technical risk.

Why Oxford Dynamics?

Join the most exciting growth area in the UK: AI and Robotics! Every member of the Oxford Dynamics team has a major impact on the products and services we provide. Regardless of job title you’ll get to make a real difference and learn from colleagues about all areas of our business.

Benefits

  • Salary: negotiable based on experience and attitudes
  • Rapid career progression with meaningful ownership of core systems
  • Opportunity to shape the future of a fast-growing, successful, early-stage business
  • Flexible working hours
  • Hybrid working model
  • Company pension (UK Government NEST scheme) with company contributions at 4%
  • Private Healthcare
  • 29 days holiday in addition to public holidays (Full Time Equivalent)

Oxford Dynamics is committed to creating an inclusive team experience for all. Regardless of race, gender, religion, sexual orientation, age, disability, or parental status, we believe our work is at its best when everyone feels free to be their authentic self.

Why This Role?

You’ll play a critical shaping role in how Oxford Dynamics delivers trustworthy, production-ready AI systems into some of the most demanding operational environments there are. If you enjoy working close to the technology, influencing how systems are built – not just tested – and tackling the realities of validating AI-driven software, this role gives you genuine ownership and impact.

Test & AI Evaluation Lead in Oxford employer: Oxford Dynamics Limited

At Oxford Dynamics, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters collaboration and innovation in the cutting-edge fields of AI and robotics. Our Harwell office provides a unique opportunity to work closely with a small, dedicated team where every member's contributions are valued, and rapid career progression is supported through meaningful ownership of core systems. With flexible working hours, a hybrid model, and a commitment to inclusivity, we empower our employees to thrive both personally and professionally while making a significant impact in mission-critical environments.

O

Contact Details:

Oxford Dynamics Limited Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Test & AI Evaluation Lead in Oxford

Tip Number 1

Network like a pro! Get out there and connect with people in the industry. Attend meetups, conferences, or even online webinars. You never know who might have the inside scoop on job openings or can put in a good word for you.

Tip Number 2

Show off your skills! Create a portfolio or a personal project that showcases your expertise in AI and testing. This is your chance to demonstrate what you can do beyond just a CV. Make it easy for potential employers to see your capabilities.

Tip Number 3

Prepare for interviews by practising common questions and scenarios related to AI evaluation and testing. Think about how you would approach real-world problems and be ready to discuss your thought process. Confidence is key!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love hearing from passionate candidates who are excited about making an impact in AI and robotics.

We think you need these skills to ace Test & AI Evaluation Lead in Oxford

Test Strategy Development
Automated Testing
CI/CD Integration
Multi-Agent Workflow Testing
LLM Output Validation
Cloud-Native Systems Testing
Performance Testing

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter to highlight your experience with AI and testing. We want to see how your skills align with our mission at Oxford Dynamics, so don’t hold back on showcasing relevant projects!

Show Your Passion:Let your enthusiasm for AI and robotics shine through in your application. We’re looking for candidates who are excited about the impact of their work, so share why you’re drawn to this field and what motivates you to join our team.

Be Clear and Concise:When writing your application, keep it straightforward and to the point. We appreciate clarity, so make sure your key achievements and experiences stand out without unnecessary fluff. Remember, less is often more!

Apply Through Our Website:We encourage you to submit your application directly through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it’s super easy to do!

How to prepare for a job interview at Oxford Dynamics Limited

Know Your Stuff

Make sure you understand the ins and outs of AI evaluation and testing. Brush up on your knowledge of multi-agent orchestration, LLM outputs, and cloud infrastructure. Being able to discuss these topics confidently will show that you're not just interested in the role but also have the expertise to back it up.

Show Your Collaborative Spirit

Since this role involves working closely with various teams, be prepared to demonstrate your collaborative skills. Share examples from your past experiences where you successfully worked with engineers, product managers, or other stakeholders to achieve a common goal. This will highlight your ability to fit into their team culture.

Prepare for Technical Questions

Expect some technical questions related to automated testing, CI/CD pipelines, and non-functional testing. Brush up on your knowledge of compliance frameworks like ISO 27001 and NIST. Practising how to articulate your thought process when solving complex problems can really set you apart.

Ask Insightful Questions

At the end of the interview, don’t shy away from asking questions. Inquire about their current projects, challenges they face in testing AI systems, or how they envision the future of AI and robotics at Oxford Dynamics. This shows your genuine interest in the company and the role, and it gives you valuable insights into their work environment.