Site Reliability Engineering Lead in Cardiff

Site Reliability Engineering Lead in Cardiff

Cardiff Full-Time 70000 - 90000 € / year (est.) No home office possible
LexisNexis Risk Solutions

At a Glance

  • Tasks: Lead a team to enhance platform reliability and incident response for critical services.
  • Company: Join LexisNexis Risk Solutions, a leader in risk assessment solutions.
  • Benefits: Enjoy competitive pay, wellness support, and tailored benefits for your location.
  • Other info: Collaborative culture focused on learning and career development.
  • Why this job: Make a real impact by driving automation and improving service health.
  • Qualifications: Experience in leading engineers and strong knowledge of SRE practices required.

The predicted salary is between 70000 - 90000 € per year.

Are you excited to lead a team that improves platform reliability, resilience, and incident response for critical services? Do you enjoy building a supportive on-call culture while driving automation and secure-by-default operations?

LexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions focused on helping businesses of all sizes drive higher revenue growth, maximize operational efficiencies, and improve customer experience. Our solutions help our customers solve difficult problems in the areas of Anti-Money Laundering/Counter Terrorist Financing, Identity Authentication & Verification, Fraud and Credit Risk mitigation and Customer Data Management.

As a Site Reliability Engineering Lead, you’ll lead and partner with cross-functional teams to keep our platforms reliable, resilient, secure, and continuously improving — if you’re passionate about operational excellence and helping others succeed, we’d love to hear from you.

In this role, you will lead a team of Site Reliability Engineers focused on improving the reliability, resilience, and operational readiness of the platforms your team supports. You’ll partner closely with engineering, product, and security teams to reduce operational risk, strengthen incident response, and drive meaningful automation that improves service health and customer outcomes.

Responsibilities
  • Lead and develop a team of SREs — set direction, manage conflicting priorities and trade-offs, remove blockers, support wellbeing on-call, and keep work focused on the highest reliability risks and opportunities.
  • People management: hire and onboard talent, provide regular coaching and feedback, support career development, and contribute to performance and progression processes.
  • Own service reliability for the platforms your team supports: define and evolve operation metrics, uphold standards for observability, monitoring, alerting, and operational readiness.
  • Work closely with Security and Engineering to embed secure-by-default operations (e.g., patching, access controls, secrets management) and support audit and compliance needs.
  • Participate in the on-call rota (including escalation/incident leadership as needed) and continuously improve runbooks, alerts, and operational readiness.
  • Act as a senior escalation point during incidents, providing calm, structured coordination to restore service quickly and safely, and ensuring clear stakeholder communications.
  • Lead blameless post-incident reviews and Root Cause Analyses (RCAs), ensuring actions are prioritised, tracked, and shared across teams.
  • Partner with product and engineering teams to design for resilience, capacity, and recovery — systems that fail gracefully, recover quickly, and meet customer reliability expectations; drive automation and reduce toil by improving platform tooling, CI/CD, standards, and self-service capabilities.
Requirements
  • Experience leading, mentoring, or managing engineers.
  • Strong grasp of SRE/platform engineering practices, including Infrastructure-as-code, observability, incident management, on-call operations, and post-incident reviews.
  • Confidence working with cloud platforms, and a pragmatic approach to automation and reducing operational toil.
  • Clear, structured communication with both engineers and stakeholders, especially when handling operational risk or coordinating incidents.
  • A collaborative, learning-focused approach that builds psychological safety and values curiosity over blame.

Site Reliability Engineering Lead in Cardiff employer: LexisNexis Risk Solutions

At LexisNexis Risk Solutions, we pride ourselves on fostering a collaborative and innovative work culture that prioritises employee well-being and professional growth. As a Site Reliability Engineering Lead, you will not only lead a talented team dedicated to enhancing platform reliability but also benefit from our commitment to continuous learning and development opportunities. With a focus on operational excellence and a supportive environment, we empower our employees to thrive while making a meaningful impact in the risk assessment industry.

LexisNexis Risk Solutions

Contact Detail:

LexisNexis Risk Solutions Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Site Reliability Engineering Lead in Cardiff

Tip Number 1

Network like a pro! Reach out to your connections in the industry, attend meetups, and engage in online forums. The more people you know, the better your chances of landing that SRE Lead role.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions to open-source. This gives potential employers a taste of what you can bring to the table.

Tip Number 3

Prepare for interviews by practising common SRE scenarios and incident management questions. Mock interviews with friends or mentors can help you nail down your responses and boost your confidence.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are genuinely interested in joining our team!

We think you need these skills to ace Site Reliability Engineering Lead in Cardiff

Leadership
Mentoring
Site Reliability Engineering (SRE)
Infrastructure-as-Code
Observability
Incident Management
On-Call Operations

Some tips for your application 🫡

Show Your Passion for Reliability:When writing your application, let us see your enthusiasm for improving platform reliability and resilience. Share specific examples of how you've tackled similar challenges in the past, as this will resonate with our mission at StudySmarter.

Highlight Your Leadership Skills:As a Site Reliability Engineering Lead, we want to know about your experience in leading teams. Make sure to mention any mentoring or coaching you've done, and how you've supported your team's growth and wellbeing, especially during on-call situations.

Be Clear and Structured:Communication is key in this role! Use clear and structured language in your application to demonstrate your ability to convey complex ideas simply. This will show us that you can effectively communicate with both engineers and stakeholders.

Tailor Your Application:Make your application stand out by tailoring it to our specific needs. Reference the job description and align your skills and experiences with what we're looking for. And remember, applying through our website is the best way to get noticed!

How to prepare for a job interview at LexisNexis Risk Solutions

Know Your SRE Fundamentals

Brush up on your Site Reliability Engineering principles, especially around incident management and observability. Be ready to discuss how you've implemented these practices in past roles, as this will show your depth of knowledge and experience.

Showcase Your Leadership Skills

Prepare examples of how you've led teams, managed conflicting priorities, and supported team wellbeing. Highlight your approach to mentoring and developing talent, as this role requires strong people management skills.

Communicate Clearly and Calmly

During the interview, practice clear and structured communication. Be prepared to explain complex technical concepts in a way that non-technical stakeholders can understand, especially when discussing operational risks or incident responses.

Demonstrate a Collaborative Mindset

Emphasise your experience working with cross-functional teams, particularly in embedding secure-by-default operations. Share examples of how you've fostered a learning-focused environment that values curiosity and psychological safety.