Site Reliability Engineering Lead in Cardiff

Site Reliability Engineering Lead in Cardiff

Cardiff Full-Time 80000 - 100000 € / year (est.) No home office possible
RELX Group

At a Glance

  • Tasks: Lead a team to enhance platform reliability and drive automation for better service health.
  • Company: Join a forward-thinking tech company focused on operational excellence and teamwork.
  • Benefits: Competitive salary, career development opportunities, and a supportive work environment.
  • Other info: Dynamic role with opportunities for growth and collaboration across teams.
  • Why this job: Make a real impact by improving platform resilience and helping others succeed.
  • Qualifications: Experience in leading teams and strong knowledge of SRE practices.

The predicted salary is between 80000 - 100000 € per year.

As a Site Reliability Engineering Lead, you'll lead and partner with cross-functional teams to keep our platforms reliable, resilient, secure, and continuously improving. If you're passionate about operational excellence and helping others succeed, we'd love to hear from you.

In this role, you will lead a team of Site Reliability Engineers focused on improving the reliability, resilience, and operational readiness of the platforms your team supports. You'll partner closely with engineering, product, and security teams to reduce operational risk, strengthen incident response, and drive meaningful automation that improves service health and customer outcomes.

  • Lead and develop a team of SREs - set direction, manage conflicting priorities and trade-offs, remove blockers, support wellbeing on-call, and keep work focused on the highest reliability risks and opportunities.
  • People management: hire and onboard talent, provide regular coaching and feedback, support career development, and contribute to performance and progression processes.
  • Own service reliability for the platforms your team supports: define and evolve operation metrics, uphold standards for observability, monitoring, alerting, and operational readiness.
  • Work closely with Security and Engineering to embed secure-by-default operations (e.g., patching, access controls, secrets management) and support audit and compliance needs.
  • Participate in the on-call rota (including escalation/incident leadership as needed) and continuously improve runbooks, alerts, and operational readiness.
  • Act as a senior escalation point during incidents, providing calm, structured coordination to restore service quickly and safely, and ensuring clear stakeholder communications.
  • Lead blameless post-incident reviews and Root Cause Analyses (RCAs), ensuring actions are prioritised, tracked, and shared across teams.
  • Partner with product and engineering teams to design for resilience, capacity, and recovery - systems that fail gracefully, recover quickly, and meet customer reliability expectations; drive automation and reduce toil by improving platform tooling, CI/CD, standards, and self-service capabilities.

Experience leading, mentoring, or managing engineers. Strong grasp of SRE/platform engineering practices, including Infrastructure-as-code, observability, incident management, on-call operations, and post-incident reviews. Confidence working with cloud platforms, and a pragmatic approach to automation and reducing operational toil. Clear, structured communication with both engineers and stakeholders, especially when handling operational risk or coordinating incidents. A collaborative, learning-focused approach that builds psychological safety and values curiosity over blame.

Site Reliability Engineering Lead in Cardiff employer: RELX Group

As a Site Reliability Engineering Lead, you will thrive in a dynamic and supportive work culture that prioritises operational excellence and employee growth. Our commitment to continuous improvement and collaboration ensures that you will have ample opportunities to develop your skills while leading a talented team dedicated to enhancing platform reliability. Located in a vibrant area, we offer a unique blend of professional challenges and a fulfilling work environment, making us an exceptional employer for those seeking meaningful and rewarding careers.

RELX Group

Contact Detail:

RELX Group Recruiting Team

StudySmarter Expert Advice🀫

We think this is how you could land Site Reliability Engineering Lead in Cardiff

✨Tip Number 1

Network like a pro! Reach out to your connections in the industry, attend meetups, and engage in online forums. The more people you know, the better your chances of landing that SRE Lead role.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to reliability and automation. This gives potential employers a taste of what you can bring to the table.

✨Tip Number 3

Prepare for interviews by practising common SRE scenarios. Think about how you'd handle incidents, improve service health, and collaborate with cross-functional teams. Confidence is key!

✨Tip Number 4

Don't forget to apply through our website! We love seeing candidates who are genuinely interested in joining our team. Plus, it makes tracking your application super easy for us.

We think you need these skills to ace Site Reliability Engineering Lead in Cardiff

Leadership
People Management
Operational Excellence
Incident Management
Service Reliability
Observability
Monitoring

Some tips for your application 🫑

Show Your Passion for Reliability:When writing your application, let us see your enthusiasm for operational excellence. Share specific examples of how you've improved reliability in past roles or projects. This will help us understand your commitment to keeping platforms resilient and secure.

Highlight Team Leadership Experience:As a Site Reliability Engineering Lead, you'll be managing a team. Make sure to showcase your experience in leading and mentoring engineers. Talk about how you've supported career development and managed conflicting priorities to keep the team focused on high-impact tasks.

Be Clear and Structured:We value clear communication, especially when it comes to operational risk. In your application, use a structured format to present your experiences and skills. This will demonstrate your ability to communicate effectively with both technical teams and stakeholders.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about our culture and values!

How to prepare for a job interview at RELX Group

✨Know Your SRE Fundamentals

Brush up on your knowledge of Site Reliability Engineering practices. Be ready to discuss Infrastructure-as-Code, observability, and incident management. This will show that you understand the core principles and can lead a team effectively.

✨Showcase Your Leadership Skills

Prepare examples of how you've led teams in the past, especially in high-pressure situations. Talk about how you’ve managed conflicting priorities and supported team wellbeing during on-call duties. This will demonstrate your capability to lead and mentor others.

✨Communicate Clearly and Calmly

During the interview, practice clear and structured communication. You might be asked about handling incidents or operational risks, so articulate your thought process and how you would coordinate with stakeholders. This reflects your ability to manage crises effectively.

✨Emphasise Collaboration and Continuous Improvement

Highlight your experience working with cross-functional teams, particularly in driving automation and improving service health. Discuss how you foster a collaborative environment that values learning and psychological safety, which is crucial for an SRE Lead role.