Lead Site Reliability Engineer in London

London | Full-Time | £48,000 - £72,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Lead the design and improvement of resilient, high-performance systems for health and wellness.
  • Company: Holland & Barrett, a tech-driven organisation focused on health and wellness.
  • Benefits: Autonomy, collaborative culture, and opportunities for real impact in technology.
  • Why this job: Shape the future of technology while empowering millions with your expertise.
  • Qualifications: 5-8+ years in SRE or operational engineering, strong coding skills, and cloud experience.
  • Other info: Join a modern engineering culture that values innovation and continuous improvement.

The predicted salary is between £48,000 and £72,000 per year.

About the role

Own Reliability. Shape the Platform. Empower Millions. At Holland & Barrett, we are transforming into a truly product- and platform-led technology organisation - and we are looking for a Lead Site Reliability Engineer who is excited by scale, complexity, and impact. Our mission? Build and evolve the resilient, high-performance systems that power health and wellness for millions of customers. If you are obsessed with reliability, driven by automation, and thrive in high-ownership engineering cultures, this is your opportunity to lead from the front.

Responsibilities

  • Architect and improve cloud-native systems with reliability as a first-class principle.
  • Shape SLIs/SLOs, error budgets, capacity planning, and performance strategies.
  • Continuously evolve availability, efficiency, and resilience across our platforms.
  • Mentor SREs, platform engineers, and developers across the organisation.
  • Champion automation, observability, DevSecOps, and modern operational practices.
  • Influence engineering culture and architectural direction.
  • Own and lead high-severity incident response with calm, clarity, and technical depth.
  • Run world-class post-incident reviews and drive meaningful, measurable improvements.
  • Strengthen monitoring, alerting, on-call practices, and reliability processes.
  • Support resilience validation through load testing, stress testing, and chaos engineering.
  • Build tools and automation that remove toil and accelerate teams.
  • Develop CI/CD pipelines and Infrastructure-as-Code environments.
  • Drive consistency, repeatability, and self-service across engineering.
  • Partner with Security, Platform, and Engineering teams to align reliability with security and resilience goals.
  • Lead teams toward better design, operational readiness, and measurable service health.
  • Contribute to documentation, runbooks, and operational processes that scale.

Qualifications

  • 5-8+ years in SRE, Platform, Cloud Infrastructure, or operational engineering roles.
  • Hands-on experience architecting and improving large-scale, distributed systems.
  • Strong coding proficiency in Python, Go, Bash, or similar automation-focused languages.
  • Expertise with observability stacks: Datadog, Prometheus, Grafana, OpenTelemetry.
  • Deep AWS experience across EC2, EKS, Lambda, VPC, DynamoDB, S3, CloudFront, RDS, IAM, KMS, and more.
  • Proficiency with Terraform, CloudFormation, or AWS CDK.
  • Incident response leadership and root-cause analysis expertise.
  • Excellent documentation and communication skills.
  • Strong analytical and troubleshooting abilities.

Bonus

  • Experience mentoring or leading engineers within SRE or platform teams.
  • Experience with load testing, stress testing, and chaos engineering.
  • A passion for uplifting engineering culture through tooling, automation, and reliability-first thinking.

Why Build the Future with Holland & Barrett?

Technology is at the heart of our mission to make health & wellness accessible to everyone. As a Lead SRE, you won’t just keep systems running - you’ll design the reliability, resilience, and operational maturity that accelerate our entire business.

We offer:

  • A modern engineering culture built on autonomy, experimentation, and learning.
  • The chance to create real impact across critical customer and internal platforms.
  • A collaborative team that values innovation, continuous improvement, and technical excellence.

If you are ready to lead reliability for platforms with massive real-world impact, we would love to meet you. Apply now and help shape the future of H&B Technology.

Lead Site Reliability Engineer in London employer: Holland and Barrett

At Holland & Barrett, we pride ourselves on fostering a modern engineering culture that champions autonomy, innovation, and continuous learning. As a Lead Site Reliability Engineer, you will have the unique opportunity to make a significant impact on our health and wellness platforms while enjoying a collaborative work environment that prioritises technical excellence and employee growth. Join us in shaping the future of technology in a role where your expertise will empower millions of customers.

Contact Details:

Holland and Barrett Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Lead Site Reliability Engineer role in London

✨Tip Number 1

Network like a pro! Reach out to current employees at Holland & Barrett on LinkedIn. Ask them about their experiences and any tips they might have for the interview process. This insider info can give you a leg up!

✨Tip Number 2

Prepare for technical interviews by brushing up on your coding skills and system design principles. Practice common SRE scenarios and be ready to discuss your past projects in detail. We want to see your problem-solving skills in action!

✨Tip Number 3

Showcase your passion for reliability and automation during interviews. Share specific examples of how you've improved systems or processes in previous roles. This will demonstrate that you're not just a fit for the role, but also aligned with our mission.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Holland & Barrett.

We think you need these skills to ace the Lead Site Reliability Engineer role in London

Cloud-Native Systems Architecture
SLIs/SLOs Management
Capacity Planning
Performance Strategies
Incident Response Leadership
Automation
Observability
DevSecOps Practices
Load Testing
Stress Testing
Chaos Engineering
CI/CD Pipeline Development
Infrastructure-as-Code
AWS Services Proficiency
Coding Proficiency in Python, Go, Bash

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the Lead Site Reliability Engineer role. Highlight your hands-on experience with cloud-native systems and any relevant coding proficiencies. We want to see how you can own reliability and shape platforms!

Craft a Compelling Cover Letter: Your cover letter is your chance to show us your passion for reliability and automation. Share specific examples of how you've influenced engineering culture or led incident responses in the past. Let your personality shine through while keeping it professional!

Showcase Your Technical Skills: Don’t forget to mention your expertise with tools like Datadog, Terraform, and AWS services. We’re looking for someone who can drive consistency and self-service across engineering, so make sure we see your technical depth in your application.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity. Plus, it shows us you’re keen on joining our team at Holland & Barrett!

How to prepare for a job interview at Holland and Barrett

✨Know Your Stuff

Make sure you brush up on your knowledge of cloud-native systems and reliability principles. Be ready to discuss your hands-on experience with AWS services and observability stacks like Datadog or Prometheus. The more specific examples you can provide, the better!

✨Showcase Your Leadership Skills

Since this role involves mentoring and leading teams, be prepared to share instances where you've successfully guided others through complex situations. Highlight your incident response leadership and how you've driven improvements after high-severity incidents.

✨Demonstrate Your Automation Passion

Talk about your experience with automation tools and practices. Whether it's CI/CD pipelines or Infrastructure-as-Code, show how you've used these to enhance efficiency and reduce toil in previous roles. This will resonate well with their focus on automation.

✨Cultural Fit Matters

Holland & Barrett values a collaborative and innovative culture. Be ready to discuss how you've contributed to engineering culture in past positions. Share your thoughts on how you can uplift their engineering practices through tooling and reliability-first thinking.
