At a Glance
- Tasks: Lead the design and improvement of resilient, high-performance systems for health and wellness.
- Company: Join Holland & Barrett, a tech-driven company transforming health and wellness access.
- Benefits: Enjoy health benefits, discounts, learning opportunities, and a supportive work culture.
- Other info: Inclusive environment that values diverse perspectives and promotes career growth.
- Why this job: Make a real impact by shaping reliability and operational excellence in technology.
- Qualifications: 5-8+ years in SRE or cloud infrastructure with strong coding skills.
The predicted salary is between 80000 - 100000 £ per year.
About H & B Own Reliability. Shape the Platform. Empower Millions. At Holland & Barrett, we're transforming into a truly product- and platform-led technology organisation — and we're looking for a Lead Site Reliability Engineer who's excited by scale, complexity, and impact. Our mission? Build and evolve the resilient, high-performance systems that power health and wellness for millions of customers. If you're obsessed with reliability, driven by automation, and thrive in high-ownership engineering cultures, this is your opportunity to lead from the front.
About the Role
- Reliability & Performance at Scale
- Architect and improve cloud-native systems with reliability as a first-class principle.
- Shape SLIs/SLOs, error budgets, capacity planning, and performance strategies.
- Continuously evolve availability, efficiency, and resilience across our platforms.
- Technical Leadership That Raises the Bar
- Mentor SREs, platform engineers, and developers across the organisation.
- Champion automation, observability, DevSecOps, and modern operational practices.
- Influence engineering culture and architectural direction.
- Operational Excellence
- Own and lead high-severity incident response with calm, clarity, and technical depth.
- Run world-class post-incident reviews and drive meaningful, measurable improvements.
- Strengthen monitoring, alerting, on-call practices, and reliability processes.
- Support resilience validation through load testing, stress testing, and chaos engineering.
- Automation, Tooling & Engineering Efficiency
- Build tools and automation that remove toil and accelerate teams.
- Develop CI/CD pipelines and Infrastructure-as-Code environments.
- Drive consistency, repeatability, and self-service across engineering.
- Cross-Team Collaboration
- Partner with Security, Platform, and Engineering teams to align reliability with security and resilience goals.
- Lead teams toward better design, operational readiness, and measurable service health.
- Contribute to documentation, runbooks, and operational processes that scale.
Key Requirements
- 5–8+ years in SRE, Platform, Cloud Infrastructure, or operational engineering roles.
- Hands-on experience architecting and improving large-scale, distributed systems.
- Strong coding proficiency in Python, Go, Bash, or similar automation-focused languages.
- Expertise with observability stacks: Datadog, Prometheus, Grafana, OpenTelemetry.
- Deep AWS experience across EC2, EKS, Lambda, VPC, DynamoDB, S3, CloudFront, RDS, IAM, KMS, and more.
- Proficiency with Terraform, CloudFormation, or AWS CDK.
- Incident response leadership and root-cause analysis expertise.
- Excellent documentation and communication skills.
- Strong analytical and troubleshooting abilities.
Bonus
- Experience mentoring or leading engineers within SRE or platform teams.
- Experience with load testing, stress testing, and chaos engineering.
- A passion for uplifting engineering culture through tooling, automation, and reliability-first thinking.
Why Build the Future with Holland & Barrett?
Technology is at the heart of our mission to make health & wellness accessible to everyone. As a Lead SRE, you won't just keep systems running — you'll design the reliability, resilience, and operational maturity that accelerates our entire business.
What we offer:
- Wellbeing & Lifestyle Benefits
- Health Cash Plan
- Life Assurance
- Bonus Scheme - Based on company & personal performance
- Virtual GP
- Private Medical care
- FREE at-home blood test kit
- Holiday Purchase option
- Pension Contribution scheme
- Access to ‘Wellhub' with gyms, studios and wellbeing apps
- Discounts & Savings
- 25% Colleague Discount with FREE Standard Delivery
- Exclusive Discounts from a wide range of partners
- £/€50 Annual Product Allowance to spend in store
- Learning & Development
- Access to a variety of learning opportunities, including Level 2-5 Apprenticeships, Workshops and our Digital Learning Library AND MORE!
Holland and Barrett is an equal opportunity employer. We welcome diverse perspectives and are committed to creating an inclusive environment for all colleagues. We understand that when our colleagues are listened to, respected and valued for who they are, we build an organisation with belonging at its heart – making health and wellness a way of life for everyone.
Lead Site Reliability Engineer employer: 慨正橡扯
At Holland & Barrett, we pride ourselves on being a forward-thinking employer that champions innovation and employee growth. As a Lead Site Reliability Engineer, you'll be at the forefront of transforming our technology landscape, with access to extensive learning opportunities, a supportive work culture, and a comprehensive benefits package that prioritises your wellbeing. Join us in making health and wellness accessible to millions while enjoying a collaborative environment that values your contributions and fosters professional development.
StudySmarter Expert Advice🤫
We think this is how you could land Lead Site Reliability Engineer
✨Tip Number 1
Network like a pro! Reach out to current or former employees at Holland & Barrett on LinkedIn. A friendly chat can give you insider info and might just get your application noticed.
✨Tip Number 2
Show off your skills in action! If you have a portfolio or GitHub with projects related to SRE, share it during interviews. It’s a great way to demonstrate your expertise and passion for reliability.
✨Tip Number 3
Prepare for those tricky technical questions! Brush up on your coding skills and be ready to discuss your experience with cloud-native systems and incident response. Practice makes perfect!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the team.
We think you need these skills to ace Lead Site Reliability Engineer
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter to highlight your experience in SRE and cloud infrastructure. We want to see how your skills align with our mission to build resilient systems for health and wellness.
Showcase Your Technical Skills:Don’t hold back on showcasing your coding proficiency and experience with observability stacks. Mention specific projects where you’ve used Python, Go, or tools like Datadog and Prometheus to demonstrate your hands-on expertise.
Highlight Leadership Experience:If you've mentored engineers or led incident response teams, make sure to include that! We’re looking for someone who can influence engineering culture and lead from the front, so share those experiences.
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates during the process!
How to prepare for a job interview at 慨正橡扯
✨Know Your Tech Inside Out
Make sure you’re well-versed in the technologies mentioned in the job description, especially AWS services and observability stacks like Datadog and Prometheus. Brush up on your coding skills in Python or Go, as you might be asked to solve technical problems on the spot.
✨Showcase Your Leadership Skills
Prepare examples of how you've mentored others or led incident responses in previous roles. Holland & Barrett is looking for someone who can influence engineering culture, so be ready to discuss how you've championed automation and operational excellence in your past experiences.
✨Demonstrate Problem-Solving Abilities
Be prepared to tackle hypothetical scenarios related to high-severity incidents or system failures. Think through your approach to root-cause analysis and how you would lead a post-incident review, highlighting your analytical and troubleshooting skills.
✨Align with Their Mission
Understand Holland & Barrett's mission to make health and wellness accessible. Be ready to discuss how your experience aligns with their goals and how you can contribute to building resilient systems that have a real-world impact on customers' lives.