At a Glance
- Tasks: Lead reliability initiatives and enhance operational performance across critical production systems.
- Company: Join a leading tech organisation focused on innovation and excellence.
- Benefits: Competitive day-rate, flexible working arrangements, and opportunities for professional growth.
- Why this job: Make a real impact by driving reliability and scalability in cutting-edge tech environments.
- Qualifications: 7+ years in site reliability or systems engineering with strong programming skills.
- Other info: Dynamic role with mentorship opportunities and a focus on best practices.
The predicted salary is between 54000 - 84000 £ per year.
I am partnering with a leading tech organisation to recruit a Senior Site Reliability Engineer on a day-rate contract for 12 months. This is a hands-on, high-impact role working closely with engineering teams to drive reliability, scalability, and operational excellence across critical production systems.
What You’ll Do:
- Lead reliability initiatives and own operational performance across core services
- Define and refine SLIs, SLOs, and error budgets aligned with business outcomes
- Drive sophisticated incident management, post-incident analysis, and remediation planning
- Influence system architecture for high availability, resilience, and multi-region disaster recovery
- Build automation and CI/CD pipelines, applying safe deployment patterns like canary, blue/green, or progressive delivery
- Develop observability solutions (metrics, logs, traces) and troubleshoot performance bottlenecks
- Mentor engineers and embed SRE best practices across the organisation
- Operate cloud-native and containerised workloads at scale, leveraging IaC tools to manage resilient platforms
What You Bring:
- 7+ years in site reliability, production, or systems engineering roles
- Hands-on experience with cloud platforms (AWS, Azure, GCP) and Kubernetes
- Strong programming skills (Python, Go, Java) for automation and tooling
- Proven experience leading high-severity incidents and delivering systemic improvements
- Deep understanding of distributed systems, fault isolation, and scalability
Bonus Experience:
- Multi-cloud or multi-region resilience architecture
- Observability tools (Prometheus, Grafana, Datadog)
- IaC experience (Terraform, CloudFormation)
Site Reliability Engineer in London employer: KennedyPearce Consulting
Contact Detail:
KennedyPearce Consulting Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer in London
✨Tip Number 1
Network like a pro! Reach out to your connections in the tech world, especially those in site reliability. A friendly chat can lead to insider info about job openings that aren't even advertised yet.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions to SRE practices. This gives potential employers a taste of what you can bring to the table.
✨Tip Number 3
Prepare for interviews by brushing up on your incident management and cloud platform knowledge. Be ready to discuss real-life scenarios where you've driven reliability and scalability in production systems.
✨Tip Number 4
Don't forget to apply through our website! We have loads of opportunities waiting for talented Site Reliability Engineers like you. Plus, it’s a great way to get noticed by our hiring team.
We think you need these skills to ace Site Reliability Engineer in London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with cloud platforms, programming skills, and any relevant projects that showcase your ability to drive reliability and operational excellence.
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about SRE and how your background aligns with the responsibilities outlined in the job description. Don’t forget to mention specific achievements that demonstrate your impact.
Showcase Your Technical Skills: Be sure to highlight your hands-on experience with tools like Kubernetes, Terraform, and observability solutions. Mention any specific incidents you've managed or improvements you've implemented to show you can handle high-severity situations.
Apply Through Our Website: We encourage you to apply through our website for a smoother application process. It helps us keep track of your application and ensures you don’t miss out on any updates from us!
How to prepare for a job interview at KennedyPearce Consulting
✨Know Your Stuff
Make sure you brush up on your technical skills, especially around cloud platforms like AWS, Azure, or GCP. Be ready to discuss your hands-on experience with Kubernetes and how you've used programming languages like Python or Go for automation.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of high-severity incidents you've managed. Talk about the steps you took for incident management and how you drove systemic improvements. This will demonstrate your ability to handle pressure and lead effectively.
✨Understand the Role's Impact
Familiarise yourself with SLIs, SLOs, and error budgets. Be prepared to discuss how these metrics align with business outcomes and how you've influenced system architecture for reliability and resilience in your previous roles.
✨Be Ready to Mentor
Since mentoring is part of the role, think about how you've embedded SRE best practices in past positions. Share your approach to guiding engineers and fostering a culture of operational excellence within teams.