At a Glance
- Tasks: Lead the design of reliable cloud systems and mentor engineering teams.
- Company: Mission-driven tech organisation scaling a cloud-native platform.
- Benefits: Competitive salary, hybrid work, and opportunities for professional growth.
- Other info: Join a dynamic team focused on innovation and real-world impact.
- Why this job: Shape the future of reliability in a cutting-edge tech environment.
- Qualifications: Strong SRE or DevOps background with leadership experience.
The predicted salary is between 80000 - 100000 £ per year.
We are working with a mission-led technology organisation that is continuing to scale a fully cloud-native platform as part of a major initiative. As they move away from traditional data centres, they are investing heavily in building a highly reliable, scalable and observable cloud platform.
As a Lead SRE, you will act as a technical leader within the reliability function, setting direction and driving best practices across engineering teams. This is still a hands-on role, but with added ownership around shaping strategy, influencing architecture and mentoring engineers. You will be solving complex engineering problems across distributed systems, while helping define how reliability is embedded across the wider platform as it continues to scale.
Key Responsibilities- Leading the design and evolution of monitoring and observability systems
- Defining and driving SLOs, SLIs and error budgets across teams
- Owning incident management processes, post-mortems and continuous improvement
- Partnering with engineering teams to design resilient, fault-tolerant systems
- Driving automation across infrastructure, deployments and operational workflows
- Contributing to capacity planning, performance optimisation and cost efficiency
- Providing technical leadership through design reviews and architectural decisions
- Mentoring engineers and influencing reliability best practices across the organisation
- GCP and AWS
- Kubernetes and containerised workloads
- Terraform and Infrastructure as Code
- Prometheus, Grafana, Datadog and modern observability tooling
- CI/CD pipelines and automation tooling
- Python, Go or similar scripting languages
- Distributed systems at scale
- Strong background in SRE, DevOps or Platform Engineering at a senior or lead level
- Proven experience leading technical direction or mentoring engineers
- Experience running and supporting production systems at scale
- Strong understanding of observability, monitoring and reliability principles
- Hands-on experience with cloud infrastructure and Kubernetes
- Experience with Infrastructure as Code (Terraform or similar)
- Comfortable debugging complex systems across infrastructure and application layers
- Passionate about automation, reliability and improving engineering standards
This is a great opportunity to step into a technical leadership role, combining hands-on engineering with the chance to shape how reliability is delivered across a modern, cloud-native platform with real-world impact.
Lead SRE in City of London employer: Pulse Recruit
Contact Detail:
Pulse Recruit Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Lead SRE in City of London
✨Tip Number 1
Network like a pro! Reach out to folks in the industry on LinkedIn or at meetups. We can’t stress enough how important it is to make connections; you never know who might have the inside scoop on job openings.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those related to SRE and cloud platforms. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on technical questions and scenarios relevant to SRE. We recommend practising with friends or using mock interview platforms to get comfortable discussing your experience and problem-solving approach.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!
We think you need these skills to ace Lead SRE in City of London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the Lead SRE role. Highlight your background in SRE, DevOps, or Platform Engineering, and don’t forget to mention any hands-on experience with cloud infrastructure and Kubernetes.
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about reliability and automation. Share specific examples of how you've led technical direction or mentored engineers in the past.
Showcase Your Technical Skills: In your application, be sure to highlight your experience with tools like Terraform, Prometheus, and Grafana. Mention any projects where you’ve designed resilient systems or driven automation across infrastructure – we love seeing real-world impact!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity to shape the future of our cloud-native platform.
How to prepare for a job interview at Pulse Recruit
✨Know Your Tech Inside Out
Make sure you’re well-versed in the technologies mentioned in the job description, like GCP, AWS, Kubernetes, and Terraform. Brush up on your knowledge of observability tools like Prometheus and Grafana, as you might be asked to discuss how you’ve used them in past projects.
✨Showcase Your Leadership Skills
As a Lead SRE, you'll need to demonstrate your ability to lead and mentor. Prepare examples of how you've influenced technical direction or improved processes in previous roles. Think about specific instances where you’ve driven best practices or led a team through a challenging incident.
✨Prepare for Scenario-Based Questions
Expect questions that assess your problem-solving skills in real-world scenarios. Be ready to discuss how you would handle incidents, define SLOs, or improve system reliability. Practising these scenarios can help you articulate your thought process clearly during the interview.
✨Ask Insightful Questions
Interviews are a two-way street! Prepare thoughtful questions about the company’s approach to reliability, their current challenges, or how they measure success in their SRE teams. This shows your genuine interest in the role and helps you gauge if it’s the right fit for you.