At a Glance
- Tasks: Design and automate monitoring systems while ensuring reliability for millions of users.
- Company: Mission-driven tech organisation focused on building a scalable cloud platform.
- Benefits: Hybrid work model, competitive salary, and opportunities for professional growth.
- Why this job: Join a team tackling complex engineering challenges with real-world impact.
- Qualifications: Experience in SRE, DevOps, or Platform Engineering with cloud infrastructure knowledge.
- Other info: Dynamic environment with a focus on automation and engineering efficiency.
The predicted salary is between £70,000 and £90,000 per year.
Location: London (Hybrid – 1 day per week in office)
We are working with a mission-led technology organisation that is continuing to scale a fully cloud-native platform as part of a major initiative. As they move away from traditional data centres, they are investing heavily in building a highly reliable, scalable and observable cloud platform.
As a Senior SRE, you will play a key role in ensuring the reliability and performance of systems that support millions of customers. This is a hands-on engineering role where you will work closely with platform, cloud and product teams to embed reliability into everything they build. You will be solving complex engineering problems across distributed systems, helping improve observability, automation and incident response as the platform continues to scale.
Key Responsibilities
- Designing, improving and automating monitoring and observability systems
- Defining and managing SLOs, SLIs and error budgets
- Supporting incident response, root cause analysis and post-mortems
- Working with engineering teams to design resilient, fault-tolerant systems
- Driving automation across infrastructure, deployments and operations
- Contributing to capacity planning, performance tuning and cost optimisation
- Participating in design reviews to improve reliability and scalability
Tech Environment
- GCP and AWS
- Kubernetes and containerised workloads
- Terraform and Infrastructure as Code
- Prometheus, Grafana, Datadog and modern observability tooling
- CI/CD pipelines and automation tooling
- Python, Go or similar scripting languages
- Distributed systems at scale
About You
- Strong background in SRE, DevOps or Platform Engineering
- Experience running and supporting production systems at scale
- Strong understanding of observability, monitoring and reliability principles
- Hands-on experience with cloud infrastructure and Kubernetes
- Experience with Infrastructure as Code (Terraform or similar)
- Comfortable debugging complex systems across infrastructure and application layers
- Passionate about automation and improving engineering efficiency
This is a great opportunity to join a team building a platform with real-world impact, combining complex engineering challenges with a mission to contribute to a future.
Senior SRE employer: Pulse Recruit
Contact Details:
Pulse Recruit Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Senior SRE role
✨Tip Number 1
Network like a pro! Reach out to current employees on LinkedIn or attend industry meetups. A friendly chat can give you insider info and maybe even a referral!
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those related to cloud infrastructure and automation. This gives you a chance to demonstrate your hands-on experience.
✨Tip Number 3
Prepare for the technical interview by brushing up on your SRE principles and tools like Terraform and Kubernetes. Practice solving real-world problems to show you can think on your feet!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the Senior SRE role. Highlight your background in SRE, DevOps, or Platform Engineering, and don’t forget to mention your hands-on experience with cloud infrastructure and Kubernetes.
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about building reliable systems and how your experience can contribute to our mission. Be sure to mention specific projects where you've improved observability or automated processes.
Showcase Your Technical Skills: In your application, be sure to highlight your technical expertise, especially with tools like Terraform, Prometheus, and Grafana. Mention any experience you have with CI/CD pipelines and automation tooling, as these are key for the role.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity to join our team!
How to prepare for a job interview at Pulse Recruit
✨Know Your Tech Inside Out
Make sure you’re well-versed in the tech stack mentioned in the job description. Brush up on your knowledge of GCP, AWS, Kubernetes, and Terraform. Be ready to discuss how you've used these tools in past projects, especially in relation to observability and automation.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of complex engineering problems you've solved. Think about incidents you've managed, how you approached root cause analysis, and what improvements you implemented post-mortem. This will demonstrate your hands-on experience and ability to drive reliability.
✨Understand SLOs and SLIs
Familiarise yourself with Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Be prepared to discuss how you’ve defined and managed these in previous roles, and how they contribute to system reliability. This shows you understand the importance of performance metrics in an SRE role.
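If it helps to make the arithmetic concrete before an interview, here is a minimal sketch of how an error budget falls out of a request-based SLO. The function name `error_budget`, its parameters and the example figures are illustrative assumptions, not anything taken from the employer's platform or a specific monitoring tool.

```python
def error_budget(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarise a request-based SLO over one measurement window.

    slo_target is the fraction of requests that must succeed (e.g. 0.999).
    All names and numbers here are hypothetical, for illustration only.
    """
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if not 0 <= failed_requests <= total_requests:
        raise ValueError("failed_requests must be between 0 and total_requests")

    # SLI: the measured fraction of successful requests.
    sli = (total_requests - failed_requests) / total_requests

    # Error budget: how many requests may fail before the SLO is breached.
    allowed_failures = (1 - slo_target) * total_requests

    return {
        "sli": sli,
        "allowed_failures": allowed_failures,
        "remaining_failures": allowed_failures - failed_requests,
        "budget_consumed": failed_requests / allowed_failures,
        "slo_met": sli >= slo_target,
    }


if __name__ == "__main__":
    # Hypothetical window: a 99.9% SLO, one million requests, 250 failures.
    for key, value in error_budget(0.999, 1_000_000, 250).items():
        print(f"{key}: {value}")
```

Being able to talk through a calculation like this, and what a team should do once most of the budget is spent, is a practical way to show you understand how SLOs, SLIs and error budgets fit together.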
✨Emphasise Your Passion for Automation
Talk about your enthusiasm for automation and improving engineering efficiency. Share examples of how you've driven automation in infrastructure, deployments, or operations. This aligns perfectly with the role's focus on building a scalable cloud platform and will resonate with the interviewers.