At a Glance
- Tasks: Lead Site Reliability Engineering and drive operational excellence across large-scale platforms.
- Company: Dynamic tech company offering a hybrid working model in Reading.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Join a forward-thinking team and elevate your career in a supportive environment.
- Why this job: Shape the future of SRE while collaborating with diverse teams and technologies.
- Qualifications: Expertise in Kubernetes, SRE principles, and experience in multi-cloud environments.
The predicted salary is between 70000 - 90000 £ per year.
Salary: £70,000 - 90,000 per year
Requirements
- Deep expertise in Kubernetes and/or OpenShift
- Experience working in multi-cloud or hybrid cloud environments
- Strong understanding of SRE principles, including SLOs, SLAs, error budgets, and reliability engineering
- Hands-on experience with observability tooling such as Prometheus, Grafana, OpenTelemetry, Loki, and Tempo
- Strong knowledge of Infrastructure as Code and GitOps tools such as Helm, Kustomize, ArgoCD, and Tekton
- Experience with CI/CD pipelines and automation
- Proven ability to operate as a technical leader in complex, multi-team environments
- Must be eligible for Security Clearance
Responsibilities
- Act as the technical authority for Site Reliability Engineering across complex, large-scale platforms
- Drive reliability, availability, and operational excellence across multi-team and multi-vendor environments
- Define and implement SRE strategy, standards, and best practices, including SLAs, SLOs, and error budgets
- Embed reliability principles into platform and service design from the outset
- Lead reliability reviews, operational readiness activities, and toil reduction initiatives
- Drive automation across monitoring, incident response, and remediation
- Act as the technical escalation point for major incidents and high-risk releases
- Lead blameless post-incident reviews and ensure continuous improvement
- Establish observability and capacity management practices using modern tooling
- Identify and eliminate systemic reliability risks and operational inefficiencies
- Collaborate with engineering, platform, security, and operations teams across multiple vendors
- Provide coaching and mentorship to engineers to raise SRE capability across the organisation
Technologies
- ArgoCD
- CI/CD
- Cloud
- GitOps
- Grafana
- Helm
- Kubernetes
- OpenTelemetry
- OpenShift
- Prometheus
- Security
- DevOps
- Embedded
We are hiring an experienced SRE Technical Lead for a senior, client-facing leadership role based in Reading with a hybrid working model across home, office, and client sites in the UK. We offer the opportunity to shape Site Reliability Engineering across complex, large-scale platforms, combining hands-on technical delivery with strategic leadership. In this role, you will help embed SRE practices across the full service lifecycle, improve reliability and operational excellence, and work closely with multi-team and multi-vendor environments.
SRE Technical Lead employer: Sivara GmbH
As an SRE Technical Lead at our company, you will thrive in a dynamic and inclusive work culture that prioritises innovation and collaboration. We offer competitive salaries, flexible hybrid working arrangements, and ample opportunities for professional growth through mentorship and leadership development. Join us in Reading to make a meaningful impact on large-scale platforms while enjoying the benefits of a supportive environment that values your expertise and contributions.