At a Glance
- Tasks: Build and operate resilient platforms while leading incident response and driving reliability improvements.
- Company: World-leading cybersecurity tech firm using AI to combat advanced cyber threats.
- Benefits: Competitive salary, flexible work, free lunches, and generous holiday allowance.
- Other info: Dynamic environment with excellent career growth and personal development opportunities.
- Why this job: Join a diverse team and make a real impact in cybersecurity with cutting-edge technology.
- Qualifications: Strong SRE or DevOps experience, programming skills, and cloud platform expertise.
Salary: £80,000 per year
Requirements:
- Strong experience in SRE, DevOps, or infrastructure engineering
- Strong programming or scripting skills in at least one language such as Go, Python, or similar
- In-depth experience with the AWS and/or Azure cloud platforms
- Experience with observability tools such as Prometheus, Grafana, or Datadog
- Experience leading incident response and driving reliability improvements
- Proficiency with container orchestration (such as Kubernetes) and Infrastructure-as-Code tools (such as Terraform, Pulumi, or similar)
- Good understanding of networking, Linux OS, and distributed systems
- Collaborative mindset with strong communication skills
Responsibilities:
- Build and operate highly available, scalable, and resilient platforms
- Work closely with Platform Engineering and DevSecOps to drive reliability across the technology stack
- Improve observability and automate operational processes
- Help ensure systems remain secure, performant, and easy to operate
- Lead incident response activities
- Champion a culture of continuous improvement
- Collaborate with engineering teams to embed reliability into service design
- Define and evolve reliability standards
- Contribute to capacity planning and performance optimisation
- Mentor fellow engineers
- Help shape the tools, platforms, and practices that support reliable service delivery at scale
Technologies: AI, AWS, Azure, Cloud, Datadog, DevSecOps, DevOps, Grafana, Support, Kubernetes, Linux, Prometheus, Python, REST, Terraform
We are a world-leading cybersecurity technology business using AI to protect clients across the globe from advanced cyber threats. You will join a highly talented, diverse team, working from our Cambridge office two days a week and from home the rest of the time. We offer a great team atmosphere, free lunches, problem-solving sessions, and a competitive package including bonus, pension, private medical insurance, life assurance, enhanced parental leave, employee assistance, 23 days' holiday plus your birthday off, charity giving schemes, and personal training and development budgets.
Lead Site Reliability Engineer (SRE, AWS/Azure) in Milton
Employer: Sivara GmbH
Join a world-leading cybersecurity technology business in Cambridge, where innovation meets collaboration. We foster a vibrant work culture that prioritises employee well-being with benefits like free lunches, flexible working arrangements, and generous training budgets, ensuring you have the resources to grow your skills and career. With a focus on continuous improvement and a supportive team atmosphere, this is an excellent opportunity for those seeking meaningful and rewarding employment in a cutting-edge environment.