Site Reliability Engineer

Site Reliability Engineer

Freelance 70000 - 90000 £ / year (est.) No home office possible
Damco Solutions

At a Glance

  • Tasks: Lead SRE practices, mentor teams, and enhance platform reliability using Azure and Terraform.
  • Company: Dynamic tech company in London focused on innovation and collaboration.
  • Benefits: Competitive contract pay, flexible working arrangements, and opportunities for professional growth.
  • Other info: Join a high-performing team and shape the future of platform engineering.
  • Why this job: Make a real impact by driving SRE culture and improving operational excellence.
  • Qualifications: Deep expertise in SRE principles, Azure, Terraform, and incident management.

The predicted salary is between 70000 - 90000 £ per year.

Job Type: Contract

Location: London, UK

Key Responsibilities:

  • Act as a Technical Authority (SME) for both Azure and Terraform, guiding teams on SRE practices and approving production changes.
  • This role is platform-focused, requiring deep expertise in SRE principles, Azure Landing Zones (Hub-and-Spoke), Terraform, DevOps enablement, monitoring/observability, and incident management.
  • Address long-term reliability and operational risks while building and mentoring SRE teams.
  • Design, implement, and operate Azure Hub-and-Spoke Landing Zone architectures.
  • Reduce operational toil through automation and platform improvements.
  • Own and evangelize SRE principles including availability, reliability, scalability, resilience, and operational maturity.
  • Define Terraform best practices, state management, drift detection, and CI/CD integration.
  • Build and maintain CI/CD foundations using GitHub Actions.
  • Design and standardize monitoring and observability across the Azure platform.
  • Lead and participate in major incident management following ITIL processes.
  • Partner with security teams to implement least-privilege access and secure-by-default architectures.
  • Enforce governance using Azure Policy and standardized platform controls.
  • Lead, mentor, and grow a high-performing SRE/platform engineering team.
  • Drive SRE culture across the organization and set technical direction, standards, and operational maturity goals.
  • Clearly explain and apply SRE concepts (SLIs, SLOs, error budgets, toil reduction, blameless postmortems).
  • Define and track platform-level SLIs/SLOs and ensure alignment with business objectives.
  • Strong hands-on knowledge of RBAC and IAM in Azure, Managed Identities, Azure Key Vault for secrets, keys, and certificates.

Site Reliability Engineer employer: Damco Solutions

As a leading employer in the tech industry, we offer an exceptional work environment for Site Reliability Engineers in London, where innovation meets collaboration. Our commitment to employee growth is evident through continuous learning opportunities and mentorship programmes, fostering a culture that values reliability, scalability, and operational excellence. With competitive benefits and a focus on work-life balance, we empower our teams to thrive while making a meaningful impact in the realm of cloud infrastructure.
Damco Solutions

Contact Detail:

Damco Solutions Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer

✨Tip Number 1

Network like a pro! Attend meetups, webinars, or tech conferences related to Site Reliability Engineering. It's a great way to connect with industry professionals and get your name out there.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving Azure and Terraform. This will give potential employers a taste of what you can do and how you approach SRE challenges.

✨Tip Number 3

Prepare for interviews by brushing up on SRE principles and practices. Be ready to discuss your experience with incident management and automation, as well as how you've reduced operational toil in past roles.

✨Tip Number 4

Don't forget to apply through our website! We love seeing candidates who are genuinely interested in joining our team. Plus, it makes the application process smoother for everyone involved.

We think you need these skills to ace Site Reliability Engineer

Azure
Terraform
Site Reliability Engineering (SRE) principles
DevOps enablement
Monitoring and observability
Incident management
Automation
CI/CD integration
GitHub Actions
ITIL processes
Security best practices
Azure Policy governance
RBAC and IAM in Azure
Managed Identities
Azure Key Vault

Some tips for your application 🫡

Show Off Your Expertise: Make sure to highlight your deep knowledge of SRE principles, Azure, and Terraform in your application. We want to see how you can be a Technical Authority and guide teams effectively!

Tailor Your Application: Don’t just send a generic CV! Tailor your application to reflect the key responsibilities mentioned in the job description. We love seeing how your experience aligns with our needs.

Be Clear and Concise: When writing your application, keep it clear and to the point. We appreciate straightforward communication, especially when it comes to complex topics like incident management and CI/CD.

Apply Through Our Website: We encourage you to apply through our website for a smoother process. It helps us keep track of applications and ensures you don’t miss out on any important updates!

How to prepare for a job interview at Damco Solutions

✨Know Your SRE Principles

Make sure you brush up on your understanding of SRE principles like SLIs, SLOs, and error budgets. Be ready to discuss how you've applied these concepts in past roles, as this will show your depth of knowledge and practical experience.

✨Showcase Your Azure Expertise

Since the role is heavily focused on Azure, be prepared to talk about your hands-on experience with Azure Landing Zones and RBAC. Bring examples of how you've implemented security measures and governance using Azure Policy, as this will demonstrate your technical authority.

✨Demonstrate Automation Skills

Highlight your experience with Terraform and CI/CD processes, especially using GitHub Actions. Discuss specific instances where you've reduced operational toil through automation, as this aligns perfectly with the job's focus on platform improvements.

✨Emphasise Team Leadership

This role involves mentoring and leading an SRE team, so be ready to share your experiences in building high-performing teams. Talk about how you've fostered a culture of reliability and operational maturity, and any strategies you've used to guide teams on SRE practices.

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>