Site Reliability Engineer (London Area)
Site Reliability Engineer (London Area)

Site Reliability Engineer (London Area)

London Full-Time No home office possible
C

Overview

This role is critical to the success of the digital services strategy, cloud infrastructure, and overall data platform. Sitting within the broader Digital Client technology team, this hire will focus on improving release and support processes, and enhancing infrastructure performance, supportability, scalability, and cost efficiency.

This person will also serve as the key contact for escalations, major incidents, and release support during European business hours.

Key Responsibilities

  • Act as L3 escalation contact for critical production issues and major incidents
  • Lead or support release cycles and ensure production readiness
  • Collaborate with stakeholders, including Front Office and executive teams
  • Drive remediation of audit findings and ensure alignment with enterprise security standards
  • Ensure operational readiness of digital products (resiliency, observability, etc.)
  • Collaborate with quality engineering to meet non-functional requirements
  • Maintain compliance with enterprise tech strategy and protection mandates

Skills & Experience

  • Strong experience in Microsoft Azure
  • Experience developing automation and orchestration using Azure Python SDK, Terraform, and GitHub Runners
  • Expertise in automating serverless PaaS solutions (Azure App Services, Databricks, Cosmos DB, SQL, Data Lake, HDInsight)
  • Proficient in writing Azure Resource Manager (ARM) templates and Terraform scripts
  • Understanding of cloud security (RBAC, audit logging, authentication)
  • Experience troubleshooting network and application access to Azure resources (NSGs, routing)
  • Familiar with deploying databases and infrastructure via CI/CD pipelines
  • Knowledge of Azure RBAC, Active Directory, and Ping integration
  • Solid grasp of cloud provisioning, disaster recovery, performance monitoring, and business continuity

Soft Skills

  • Strong leadership and communication skills
  • Critical thinking and problem-solving under pressure
  • Self-motivated, works well independently and as part of a team
  • Familiarity with Agile/Scrum environments
C

Contact Detail:

Caspian One Recruiting Team

Site Reliability Engineer (London Area)
Caspian One
C
  • Site Reliability Engineer (London Area)

    London
    Full-Time

    Application deadline: 2027-06-11

  • C

    Caspian One

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>