Site Reliability Engineer
Site Reliability Engineer

Site Reliability Engineer

Slough Full-Time 48000 - 72000 £ / year (est.) No home office possible
C

At a Glance

  • Tasks: Join our team to enhance cloud infrastructure and support digital services.
  • Company: Be part of a leading digital client technology team driving innovation.
  • Benefits: Enjoy flexible working hours and opportunities for professional growth.
  • Why this job: Make a real impact on digital products while collaborating with top stakeholders.
  • Qualifications: Strong Azure experience and automation skills are essential.
  • Other info: Ideal for self-motivated individuals who thrive in fast-paced environments.

The predicted salary is between 48000 - 72000 £ per year.

This role is critical to the success of the digital services strategy, cloud infrastructure, and overall data platform. Sitting within the broader Digital Client technology team, this hire will focus on improving release and support processes, and enhancing infrastructure performance, supportability, scalability, and cost efficiency. This person will also serve as the key contact for escalations, major incidents, and release support during European business hours.

Key Responsibilities

  • Act as L3 escalation contact for critical production issues and major incidents
  • Lead or support release cycles and ensure production readiness
  • Collaborate with stakeholders, including Front Office and executive teams
  • Drive remediation of audit findings and ensure alignment with enterprise security standards
  • Ensure operational readiness of digital products (resiliency, observability, etc.)
  • Collaborate with quality engineering to meet non-functional requirements
  • Maintain compliance with enterprise tech strategy and protection mandates

Skills & Experience

  • Strong experience in Microsoft Azure
  • Experience developing automation and orchestration using Azure Python SDK, Terraform, and GitHub Runners
  • Expertise in automating serverless PaaS solutions (Azure App Services, Databricks, Cosmos DB, SQL, Data Lake, HDInsight)
  • Proficient in writing Azure Resource Manager (ARM) templates and Terraform scripts
  • Understanding of cloud security (RBAC, audit logging, authentication)
  • Experience troubleshooting network and application access to Azure resources (NSGs, routing)
  • Familiar with deploying databases and infrastructure via CI/CD pipelines
  • Knowledge of Azure RBAC, Active Directory, and Ping integration
  • Solid grasp of cloud provisioning, disaster recovery, performance monitoring, and business continuity

Soft Skills

  • Strong leadership and communication skills
  • Critical thinking and problem-solving under pressure
  • Self-motivated, works well independently and as part of a team
  • Familiarity with Agile/Scrum environments

Site Reliability Engineer employer: Caspian One

As a Site Reliability Engineer within our dynamic Digital Client technology team, you will thrive in an innovative work culture that prioritises collaboration and continuous improvement. We offer competitive benefits, including professional development opportunities and a supportive environment that encourages growth and learning. Located in a vibrant city, our company not only values your contributions but also provides a unique chance to impact our digital services strategy while enjoying a fulfilling work-life balance.
C

Contact Detail:

Caspian One Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer

✨Tip Number 1

Familiarise yourself with Microsoft Azure and its services, especially those mentioned in the job description. Having hands-on experience or projects that showcase your skills in Azure will make you stand out during discussions.

✨Tip Number 2

Prepare to discuss your experience with automation and orchestration tools like Terraform and the Azure Python SDK. Be ready to share specific examples of how you've implemented these technologies to improve processes or solve problems.

✨Tip Number 3

Brush up on your knowledge of cloud security practices, particularly around RBAC and audit logging. Being able to articulate how you've ensured security compliance in past roles will demonstrate your understanding of critical operational requirements.

✨Tip Number 4

Showcase your soft skills by preparing examples of how you've led teams or projects under pressure. Highlighting your leadership and communication abilities will be crucial, especially since this role involves collaboration with various stakeholders.

We think you need these skills to ace Site Reliability Engineer

Microsoft Azure
Automation and Orchestration
Azure Python SDK
Terraform
GitHub Runners
Serverless PaaS Solutions
Azure App Services
Databricks
Cosmos DB
SQL
Data Lake
HDInsight
Azure Resource Manager (ARM) Templates
Cloud Security
RBAC
Audit Logging
Authentication
Network Troubleshooting
Application Access
CI/CD Pipelines
Active Directory
Ping Integration
Cloud Provisioning
Disaster Recovery
Performance Monitoring
Business Continuity
Leadership Skills
Communication Skills
Critical Thinking
Problem-Solving
Self-Motivated
Team Collaboration
Agile/Scrum Familiarity

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in Microsoft Azure and automation tools like Terraform and Python SDK. Use specific examples that demonstrate your skills in cloud infrastructure and incident management.

Craft a Compelling Cover Letter: In your cover letter, express your enthusiasm for the Site Reliability Engineer role. Mention how your background aligns with the key responsibilities, such as leading release cycles and collaborating with stakeholders.

Showcase Soft Skills: Don't forget to mention your soft skills, especially leadership and communication abilities. Provide examples of how you've successfully worked under pressure or led a team in an Agile/Scrum environment.

Proofread Your Application: Before submitting, carefully proofread your application materials. Check for any spelling or grammatical errors, and ensure that all information is clear and concise. A polished application reflects your attention to detail.

How to prepare for a job interview at Caspian One

✨Showcase Your Technical Skills

Be prepared to discuss your experience with Microsoft Azure and the specific tools mentioned in the job description, such as Terraform and Python SDK. Highlight any projects where you've successfully implemented automation or orchestration solutions.

✨Demonstrate Problem-Solving Abilities

Expect scenario-based questions that test your critical thinking and problem-solving skills. Prepare examples of how you've handled major incidents or production issues in the past, focusing on your approach and the outcomes.

✨Emphasise Collaboration and Communication

Since this role involves working with various stakeholders, be ready to discuss your experience in collaborative environments. Share examples of how you've effectively communicated technical information to non-technical teams or executives.

✨Understand the Company’s Digital Strategy

Research the company's digital services strategy and cloud infrastructure. Being knowledgeable about their goals and challenges will allow you to tailor your responses and demonstrate your genuine interest in contributing to their success.

Site Reliability Engineer
Caspian One
C
  • Site Reliability Engineer

    Slough
    Full-Time
    48000 - 72000 £ / year (est.)

    Application deadline: 2027-06-06

  • C

    Caspian One

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>