Senior Site Reliability Engineer

London, Full-Time, £90,000 / year, fully remote for UK-based candidates

At a Glance

  • Tasks: Enhance system performance and reliability while automating infrastructure tasks.
  • Company: Join a leading global SaaS company transforming student community management.
  • Benefits: Enjoy fully remote work, excellent pension, life insurance, and generous holiday allowance.
  • Why this job: Be part of a dynamic team driving innovation in cloud technology for millions of students.
  • Qualifications: Experience in SRE/DevOps with strong skills in C#, Java, and cloud platforms like AWS or Azure.
  • Other info: Opportunity to work in a supportive environment focused on learning and development.

Senior DevOps Engineer / Senior Site Reliability Engineer

Learn more about the general tasks related to this opportunity below, as well as required skills.

Fully Remote working for candidates based in the UK – Salary £80k to £100k (depending on experience) + Benefits

We are looking for a Senior DevOps Engineer who combines strong C# coding knowledge with strong knowledge of DevOps tools such as Kubernetes (EKS or, ideally, AKS) and the Azure cloud platform. You will also have experience with monitoring tools such as DataDog, Grafana and Prometheus, and will join a growing global Cloud Infrastructure team supporting SaaS products.

Our client, a global digital SaaS software company, has a fantastic fully remote opportunity for an experienced Senior DevOps Engineer to join its UK Cloud Infrastructure team.

Site Reliability Engineers at this company are responsible for keeping the SaaS products running properly. Using concepts from software and systems engineering, they work to improve the reliability of all cloud systems while keeping levels of manual work low. DevOps Engineers are expected to be experienced in software engineering principles, operational discipline, and automation.

The Cloud and DevOps team works on a fully remote basis, in conjunction with the company’s US and Australian teams. The company is a market leader in student community management software, and its unique SaaS platform is essential in the lives of millions of university students across the globe.

In this role, you will apply your software engineering experience to enhance system performance and reliability, and build internal systems and capabilities that eliminate manual work through automation. You’ll be joining our Platforms teams, with globally dispersed Site Reliability and Platform Engineers working in a “follow the sun” model to operate our products on a multi-region cloud platform.

Role Responsibilities:

* Provide technical leadership and mentoring within the team through knowledge sharing sessions, pair programming, code reviews and solution design

* Identify and implement technical solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks.

* Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents

* Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth

* Conduct performance tests to identify and remediate bottlenecks

* Develop and maintain platform solutions, and automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code.

* Monitor, review and tune databases to ensure high availability and performance

* Collaborate with product engineering teams to design/build fit-for-purpose and observable software

Required Skills and Experience:

* Proven experience in a senior DevOps / Site Reliability Engineering role, with strong development experience in C# or a similar OO language.

* Experience of supporting .NET applications as a DevOps Engineer is a big bonus in this role.

* Production experience operating containerization technologies – ideally with Kubernetes and/or Docker. Strong preference for AKS or EKS experience as well.

* Proficiency with one or more public cloud providers such as Azure, AWS or GCP

* Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation.

* Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.

* Proven track record of maintaining highly-available and performant production environments.

* Ability to identify and implement effective mitigation strategies and operational playbooks.

Useful / Bonus Skills to have:

* Experience in CI/CD tooling: Azure DevOps/GitHub Actions, Octopus Deploy

* Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus

* Experience in database management/performance tuning, particularly MSSQL.

Employee benefits:

* Opportunity to be a part of a 30+ year well-established, high-performance SaaS company.

* Excellent company pension scheme and life insurance.

* Excellent holiday allowance.

* A supportive team environment with emphasis on learning and development opportunities

* Working with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.

This Senior Site Reliability Engineer role is with a market-leading global software company and is part of a large program of change and improvement in its Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, this would be a tremendous career opportunity to consider.

Please apply with your CV to find out more.

Senior Site Reliability Engineer employer: Stratospherec Ltd

Join a leading global digital SaaS software company that champions innovation and continuous improvement in the student community management sector. With a fully remote work culture, competitive salary, and a strong emphasis on employee growth through learning opportunities, this role as a Senior Site Reliability Engineer offers a unique chance to collaborate with passionate professionals while enhancing system performance and reliability. Enjoy excellent benefits including a robust pension scheme, generous holiday allowance, and the support of a high-performing team dedicated to your success.

Contact Detail:

Stratospherec Ltd Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Senior Site Reliability Engineer role

Tip Number 1

Familiarise yourself with the specific tools and technologies mentioned in the job description, such as Kubernetes, Docker, and Azure or AWS. Having hands-on experience with these platforms will not only boost your confidence but also demonstrate your capability to potential employers.

Tip Number 2

Engage with online communities or forums related to Site Reliability Engineering and DevOps. Networking with professionals in the field can provide insights into the role and may even lead to referrals or recommendations for job openings.

Tip Number 3

Prepare to discuss your previous software engineering experience in detail, especially how it relates to reliability and automation. Be ready to share specific examples of projects where you improved system performance or reduced manual work through automation.

Tip Number 4

Consider obtaining relevant certifications, such as those in cloud platforms or DevOps practices. These credentials can enhance your profile and show your commitment to professional development, making you a more attractive candidate for the role.

We think you need these skills to ace the Senior Site Reliability Engineer role

Proficiency in C# or Java or similar OO development language
Scripting languages (Bash, Python, PowerShell)
Experience with Kubernetes and/or Docker
Knowledge of Azure, AWS or GCP
Familiarity with Infrastructure as Code (IaC) tools like Terraform, Ansible, or CloudFormation
Experience with monitoring and observability tools (DataDog, Prometheus, Grafana)
Ability to maintain highly-available and performant production environments
Technical leadership and mentoring skills
Problem-solving and analytical skills
Experience in CI/CD tooling (Azure DevOps, GitHub Actions, Octopus Deploy)
Database management and performance tuning skills, particularly MSSQL
Strong communication and collaboration skills
Operational discipline and automation expertise

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience in software engineering, particularly with C# or Java. Emphasise your familiarity with DevOps tools like Kubernetes and Docker, as well as your experience with cloud platforms such as Azure or AWS.

Showcase Relevant Experience: In your application, provide specific examples of your past work that demonstrate your ability to improve system reliability and performance. Mention any projects where you implemented automation or Infrastructure as Code.

Highlight Soft Skills: Since soft skills are important for this role, include examples of your teamwork, mentoring, and communication abilities. Describe situations where you led knowledge-sharing sessions or collaborated with cross-functional teams.

Craft a Strong Cover Letter: Write a cover letter that connects your background in software engineering with the responsibilities of a Senior Site Reliability Engineer. Explain why you're excited about the opportunity to work in a fully remote environment and how you can contribute to the company's goals.

How to prepare for a job interview at Stratospherec Ltd

Showcase Your Technical Expertise

Be prepared to discuss your experience with C#, Java, and any other OO languages you've worked with. Highlight specific projects where you implemented DevOps tools like Kubernetes or Docker, and be ready to explain how you improved system reliability.

Demonstrate Problem-Solving Skills

Expect questions that assess your ability to identify and resolve issues in cloud infrastructure. Prepare examples of past challenges you've faced and the strategies you employed to overcome them, particularly in high-availability environments.

Emphasise Collaboration and Leadership

Since this role involves mentoring and working with global teams, share experiences where you led knowledge-sharing sessions or collaborated on projects. Discuss how you foster teamwork and support others in achieving their goals.

Prepare for Scenario-Based Questions

You might be asked to solve hypothetical problems related to system performance or incident response. Practice articulating your thought process clearly and logically, demonstrating your approach to automation and operational playbooks.
