At a Glance
- Tasks: Enhance system performance and reliability while automating infrastructure tasks.
- Company: Join a leading global SaaS company transforming student community management.
- Benefits: Enjoy fully remote work, excellent pension, life insurance, and generous holiday allowance.
- Why this job: Be part of a dynamic team driving innovation in cloud technology for millions of students.
- Qualifications: Experience in SRE/DevOps with strong skills in C#, Java, and cloud platforms like AWS or Azure.
- Other info: Opportunity to work in a supportive environment focused on learning and development.
Senior Site Reliability Engineer / ex-software engineer
While professional experience and qualifications are key for this role, check that you also have the preferred soft skills before applying.
Fully remote working for candidates based in the UK – Salary £85k to £90k + Benefits
We are looking for a Senior Site Reliability Engineer / DevOps Engineer who has come from a software development background and who still has strong skills in C#, Java or a similar OO development language, combined with strong knowledge of DevOps tools like Kubernetes and/or Docker and the Azure or AWS cloud platforms. The successful candidate will join our client's growing global Cloud Infrastructure team supporting their SaaS products.
Our client, a global digital SaaS software company, has a fantastic fully remote opportunity for an experienced Senior Site Reliability Engineer / DevOps Engineer to join their UK Cloud Infrastructure team.
Senior Site Reliability Engineers at this company are responsible for keeping the SaaS products running properly. Using concepts of software and systems engineering, they work to improve the reliability of all cloud systems while keeping levels of manual work low. SREs are expected to be experienced in software engineering principles, operational discipline, and automation.
The SRE team works on a fully remote basis, in conjunction with the company's US and Australian teams. The company is a market leader in student community management software, and its unique SaaS platform is an essential part of life for millions of university students across the globe.
In this role, you will apply your Software Engineering experience to enhance system performance and reliability, and build internal systems and capabilities that eliminate manual work through automation. You'll be joining our Platforms teams, with globally dispersed Site Reliability and Platform Engineers working in a "follow the sun" model to operate our products on a multi-region cloud platform.
Role Responsibilities:
* Provide technical leadership and mentoring within the team through knowledge sharing sessions, pair programming, code reviews and solution design
* Identify and implement technical solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks.
* Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents
* Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth
* Conduct performance tests to identify and remediate bottlenecks
* Develop and maintain platform solutions, and automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code.
* Monitor, review and tune databases to ensure high availability and performance
* Collaborate with product engineering teams to design/build fit-for-purpose and observable software
Required Skills and Experience:
* Proven experience in an SRE / DevOps / Platform Engineering role, plus previous experience in a Software Engineering role using .NET and C#, Java or a similar OO development language.
* Proficiency in C#, Java or another OO development language, alongside knowledge of scripting languages like Bash, Python or PowerShell.
* Production experience operating containerization technologies – ideally with Kubernetes and/or Docker.
* Proficiency with one or more public cloud providers such as Azure, AWS or GCP
* Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation.
* Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.
* Proven track record of maintaining highly-available and performant production environments.
* Ability to identify and implement effective mitigation strategies and operational playbooks.
Useful / Bonus Skills to have:
* Experience in CI/CD tooling: Azure DevOps/GitHub Actions, Octopus Deploy
* Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus
* Experience in database management/performance tuning, particularly MSSQL.
Employee benefits:
* Opportunity to be part of a well-established, high-performance SaaS company with a 30+ year history.
* Excellent company pension scheme and life insurance.
* Excellent holiday allowance.
* A supportive team environment with emphasis on learning and development opportunities
* Working with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.
This Senior Site Reliability Engineer role is with a market-leading global software company and is part of a large programme of change and improvement in its Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider.
Please apply with your CV to find out more
Senior Site Reliability Engineer employer: Stratospherec Ltd
Contact Details:
Stratospherec Ltd Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Senior Site Reliability Engineer role
✨Tip Number 1
Familiarise yourself with the specific tools and technologies mentioned in the job description, such as Kubernetes, Docker, and Azure or AWS. Having hands-on experience with these platforms will not only boost your confidence but also demonstrate your capability to potential employers.
✨Tip Number 2
Engage with online communities or forums related to Site Reliability Engineering and DevOps. Networking with professionals in the field can provide insights into the role and may even lead to referrals or recommendations for job openings.
✨Tip Number 3
Prepare to discuss your previous software engineering experience in detail, especially how it relates to reliability and automation. Be ready to share specific examples of projects where you improved system performance or reduced manual work through automation.
✨Tip Number 4
Consider obtaining relevant certifications, such as those in cloud platforms or DevOps practices. These credentials can enhance your profile and show your commitment to professional development, making you a more attractive candidate for the role.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience in software engineering, particularly with C# or Java. Emphasise your familiarity with DevOps tools like Kubernetes and Docker, as well as your experience with cloud platforms such as Azure or AWS.
Showcase Relevant Experience: In your application, provide specific examples of your past work that demonstrate your ability to improve system reliability and performance. Mention any projects where you implemented automation or Infrastructure as Code.
Highlight Soft Skills: Since soft skills are important for this role, include examples of your teamwork, mentoring, and communication abilities. Describe situations where you led knowledge-sharing sessions or collaborated with cross-functional teams.
Craft a Strong Cover Letter: Write a cover letter that connects your background in software engineering with the responsibilities of a Senior Site Reliability Engineer. Explain why you're excited about the opportunity to work in a fully remote environment and how you can contribute to the company's goals.
How to prepare for a job interview at Stratospherec Ltd
✨Showcase Your Technical Expertise
Be prepared to discuss your experience with C#, Java, and any other OO languages you've worked with. Highlight specific projects where you implemented DevOps tools like Kubernetes or Docker, and be ready to explain how you improved system reliability.
✨Demonstrate Problem-Solving Skills
Expect questions that assess your ability to identify and resolve issues in cloud infrastructure. Prepare examples of past challenges you've faced and the strategies you employed to overcome them, particularly in high-availability environments.
✨Emphasise Collaboration and Leadership
Since this role involves mentoring and working with global teams, share experiences where you led knowledge-sharing sessions or collaborated on projects. Discuss how you foster teamwork and support others in achieving their goals.
✨Prepare for Scenario-Based Questions
You might be asked to solve hypothetical problems related to system performance or incident response. Practice articulating your thought process clearly and logically, demonstrating your approach to automation and operational playbooks.