At a Glance
- Tasks: Support and enhance cloud infrastructure while resolving critical production issues.
- Company: Join a forward-thinking digital services team focused on innovation and efficiency.
- Benefits: Enjoy remote work flexibility and opportunities for professional growth.
- Why this job: Be part of a dynamic team driving impactful digital solutions in a collaborative environment.
- Qualifications: Strong Azure experience and automation skills are essential; familiarity with Agile is a plus.
- Other info: This role is key during European business hours, ensuring operational excellence.
The predicted salary is between 30000 - 50000 £ per year.
This role is critical to the success of the digital services strategy, cloud infrastructure, and overall data platform. Sitting within the broader Digital Client technology team, this hire will focus on improving release and support processes, and enhancing infrastructure performance, supportability, scalability, and cost efficiency.
This person will also serve as the key contact for escalations, major incidents, and release support during European business hours.
Key Responsibilities- Act as L3 escalation contact for critical production issues and major incidents
- Lead or support release cycles and ensure production readiness
- Collaborate with stakeholders, including Front Office and executive teams
- Drive remediation of audit findings and ensure alignment with enterprise security standards
- Ensure operational readiness of digital products (resiliency, observability, etc.)
- Collaborate with quality engineering to meet non-functional requirements
- Maintain compliance with enterprise tech strategy and protection mandates
- Strong experience in Microsoft Azure
- Experience developing automation and orchestration using Azure Python SDK, Terraform, and GitHub Runners
- Expertise in automating serverless PaaS solutions (Azure App Services, Databricks, Cosmos DB, SQL, Data Lake, HDInsight)
- Proficient in writing Azure Resource Manager (ARM) templates and Terraform scripts
- Understanding of cloud security (RBAC, audit logging, authentication)
- Experience troubleshooting network and application access to Azure resources (NSGs, routing)
- Familiar with deploying databases and infrastructure via CI/CD pipelines
- Knowledge of Azure RBAC, Active Directory, and Ping integration
- Solid grasp of cloud provisioning, disaster recovery, performance monitoring, and business continuity
- Strong leadership and communication skills
- Critical thinking and problem-solving under pressure
- Self-motivated, works well independently and as part of a team
- Familiarity with Agile/Scrum environments
Contact Detail:
Caspian One Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (Home-based)
✨Tip Number 1
Familiarise yourself with Microsoft Azure and its services, especially those mentioned in the job description. Being able to discuss specific Azure tools and your experience with them will show that you're well-prepared for the role.
✨Tip Number 2
Brush up on your automation skills, particularly with Azure Python SDK, Terraform, and GitHub Runners. Consider working on a small project or two that showcases your ability to automate processes, as this will be a key part of the job.
✨Tip Number 3
Prepare to discuss your experience with incident management and how you've handled critical production issues in the past. Having concrete examples ready will demonstrate your capability to act as an L3 escalation contact.
✨Tip Number 4
Since collaboration is key in this role, think about how you can effectively communicate with various stakeholders. Be ready to share examples of how you've worked with different teams, especially in Agile/Scrum environments.
We think you need these skills to ace Site Reliability Engineer (Home-based)
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with Microsoft Azure and automation tools like Terraform and GitHub Runners. Use specific examples that demonstrate your expertise in cloud infrastructure and serverless PaaS solutions.
Craft a Compelling Cover Letter: In your cover letter, explain why you are passionate about Site Reliability Engineering and how your skills align with the responsibilities outlined in the job description. Mention your experience with critical production issues and your ability to collaborate with various stakeholders.
Showcase Soft Skills: Don't forget to mention your soft skills such as leadership, communication, and problem-solving abilities. Provide examples of how you've successfully worked under pressure or led a team in an Agile/Scrum environment.
Proofread Your Application: Before submitting your application, carefully proofread your CV and cover letter for any spelling or grammatical errors. A polished application reflects your attention to detail and professionalism.
How to prepare for a job interview at Caspian One
✨Showcase Your Technical Skills
Be prepared to discuss your experience with Microsoft Azure and the specific tools mentioned in the job description, such as Terraform and the Azure Python SDK. Highlight any projects where you've successfully implemented automation or orchestration.
✨Demonstrate Problem-Solving Abilities
Expect to face scenario-based questions that test your critical thinking and problem-solving skills. Prepare examples from your past experiences where you effectively resolved production issues or major incidents.
✨Communicate Clearly and Confidently
Strong communication skills are essential for this role. Practice articulating your thoughts clearly, especially when discussing complex technical concepts. Be ready to explain how you would collaborate with stakeholders and lead teams.
✨Understand the Company’s Digital Strategy
Research the company's digital services strategy and be ready to discuss how your skills align with their goals. Showing that you understand their mission and how you can contribute will set you apart from other candidates.