At a Glance
- Tasks: Join our team to design and maintain resilient systems while automating tasks.
- Company: We're a dynamic tech company based in London, focused on service reliability and performance.
- Benefits: Enjoy a competitive salary, collaborative culture, and opportunities for professional growth.
- Why this job: Be part of a passionate team driving innovation and improving system health with cutting-edge technology.
- Qualifications: 8+ years in SRE or DevOps; strong cloud and automation skills required.
- Other info: This is an onsite role, perfect for those who thrive in a vibrant office environment.
The predicted salary is between 48000 - 64000 £ per year.
Location: London, UK – Onsite (5 days/week)
Employment Type: Permanent
Salary: Up to £80,000 per annum (Gross)
About the Role:
We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our London-based team. This role is ideal for someone passionate about service reliability, scalability, and performance. As an SRE, you will collaborate with development and operations teams to automate infrastructure, enhance observability, and reduce manual processes (TOIL) to improve overall system health.
Key Responsibilities:
- Design, build, and maintain scalable, resilient systems and services.
- Automate routine tasks and eliminate manual effort using scripting and infrastructure-as-code.
- Collaborate with development teams to ensure best practices for deployment, monitoring, and performance tuning.
- Drive incident management processes, root cause analysis, and continuous improvement of system reliability.
- Maintain and improve observability using monitoring and logging tools.
- Optimize cloud infrastructure usage and costs.
Primary Skills & Experience:
- Strong hands-on experience with cloud platforms, especially AWS (experience with GCP or Azure is a plus).
- Deep understanding of Container Orchestration technologies such as Kubernetes and Docker.
- Proficiency in monitoring and logging tools including: Datadog, Splunk, Dynatrace, AppDynamics, Prometheus, Grafana, ELK Stack, CloudWatch, Gremlin, ThousandEyes.
- Experience with Terraform, Jenkins, GitLab CI, PostgreSQL, Redis, and Kong API Gateway.
- Solid understanding of networking, security best practices, and infrastructure automation.
- Exposure to AWS ECS, Atlas, and internal tooling integrations.
- Diagramming and documentation skills using Lucidchart and PlantUML.
Secondary Skills:
- Familiarity with ServiceNow (SNOW) and JIRA for incident and task tracking.
- Competency in Shell scripting, Linux system administration, Bitbucket, and Akamai.
- Experience working within DevOps pipelines and CI/CD frameworks.
Qualifications:
- Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
- 8+ years of relevant experience in SRE, DevOps, or Infrastructure Engineering roles.
Senior Site Reliability Engineer employer: Cipher7
Contact Detail:
Cipher7 Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior Site Reliability Engineer
✨Tip Number 1
Familiarise yourself with the specific tools and technologies mentioned in the job description, such as AWS, Kubernetes, and Terraform. Having hands-on experience or projects showcasing these skills can significantly boost your chances.
✨Tip Number 2
Network with current or former employees of StudySmarter on platforms like LinkedIn. Engaging with them can provide insights into the company culture and the role, which can be invaluable during interviews.
✨Tip Number 3
Prepare to discuss real-world scenarios where you've improved system reliability or automated processes. Be ready to share specific examples that demonstrate your problem-solving skills and technical expertise.
✨Tip Number 4
Stay updated on the latest trends in Site Reliability Engineering and cloud technologies. Being knowledgeable about recent developments can help you stand out during discussions and show your passion for the field.
We think you need these skills to ace Senior Site Reliability Engineer
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with cloud platforms, container orchestration, and automation tools. Use specific examples that demonstrate your skills in service reliability and system performance.
Craft a Compelling Cover Letter: In your cover letter, express your passion for site reliability engineering. Mention how your background aligns with the responsibilities of the role, particularly in automating infrastructure and improving system health.
Showcase Relevant Projects: If you have worked on projects involving AWS, Kubernetes, or any of the monitoring tools mentioned, be sure to include these in your application. Describe your role and the impact of your contributions.
Highlight Continuous Learning: Mention any relevant certifications or courses you've completed, especially those related to cloud technologies or DevOps practices. This shows your commitment to staying updated in the field.
How to prepare for a job interview at Cipher7
✨Showcase Your Technical Skills
Be prepared to discuss your hands-on experience with cloud platforms, especially AWS. Highlight specific projects where you've implemented container orchestration technologies like Kubernetes and Docker, as well as your proficiency with monitoring tools such as Datadog or Prometheus.
✨Demonstrate Problem-Solving Abilities
Expect questions that assess your incident management skills and your approach to root cause analysis. Share examples of how you've driven continuous improvement in system reliability and how you handle unexpected outages.
✨Emphasise Collaboration
As an SRE, collaboration is key. Be ready to discuss how you've worked with development teams to implement best practices for deployment and performance tuning. Mention any tools you've used for task tracking, like JIRA or ServiceNow.
✨Prepare Questions About the Company’s Infrastructure
Show your interest in the role by preparing insightful questions about the company's current infrastructure and challenges they face. This demonstrates your enthusiasm for the position and your proactive mindset towards improving their systems.