Senior Site Reliability Engineer

Tes

Full-Time | Up to £90,000 / year (est.) | Hybrid, 3 days each week in the office

At a Glance

Tasks: Design and implement best SRE practices to enhance system reliability and performance.
Company: Join a leading tech company with a focus on innovation and collaboration.
Benefits: Enjoy 25 days annual leave, state-of-the-art offices, and extensive learning opportunities.
Other info: Hybrid working model with excellent career growth potential.
Why this job: Make a real impact by optimising cloud infrastructure and ensuring system security.
Qualifications: Experience in SRE/DevOps, strong problem-solving skills, and knowledge of cloud platforms required.

The predicted salary is up to £90,000 per year.

Location: Sheffield, London, Talbot Green or Yeovil

Working Pattern: Hybrid, includes 3 days each week in the office

Contract Type: Full time, permanent

Salary: Up to £90,000 per annum

Role Overview

As a Senior Site Reliability Engineer, you will be pivotal in designing and implementing best SRE practices while fostering a culture of continuous improvement and optimization. You will collaborate closely with development and operations teams to improve platform stability and performance, ensuring that our systems are reliable, secure, and scalable.

Key Responsibilities

Infrastructure Management: Manage and scale cloud-based infrastructure (e.g., AWS, Azure, GCP). Apply Infrastructure as Code (IaC) principles for provisioning and configuration management.
Security and Compliance: Collaborate with the security team to implement best practices for system and data security. Ensure systems comply with relevant industry standards and regulations.
Monitoring and Performance: Set up and maintain monitoring and alerting systems for early issue detection and resolution. Continuously optimize system performance and resource usage.
Documentation: Create and maintain thorough documentation for SRE/platform processes, tools, and practices.

Experience

Proven experience in an SRE/DevOps/Platform role, with a strong background in both software development and operations.
Knowledge of CI/CD tools (e.g., Jenkins, GitLab CI/CD, Travis CI).
Proficiency in scripting and automation (e.g., Bash, Python, Ansible).
Strong experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
Strong hands-on experience of at least one major public cloud platform (e.g., AWS, Azure, GCP).
Strong problem-solving and troubleshooting abilities in time-bound situations (e.g., major incidents).
Clear communication and incident management experience.
Demonstrable strong hands-on experience with Terraform.
Knowledge of microservices architecture.
Familiarity with security best practices and tools.
Demonstrable experience of monitoring/observability tools preferred (Grafana, Prometheus, PagerDuty, uptime).

Knowledge

Cloud Platforms: Strong knowledge of AWS, Azure, or GCP, including cloud architecture, services, and security models.
Containerization & Orchestration: In-depth understanding of Docker and Kubernetes for deploying and managing containerized applications.
Infrastructure as Code (IaC): Knowledge of IaC frameworks, particularly Terraform, to manage cloud infrastructure via code.
Microservices Architecture: Familiarity with microservices design patterns and deployment strategies in a cloud-native environment.
Monitoring & Observability: Understanding of monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK) to ensure system performance and issue tracking.

Skills

CI/CD Tools: Hands-on experience with Jenkins, GitLab CI/CD, Travis CI, or similar tools for building CI/CD pipelines.
Scripting & Automation: Proficiency in scripting languages like Bash and Python, along with automation tools such as Ansible for managing configurations and deployments.
Containerization & Orchestration: Practical skills in deploying and managing containers using Docker and orchestrating workloads using Kubernetes.
Cloud Platform Management: Expertise in managing and scaling cloud environments on AWS, Azure, or GCP, leveraging services for compute, storage, networking, and security.
Infrastructure as Code (IaC): Skilled in using Terraform to automate provisioning and management of cloud infrastructure.
Troubleshooting & Problem Solving: Strong analytical skills for identifying and resolving complex system issues, especially in production environments.
Collaboration & Communication: Clear communication and an excellent ability to work under pressure (e.g., in a major incident).

Qualifications

Certifications (Preferred): AWS Certified DevOps Engineer, CKA (Certified Kubernetes Administrator), or other relevant credentials.

Benefits

25 days annual leave rising to 30
State-of-the-art offices
Access to a range of benefits via My Benefits World
Free eye care cover
Life Assurance
Cycle to Work Scheme
EAP (Employee assistance programme)
Quarterly Team Socials
Access to an extensive Learning and Development menu

Senior Site Reliability Engineer employer: Tes

As a Senior Site Reliability Engineer, you will thrive in a dynamic and innovative environment that prioritises employee growth and collaboration. With state-of-the-art offices in Sheffield, London, Talbot Green, or Yeovil, the company offers a hybrid working pattern, generous annual leave, and access to extensive learning and development opportunities, ensuring you can advance your career while enjoying a supportive work culture. Join us to be part of a team that values continuous improvement and embraces cutting-edge technology, making it an excellent employer for those seeking meaningful and rewarding employment.

Contact Details:

Tes Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land the Senior Site Reliability Engineer role

✨Tip Number 1

Network like a pro! Reach out to folks in the industry on LinkedIn or at local meetups. You never know who might have the inside scoop on job openings or can put in a good word for you.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions. This gives potential employers a taste of what you can do beyond just a CV.

✨Tip Number 3

Prepare for interviews by practising common SRE scenarios and problem-solving questions. Mock interviews with friends or mentors can help you feel more confident and ready to impress.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!

We think you need these skills to ace the Senior Site Reliability Engineer role

Cloud Infrastructure Management

Infrastructure as Code (IaC)

AWS

Azure

GCP

CI/CD Tools

Scripting and Automation

Docker

Kubernetes

Terraform

Monitoring and Observability

Problem-Solving Skills

Collaboration Skills

Communication Skills

Security Best Practices

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Senior Site Reliability Engineer role. Highlight your experience with cloud platforms, IaC, and any relevant tools like Terraform or Kubernetes. We want to see how your skills match what we're looking for!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about SRE and how your background makes you a perfect fit for our team. Don't forget to mention specific projects or achievements that showcase your expertise.

Showcase Your Problem-Solving Skills: In your application, give examples of how you've tackled complex issues in past roles. We love seeing candidates who can think on their feet and come up with innovative solutions, especially in high-pressure situations!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you're keen on joining the StudySmarter family!

How to prepare for a job interview at Tes

✨Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, especially AWS, Azure, GCP, Docker, and Kubernetes. Brush up on your knowledge of Infrastructure as Code with Terraform and be ready to discuss how you've used these tools in past projects.

✨Showcase Problem-Solving Skills

Prepare to share specific examples of how you've tackled major incidents or complex system issues. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your analytical skills and ability to work under pressure.

✨Emphasise Collaboration

As a Senior Site Reliability Engineer, collaboration is key. Be ready to discuss how you've worked with development and operations teams in the past. Highlight any experiences where you’ve fostered a culture of continuous improvement and optimisation.

✨Prepare Questions

Have a list of insightful questions ready to ask your interviewers. This could include inquiries about their current SRE practices, team dynamics, or future projects. It shows your genuine interest in the role and helps you assess if the company is the right fit for you.
