Site Reliability Engineer in London

London | Full-Time | £30,000 - £50,000 / year (est.) | Home office (partial)

At a Glance

  • Tasks: Support and enhance cloud systems for millions of users, ensuring reliability and scalability.
  • Company: Join a forward-thinking tech company focused on innovation and collaboration.
  • Benefits: Enjoy competitive pay, health perks, remote work options, and growth opportunities.
  • Why this job: Be at the forefront of cloud technology and make a real impact on system reliability.
  • Qualifications: Experience in software development, Kubernetes, and cloud solutions is essential.
  • Other info: Dynamic team environment with a strong focus on continuous learning and career advancement.

The predicted salary is between £30,000 and £50,000 per year.

We are seeking an experienced Site Reliability Engineer to join the Cloud Enabling team. You will play a crucial role in maturing our SRE capability and contributing to the resiliency, availability, and security of our infrastructure and software.

Day to day:

  • Support systems that serve customers and handle billions of requests each month, ensuring availability, scalability, and resiliency.
  • Act as a key technical contributor, liaising with SRE guilds to drive improvements in cloud deployments, monitoring solutions, CI/CD pipelines, and cost optimisation.
  • Drive innovation by exploring new technologies and methodologies to enhance SRE capabilities, including AI tooling and automation opportunities.
  • Manage high-throughput systems in production to deliver customer value beyond proof-of-concepts.
  • Implement SLAs/SLOs/SLIs for software and data teams.
  • Develop tooling for efficient incident triage, granular alerting, well-defined runbooks, and auto‑resolving mechanisms.
  • Serve as a subject matter expert in engineering conversations related to site reliability, fostering a culture of continuous learning and development.

Requirements:

  • Proven hands-on experience in software development, testing, monitoring, and operational stability at scale.
  • Production experience with Kubernetes and monitoring tools such as Datadog or Dynatrace.
  • Strong knowledge of automation, CI/CD, and best practices.
  • Experience running post-mortems, defining SLAs/SLIs/SLOs, and participating in support rotas.
  • Coding/scripting experience (Python/Bash) in a commercial setting.
  • Database knowledge, streaming and batch operations, and API design.
  • Good background with Kubernetes (ideally microservice architectures using Istio service mesh).
  • Extensive experience with cloud-native solutions (ideally Google Cloud).
  • Solid understanding of cloud storage, networking, and resource provisioning.

Site Reliability Engineer in London employer: McGregor Boyall

At McGregor Boyall, we pride ourselves on fostering a dynamic and inclusive work culture that empowers our employees to thrive. As a Site Reliability Engineer, you will have the opportunity to work with cutting-edge technologies in a collaborative environment, driving innovation and enhancing our cloud capabilities. We offer comprehensive benefits, continuous learning opportunities, and a commitment to your professional growth, making us an exceptional employer for those seeking meaningful and rewarding careers in technology.

Contact Details:

McGregor Boyall Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer role in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with other Site Reliability Engineers. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those involving Kubernetes, CI/CD pipelines, and cloud-native solutions. This gives potential employers a taste of what you can bring to the table.

✨Tip Number 3

Prepare for technical interviews by brushing up on your coding and scripting skills, particularly in Python and Bash. Practice common SRE scenarios and be ready to discuss your experience with monitoring tools like Datadog or Dynatrace.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!

We think you need these skills to ace the Site Reliability Engineer role in London

Site Reliability Engineering
Containerisation
Google Cloud Platform (GCP)
CI/CD Pipelines
Kubernetes
Monitoring Tools (Datadog, Dynatrace)
Automation
Incident Management
SLA/SLO/SLI Implementation
Coding/Scripting (Python, Bash)
Database Knowledge
API Design
Cloud-Native Solutions
Networking
Resource Provisioning

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience with Kubernetes, CI/CD pipelines, and cloud-native solutions. We want to see how your skills align with the role, so don’t be shy about showcasing your hands-on experience!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about site reliability engineering and how you can contribute to our Cloud Enabling team. Let us know what excites you about the role and our company.

Showcase Your Technical Skills: Don’t forget to mention your coding/scripting experience, especially in Python or Bash. We love seeing practical examples of how you've implemented SLAs/SLOs/SLIs or driven improvements in cloud deployments.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!

How to prepare for a job interview at McGregor Boyall

✨Know Your Tech Inside Out

Make sure you brush up on your knowledge of Kubernetes, CI/CD pipelines, and cloud-native solutions, especially Google Cloud. Be ready to discuss your hands-on experience with monitoring tools like Datadog or Dynatrace, as well as your coding skills in Python or Bash.

✨Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled high-throughput systems and improved availability or resiliency. Think about incidents you've managed and how you implemented SLAs/SLOs/SLIs to enhance operational stability.

✨Demonstrate Continuous Learning

Highlight your passion for exploring new technologies and methodologies. Discuss any AI tooling or automation projects you've worked on, and how they contributed to enhancing SRE capabilities. This shows you're not just about maintaining the status quo.

✨Engage in Engineering Conversations

Be prepared to act as a subject matter expert during discussions. Share your insights on best practices in site reliability and how you've fostered a culture of continuous learning in your previous roles. This will show your potential employer that you can contribute to their team dynamic.
