Site Reliability Engineer

London | Full-Time | £36,000 - £60,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Improve system reliability and performance while collaborating with engineers and product owners.
  • Company: Join an inclusive team committed to innovation and professional development.
  • Benefits: Enjoy a collaborative work environment with opportunities for growth and learning.
  • Why this job: Be part of a dynamic team that values creativity and continuous improvement in tech.
  • Qualifications: Strong knowledge of site reliability engineering and programming languages required.
  • Other info: Experience with observability tools and cloud environments is a plus.

The predicted salary is between £36,000 and £60,000 per year.

Join us as a Site Reliability Engineer. In this key role, you’ll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services.

You’ll enjoy significant stakeholder interaction, working in collaboration with engineers and product owners to ensure a principled approach to delivering change in a safe and secure way. This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development.

As our Site Reliability Engineer, you’ll work closely with our feature team and other colleagues to meet defined service level objectives and continually improve system and environment reliability. You’ll define SLOs, SLIs and error budgets that support finding the right balance between risk, reliability and continuous improvement.
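To make the relationship between an SLO, an SLI and an error budget concrete, here is a minimal sketch in Python. It assumes a simple request-based availability SLI; the `ErrorBudget` class, its field names and the figures in the usage note are hypothetical illustrations, not something taken from the employer's tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErrorBudget:
    """Request-based error budget for one SLO window (illustrative only)."""

    slo_target: float      # assumed target, e.g. 0.999 for "99.9% of requests succeed"
    total_requests: int    # requests observed in the window
    failed_requests: int   # requests that violated the SLI in the window

    @property
    def sli(self) -> float:
        """Measured success ratio; treated as 1.0 when there was no traffic."""
        if self.total_requests == 0:
            return 1.0
        return 1 - self.failed_requests / self.total_requests

    @property
    def allowed_failures(self) -> float:
        """Number of failures the SLO tolerates for this volume of traffic."""
        return (1 - self.slo_target) * self.total_requests

    @property
    def budget_remaining(self) -> float:
        """Fraction of the error budget left; negative once the budget is overspent."""
        allowed = self.allowed_failures
        if allowed == 0:
            return 0.0 if self.failed_requests else 1.0
        return 1 - self.failed_requests / allowed

    @property
    def slo_met(self) -> bool:
        """True while the measured SLI is at or above the target."""
        return self.sli >= self.slo_target
```

For example, `ErrorBudget(slo_target=0.999, total_requests=1_000_000, failed_requests=250)` tolerates roughly 1,000 failures, so about three quarters of the budget would remain. A team might use a figure like this to decide whether to keep shipping changes or to pause and invest in reliability.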

You’ll also provide structure and help to our release process, suggesting and making improvements where possible. You’ll scale systems sustainably through mechanisms like automation, evolving them by pushing for changes that improve reliability and velocity. We’ll also look to you to coach and provide guidance to colleagues and the wider team, leading where required.

In addition to this, you’ll:

  • Proactively contribute new ideas and innovations to meet short-term and longer-term goals
  • Continually balance and manage any potential risks
  • Be accountable for the day-to-day development and health of both production and non-production environments and respond to any incidents as required
  • Provide technical expertise and input to establish the risk tolerance of products and services
  • Communicate incident status updates clearly and frequently to other teams, customers and stakeholders and support blameless post-mortems

The skills you’ll need:

We’re looking for someone with strong knowledge of reliability systems thinking and experience of site reliability engineering. You’ll need experience of using a data-driven and scientific approach to fact finding. We’ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processes.

We’re also looking for:

  • Good knowledge and experience of programming languages
  • Strong knowledge of deploy and release services, automation, and troubleshooting
  • Experience of utilising tools and technology across the software development lifecycle
  • Experience using mathematical and statistical models to assess trends
  • Strong communication skills with the ability to proactively engage with a wide range of stakeholders
  • In-depth experience with observability tools such as Grafana, Prometheus and OpenTelemetry
  • Strong knowledge of public cloud environments such as AWS and GCP, and Infrastructure as Code tools such as Terraform

Site Reliability Engineer employer: NatWest

As a Site Reliability Engineer with us, you'll be part of an inclusive and innovative team that values collaboration and professional growth. Our commitment to employee development is reflected in our supportive work culture, where you can actively contribute to meaningful projects while enjoying the benefits of flexible working arrangements and access to cutting-edge technologies. Located in a vibrant area, we offer unique opportunities for networking and personal advancement, making us an excellent employer for those seeking a rewarding career in technology.

Contact Details:

NatWest Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer role

✨Tip Number 1

Familiarise yourself with the specific tools mentioned in the job description, like Grafana and Prometheus. Having hands-on experience or projects showcasing your skills with these observability tools can set you apart during interviews.

✨Tip Number 2

Engage with the Site Reliability Engineering community online. Join forums, attend webinars, or participate in discussions on platforms like LinkedIn. This not only helps you learn but also expands your network, which could lead to referrals.

✨Tip Number 3

Prepare to discuss real-world scenarios where you've improved system reliability or managed incidents. Use the STAR method (Situation, Task, Action, Result) to structure your responses, demonstrating your problem-solving skills effectively.

✨Tip Number 4

Showcase your understanding of financial services and how they relate to site reliability. Being able to connect your technical skills with business impact will demonstrate your value to the team and align with the company's goals.

We think you need these skills to ace the Site Reliability Engineer role

Site Reliability Engineering
Reliability Systems Thinking
Data-Driven Decision Making
Financial Services Knowledge
Programming Languages
Deploy and Release Services
Automation
Troubleshooting
Software Development Lifecycle
Mathematical and Statistical Modelling
Observability Tools (Grafana, Prometheus, OpenTelemetry)
Public Cloud Environments (AWS, GCP)
Infrastructure as Code (Terraform)
Strong Communication Skills
Stakeholder Engagement

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in site reliability engineering and showcases your knowledge of reliability systems thinking. Include specific examples of how you've improved system reliability or performance in previous roles.

Craft a Compelling Cover Letter: In your cover letter, express your enthusiasm for the role and the company. Discuss how your skills align with the job description, particularly your experience with programming languages, automation, and observability tools like Grafana and Prometheus.

Showcase Your Problem-Solving Skills: Provide examples in your application that demonstrate your ability to use a data-driven approach to problem-solving. Highlight any experience you have with incident response and managing risks in production environments.

Highlight Stakeholder Engagement: Emphasise your strong communication skills and your experience working with various stakeholders. Mention any instances where you successfully communicated incident status updates or led post-mortems, as this is crucial for the role.

How to prepare for a job interview at NatWest

✨Showcase Your Technical Expertise

Be prepared to discuss your experience with programming languages and site reliability engineering. Highlight specific projects where you've implemented automation or troubleshooting techniques, as this will demonstrate your hands-on skills.

✨Understand the Role of SLOs and SLIs

Familiarise yourself with Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Be ready to explain how you have defined and managed these in previous roles, as they are crucial for balancing reliability and risk.

✨Communicate Clearly and Effectively

Since the role involves significant stakeholder interaction, practice articulating complex technical concepts in a clear and concise manner. Prepare examples of how you've communicated incident updates or collaborated with cross-functional teams.

✨Demonstrate a Data-Driven Approach

Prepare to discuss how you've used data and statistical models to assess trends and make informed decisions. This will show your ability to apply a scientific approach to problem-solving, which is essential for a Site Reliability Engineer.
