Senior Site Reliability Engineer

Full-Time, £70,000 - £90,000 / year (est.), hybrid working

At a Glance

Tasks: Lead the charge in scaling and securing our AWS and Kubernetes infrastructure.
Company: Join Orgvue, a top-tier organisational design software platform based in London.
Benefits: Enjoy hybrid working, wellbeing perks, private medical insurance, and generous holiday allowance.
Other info: Be part of a dynamic team focused on innovation and growth.
Why this job: Make a real impact on reliability culture while collaborating with diverse teams.
Qualifications: Proven SRE experience with strong Kubernetes and AWS skills required.

The predicted salary is between £70,000 and £90,000 per year.

Orgvue is a leading organizational design and planning software platform that captures the power of data visualization and modelling to build more adaptable, better-performing organizations. HR, finance and business leaders use Orgvue for actionable insight and analysis that helps them make faster workforce decisions in a constantly changing world. Orgvue is used by the world's largest and best-known enterprises and management consulting firms to visualize and confidently build the businesses they want tomorrow, today. The company is headquartered in London, with offices in Philadelphia, The Hague, Toronto, and Sydney.

We are seeking a Senior Site Reliability Engineer who will be a senior technical leader focused on scaling and hardening our AWS- and Kubernetes-based infrastructure.

Role

In this role you will work across product, platform, and operations teams to ensure our systems are reliable, observable, and resilient, even at scale. This role combines hands‑on technical capability with strategic vision, helping us build a world‑class reliability culture and a robust engineering foundation for growth. We’re looking for someone who has technical expertise, is a great communicator and enjoys collaborating across multiple teams.

Responsibilities

Define and enforce SLOs, SLIs, and error budgets across critical services
Craft and implement a cloud infrastructure and tooling strategy
Work across the organisation to level up SRE practices
Help implement robust observability (metrics, logs, and traces) using our observability tool
Guide the team in building automated, self‑healing systems
Own and evolve our incident response processes, including on‑call practices and post‑mortem culture
Mentor engineers across the org on best practices in reliability, operational readiness, and scalable infrastructure
Drive Infrastructure as Code (IaC) using Terraform, Kubernetes, CloudFormation and GitOps practices
Collaborate closely with security, DevOps, and software teams to ensure compliance, scalability, and operational excellence
Evaluate and introduce tools, patterns, and practices that improve the performance and reliability of our SaaS platform

Requirements

Demonstrable experience leading SRE transformations
Deep hands‑on expertise with Kubernetes (EKS preferred) in production environments
Strong experience with AWS core services (EC2, EKS, RDS, S3, ALB/NLB, IAM, CloudWatch, etc.)
Expert in Infrastructure as Code using tools such as Terraform, with knowledge of GitOps workflows
Strong background in observability: metrics, visualization, logging, and tracing
Understanding of automation, SDLC, CI/CD pipelines, deployment automation, and blue/green or canary releases
Proven experience with incident management, disaster recovery planning, root cause analysis, and post‑incident reviews

Benefits

Hybrid working – 1+ days a week in the London office
Wellbeing: Sanctus Coaching, Virtual fitness sessions, Wellbeing webinars, Annual Wellbeing day
Subsidised Gym Membership
Private Medical Insurance (including Dental and Vision) and Life Assurance
25 days holiday (increasing to 30 days at a rate of 1 extra day per year)
Employer pension contribution of 5% of your gross salary, if you contribute a minimum of 3%
Season Ticket Loan
Cycle to Work Scheme
Annual Discretionary Bonus

Here at Orgvue we promote individualism and a diverse workforce to build on our future success.

Senior Site Reliability Engineer employer: Orgvue

Orgvue is an exceptional employer that fosters a culture of innovation and collaboration, making it an ideal place for a Senior Site Reliability Engineer to thrive. With a strong focus on employee wellbeing, including hybrid working options, comprehensive health benefits, and opportunities for professional growth, Orgvue empowers its team members to excel in their roles while contributing to the success of leading enterprises worldwide. Located in London, employees benefit from a vibrant city atmosphere and access to a diverse range of resources and networking opportunities.

Contact Details:

Orgvue Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land the Senior Site Reliability Engineer role

✨Tip Number 1

Network like a pro! Reach out to folks in your industry on LinkedIn or at meetups. A friendly chat can open doors that a CV just can't.

✨Tip Number 2

Show off your skills! If you’ve got a portfolio or GitHub with projects, make sure to share it. It’s a great way to demonstrate your expertise beyond the written application.

✨Tip Number 3

Prepare for interviews by practising common questions and scenarios related to SRE. We recommend doing mock interviews with friends or using online platforms to get comfortable.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive!

We think you need these skills to ace the Senior Site Reliability Engineer role

AWS

Kubernetes

Infrastructure as Code (IaC)

Terraform

CloudFormation

GitOps

Observability

Metrics

Logging

Tracing

Incident Management

Disaster Recovery Planning

Root Cause Analysis

Post-Incident Reviews

Collaboration Skills

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Senior Site Reliability Engineer role. Highlight your experience with AWS, Kubernetes, and Infrastructure as Code. We want to see how your skills align with what we're looking for!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about SRE and how you can contribute to our reliability culture. Keep it engaging and relevant to the job description.

Showcase Your Technical Skills: Don’t hold back on showcasing your technical expertise! Mention specific projects where you've implemented observability metrics or led SRE transformations. We love seeing real-world examples of your work.

Apply Through Our Website: We encourage you to apply through our website for a smoother application process. It helps us keep track of your application and ensures you don’t miss out on any important updates from us!

How to prepare for a job interview at Orgvue

✨Know Your Tech Inside Out

Make sure you brush up on your knowledge of AWS and Kubernetes, especially if you've worked with EKS. Be ready to discuss specific projects where you've implemented Infrastructure as Code using Terraform or similar tools. This will show that you not only understand the theory but have practical experience too.

✨Showcase Your Communication Skills

As a Senior Site Reliability Engineer, you'll need to collaborate across teams. Prepare examples of how you've effectively communicated complex technical concepts to non-technical stakeholders. This will demonstrate your ability to bridge the gap between tech and business needs.

✨Prepare for Scenario-Based Questions

Expect questions about incident management and disaster recovery planning. Think of real-life scenarios where you had to lead a post-incident review or implement a self-healing system. Sharing these experiences will highlight your problem-solving skills and reliability expertise.

✨Understand Orgvue's Culture and Values

Research Orgvue’s approach to organisational design and their commitment to building adaptable businesses. Be prepared to discuss how your values align with theirs and how you can contribute to fostering a world-class reliability culture within the company.
