Site Reliability Engineering (SRE) Manager in Basingstoke

Basingstoke | Full-Time | £120,000 / year (est.) | Home office possible
Halian Technology Limited

At a Glance

  • Tasks: Lead incident management and ensure system reliability in a hands-on SRE role.
  • Company: Join a forward-thinking tech company that values true SRE principles.
  • Benefits: Competitive salary, fully remote work, and opportunities for professional growth.
  • Why this job: Make a real impact on system reliability and lead a small, dynamic team.
  • Qualifications: Expertise in AWS, Linux troubleshooting, and experience leading technical teams.
  • Other info: Work in a supportive environment focused on SRE excellence and innovation.

The predicted salary is £120,000 per year.

UK Remote Permanent | Up to £120,000 | Fully Remote (UK Only)

This is NOT a DevOps role. Real SRE work only.

We are looking for a true Senior Site Reliability Engineer with deep incident management experience, strong operational ownership, and expert Linux/AWS troubleshooting skills. This role is focused entirely on reliability, availability, incident response, and systems engineering, not building CI/CD pipelines or acting as DevOps by another name.

Leadership Requirement

Small Team Technical Lead. You must have experience leading a small engineering team (2-5 people), defining technical direction, improving on-call processes, and owning reliability strategy. This is a hands-on role with real SRE leadership, not people management.

About the Role

As a Senior SRE, you will own the reliability, resilience, and operational health of large-scale AWS/Linux systems. You will join an engineering organisation where SRE principles are fully embedded, respected, and treated as a distinct discipline.

Key Responsibilities

  • Lead major incidents, mitigation, RCA, and preventative improvements
  • Own and refine SLIs, SLOs, and error budgets
  • Reduce operational toil through automation
  • Deep-dive Linux debugging, performance tuning, and systems analysis
  • Strengthen observability, monitoring, and alerting
  • Provide technical leadership to a small SRE/engineering group
  • Improve and manage on-call processes (PagerDuty, OpsGenie, etc.)
  • Collaborate with development teams to build reliability into system design

What You'll Bring

  • Strong AWS experience (EC2, networking, autoscaling, IAM, load balancing)
  • Deep Linux troubleshooting skills (performance, networking, debugging)
  • Real 24/7 production on-call experience
  • Hands-on incident management and postmortems
  • Experience mentoring or leading a small technical team
  • Scripting/automation with Python, Go, or Bash
  • Strong observability skills (Datadog, Prometheus, Grafana, CloudWatch)

Why This Role Appeals to Real SREs

You will be solving actual SRE problems: reliability, incidents, resilience, uptime. You will guide a small team through complex engineering challenges.

Site Reliability Engineering (SRE) Manager in Basingstoke employer: Halian Technology Limited

As a Senior Site Reliability Engineering (SRE) Manager at our company, you will thrive in a fully remote environment that champions innovation and technical excellence. We offer a collaborative work culture where SRE principles are deeply respected, alongside opportunities for professional growth and leadership development within a small, dedicated team. Enjoy the flexibility of remote work while tackling real-world challenges in reliability and operational health, making a meaningful impact on our systems and processes.

Contact Detail:

Halian Technology Limited Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineering (SRE) Manager role in Basingstoke

✨Tip Number 1

Network, network, network! Reach out to your connections in the SRE community. Attend meetups or webinars where you can chat with industry professionals. You never know who might have a lead on that perfect role!

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your incident management experience and Linux troubleshooting projects. This gives potential employers a taste of what you can bring to the table.

✨Tip Number 3

Prepare for those interviews by brushing up on your technical knowledge and incident response strategies. Practice common SRE scenarios and be ready to discuss how you've handled real-world challenges in the past.

✨Tip Number 4

Don’t forget to apply through our website! We’re always on the lookout for talented individuals like you. Plus, it’s a great way to ensure your application gets the attention it deserves.

We think you need these skills to ace the Site Reliability Engineering (SRE) Manager role in Basingstoke

Incident Management
Operational Ownership
Linux Troubleshooting
AWS (EC2, Networking, Autoscaling, IAM, Load Balancing)
Systems Engineering
Technical Leadership
SLIs, SLOs, and Error Budgets
Automation (Python, Go, Bash)
Observability (Datadog, Prometheus, Grafana, CloudWatch)
On-Call Process Management
Performance Tuning
Collaboration with Development Teams
Deep-Dive Systems Analysis

Some tips for your application 🫡

Show Your SRE Skills: Make sure to highlight your deep incident management experience and Linux/AWS troubleshooting skills in your application. We want to see how you've tackled real SRE challenges, so don’t hold back on the details!

Be Clear About Leadership Experience: Since this role involves leading a small engineering team, it’s crucial to showcase your experience in defining technical direction and improving on-call processes. We’re looking for someone who can take charge, so let us know how you’ve done that before!

Tailor Your Application: Don’t just send a generic CV! Tailor your application to reflect the specific responsibilities and requirements mentioned in the job description. We appreciate when candidates take the time to align their experiences with what we’re looking for.

Apply Through Our Website: We encourage you to apply through our website for a smoother process. It helps us keep track of applications better and ensures you don’t miss out on any important updates from us!

How to prepare for a job interview at Halian Technology Limited

✨Know Your SRE Fundamentals

Make sure you brush up on your SRE principles and practices. Understand the key concepts like SLIs, SLOs, and error budgets, as these will likely come up in conversation. Being able to discuss how you've applied these in real-world scenarios will show your depth of knowledge.
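If it helps to have the arithmetic at your fingertips, here is a minimal Python sketch of how an availability SLI and an error budget are commonly computed. It is an illustration only, not anything taken from the employer's systems: the function names, the 30-day window, and the request-based definition of the SLI are all assumptions made for this example.

```python
def availability_sli(total_requests: int, failed_requests: int) -> float:
    """Availability SLI: the fraction of requests that succeeded (assumed definition)."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    return (total_requests - failed_requests) / total_requests


def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Minutes of full downtime a given availability SLO tolerates over the window."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return total_minutes * (1.0 - slo_target)


def budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Fraction of the error budget still unspent; negative once the SLO is breached."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    allowed_failures = total_requests * (1.0 - slo_target)
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Hypothetical numbers: a 99.9% SLO, one million requests, 250 failures.
    print(f"SLI: {availability_sli(1_000_000, 250):.5f}")
    print(f"Budget over 30 days: {error_budget_minutes(0.999):.1f} minutes")
    print(f"Budget remaining: {budget_remaining(0.999, 1_000_000, 250):.0%}")
```

Being able to walk through numbers like these, for example that a 99.9% target leaves roughly 43 minutes of downtime in a 30-day window, is one way to show you have applied the concepts rather than just read about them.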

✨Showcase Your Incident Management Skills

Prepare to share specific examples of major incidents you've managed. Talk about your role in the incident response, how you led the team through it, and what improvements you implemented afterwards. This will demonstrate your hands-on experience and leadership capabilities.

✨Demonstrate Technical Leadership

Since this role involves leading a small team, be ready to discuss your experience in guiding others. Highlight any mentoring you've done or how you've defined technical direction for a team. This will help illustrate your ability to lead without being purely a people manager.

✨Get Familiar with Tools and Technologies

Make sure you're well-versed in the tools mentioned in the job description, like AWS services, Linux troubleshooting, and observability platforms. If you have experience with specific tools like Datadog or Grafana, be prepared to discuss how you've used them to improve system reliability.
