Sr. Site Reliability Engineer in Cheltenham

Job Board

Companies

Obsidian Security

Sr. Site Reliability Engineer

Sr. Site Reliability Engineer in Cheltenham

Cheltenham Full-Time 95000 - 117000 € / year (est.) No home office possible

Apply Now

At a Glance

Tasks: Ensure reliability and scalability of a cutting-edge SaaS platform while collaborating with diverse teams.
Company: Join Obsidian Security, a leader in SaaS security backed by top investors.
Benefits: Enjoy competitive pay, flexible time off, and comprehensive healthcare benefits.
Other info: Dynamic work environment with opportunities for personal and professional growth.
Why this job: Tackle real-world challenges in cloud infrastructure and make a difference for global enterprises.
Qualifications: 3-6 years in Site Reliability Engineering or related fields; AWS/GCP experience required.

The predicted salary is between 95000 - 117000 € per year.

Founded in 2017, Obsidian Security was created to close a critical gap: securing the SaaS applications where modern business happens—platforms like Microsoft 365, Salesforce, and hundreds more. Backed by top investors including Greylock, Norwest Venture Partners, and IVP, we’ve built a complete SaaS security platform to reduce risk, detect and respond to threats, and prevent breaches at the source. Our team includes leaders who helped define the categories of endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black. Now, we’re transforming how SaaS is secured—in the era of agentic AI. Today, Obsidian is trusted by global enterprises like Snowflake, T-Mobile, and Pure Storage. We protect more than 200 organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand—including many of the world’s largest Fortune 1000 and Global 2000 companies. With strong global momentum, a growing partner ecosystem including SentinelOne, Databricks, and Google Cloud, and a major fundraise on the horizon, we’re scaling quickly toward long-term growth and IPO readiness. Join us as we define the future of SaaS security!

At Obsidian, our Sr. Site Reliability Engineers ensure the reliability, scalability, and operational excellence of a complex multi-tenant SaaS platform serving enterprise and financial customers. As an SRE, you will work closely with DevOps, Platform Engineering, and product teams to improve system observability, incident response, and service resilience across the platform. This is a hands-on engineering role focused on building operational excellence through monitoring, automation, debugging, and continuous improvement. You will help ensure that issues are detected and addressed quickly while contributing to systems that improve platform reliability at scale.

Key Responsibilities

Reliability Engineering: Improve the reliability, availability, and resiliency of Obsidian’s production systems and distributed services
Detection & Observability: Build and maintain monitoring, alerting, dashboards, and observability tooling to enhance system visibility and reduce operational noise
Incident Response & Operations: Support incident response, on-call operations, troubleshooting, and postmortem processes to drive operational excellence
Collaboration: Partner with engineering teams to implement SLI/SLO practices, operational standards, and reliability-focused workflows
Execution: Automate infrastructure operations, deployment workflows, and platform tooling across Kubernetes, cloud infrastructure, and data pipelines

Required Qualifications

3-6 years of experience in Site Reliability Engineering, DevOps, Production Engineering, or related roles
Experience operating and supporting production systems in AWS and/or GCP
Familiarity with Kubernetes and Helm in cloud-native environments
Experience with observability and monitoring tools such as Prometheus, Grafana, Datadog, or similar platforms
Exposure to CI/CD systems such as GitLab CI/CD, GitHub Actions, ArgoCD, or equivalent
Strong troubleshooting and debugging skills across distributed systems and microservices
Experience writing automation or infrastructure tooling using scripting or programming languages
Strong systems thinking and a collaborative engineering mindset

Preferred Qualifications

AI Agent development experience
Experience supporting SaaS platforms in production environments
Familiarity with incident management and postmortem practices
Exposure to infrastructure-as-code and GitOps workflows
Understanding of SLI/SLO concepts and operational metrics
Experience with enterprise-scale monitoring or customer-facing production systems

Why This Role

Work on reliability challenges across a large-scale distributed SaaS platform
Build and improve observability and operational tooling used across engineering
Gain hands-on experience with cloud infrastructure, Kubernetes, and production systems
Help safeguard critical services for enterprise and financial customers

What Success Looks Like

Production issues are detected and resolved quickly
Monitoring and alerting provide clear, actionable operational insights
Reliability metrics and operational practices improve over time
Engineering teams can effectively troubleshoot and self-serve observability
Automation reduces operational toil and improves platform stability

Employee Benefits

Our competitive benefits packages are designed to support our employees' well-being, both at work and at home. Our US based employees enjoy:

Competitive compensation with equity and 401k
Comprehensive healthcare with dental and vision coverage
Flexible paid time off and paid holiday time off
12 weeks of new parent or family leave
Personal and professional development resources

For more details on our US benefits, or for information on our international benefits, please see here.

Pay Transparency

Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location, as well as the knowledge, skills and experience of the candidate. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation based on the role or function within the company.

At Obsidian, we are proud to be an equal-opportunity employer. We value diversity and hire for talent, passion, and compassion. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization. If you have a need that requires accommodation, please contact accommodations@obsidiansecurity.com.

Information collected and processed as part of any job applications you choose to submit is subject to Obsidian’s Applicant Privacy Policy.

Base Salary Range £95,000 - £117,000 GBP

Sr. Site Reliability Engineer in Cheltenham employer: Obsidian Security

Obsidian Security is an exceptional employer that prioritises employee well-being and professional growth, offering competitive compensation packages, comprehensive healthcare, and flexible paid time off. Our collaborative work culture fosters innovation and continuous improvement, allowing Sr. Site Reliability Engineers to tackle complex challenges while contributing to the security of major SaaS platforms. With a focus on diversity and inclusion, we empower our team members to thrive in a dynamic environment as we scale towards long-term growth and IPO readiness.

Contact Detail:

Obsidian Security Recruiting Team

View Obsidian Security Profile

StudySmarter Expert Advice🤫

We think this is how you could land Sr. Site Reliability Engineer in Cheltenham

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with current employees at Obsidian. A friendly chat can sometimes lead to a referral, which is a golden ticket in the job hunt.

✨Tip Number 2

Show off your skills! Prepare a portfolio or a GitHub repository showcasing your projects, especially those related to Site Reliability Engineering. This gives you a chance to demonstrate your expertise beyond just words on a CV.

✨Tip Number 3

Ace the interview by practising common SRE scenarios. Brush up on your troubleshooting skills and be ready to discuss how you've handled incidents in the past. We want to see your thought process in action!

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Obsidian team.

We think you need these skills to ace Sr. Site Reliability Engineer in Cheltenham

Site Reliability Engineering

DevOps

Production Engineering

AWS

GCP

Kubernetes

Helm

Observability Tools

Monitoring Tools

Prometheus

Grafana

Datadog

CI/CD Systems

GitLab CI/CD

GitHub Actions

Troubleshooting Skills

Debugging Skills

Scripting Languages

Infrastructure Tooling

Systems Thinking

Collaboration

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the role of Sr. Site Reliability Engineer. Highlight your experience with AWS, GCP, and Kubernetes, and don’t forget to mention any relevant tools like Prometheus or Grafana. We want to see how your skills align with what we’re looking for!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you’re passionate about SaaS security and how your background makes you a great fit for our team. Keep it concise but engaging—show us your personality!

Showcase Your Problem-Solving Skills:In your application, give examples of how you've tackled complex issues in production systems. We love seeing candidates who can think critically and act decisively, so share those stories that demonstrate your troubleshooting prowess!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, you’ll find all the details about the role and our company culture there!

How to prepare for a job interview at Obsidian Security

✨Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, like AWS, GCP, Kubernetes, and observability tools. Brush up on your experience with these platforms and be ready to discuss specific projects where you’ve used them.

✨Showcase Your Problem-Solving Skills

Prepare to share examples of how you've tackled production issues in the past. Think about incidents you've managed, what steps you took to resolve them, and how you improved processes afterwards. This will demonstrate your hands-on experience and troubleshooting abilities.

✨Understand SLI/SLO Concepts

Since this role focuses on reliability engineering, make sure you can explain SLI (Service Level Indicator) and SLO (Service Level Objective) concepts clearly. Be prepared to discuss how you’ve implemented these practices in previous roles to enhance system reliability.

✨Ask Insightful Questions

Prepare thoughtful questions about the company’s approach to incident management, their current challenges in SaaS security, and how they measure success in the SRE team. This shows your genuine interest in the role and helps you assess if it’s the right fit for you.

Sr. Site Reliability Engineer in Cheltenham

Obsidian Security

Location: Cheltenham

Apply Now

Sr. Site Reliability Engineer in Cheltenham

At a Glance

Sr. Site Reliability Engineer in Cheltenham employer: Obsidian Security

StudySmarter Expert Advice🤫

We think you need these skills to ace Sr. Site Reliability Engineer in Cheltenham

Some tips for your application 🫡

How to prepare for a job interview at Obsidian Security

Company

Product

Help