At a Glance
- Tasks: Lead the reliability vision for a cutting-edge SaaS platform and tackle complex system challenges.
- Company: Join Obsidian Security, a fast-growing tech company transforming SaaS security.
- Benefits: Enjoy competitive pay, flexible time off, and comprehensive healthcare benefits.
- Other info: Be part of a diverse team driving innovation in SaaS security.
- Why this job: Make a real impact by safeguarding critical infrastructure for major enterprises.
- Qualifications: 5+ years in SRE or related roles with strong technical leadership experience.
The predicted salary is between £70,000 and £90,000 per year.
Founded in 2017, Obsidian Security was created to close a critical gap: securing the SaaS applications where modern business happens—platforms like Microsoft 365, Salesforce, and hundreds more. Backed by top investors including Greylock, Norwest Venture Partners, and IVP, we’ve built a complete SaaS security platform to reduce risk, detect and respond to threats, and prevent breaches at the source. Our team includes leaders who helped define the categories of endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black. Now, we’re transforming how SaaS is secured—in the era of agentic AI. Today, Obsidian is trusted by global enterprises like Snowflake, T-Mobile, and Pure Storage. We protect more than 200 organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand—including many of the world’s largest Fortune 1000 and Global 2000 companies. With strong global momentum, a growing partner ecosystem including SentinelOne, Databricks, and Google Cloud, and a major fundraise on the horizon, we’re scaling quickly toward long‑term growth and IPO readiness. Join us as we define the future of SaaS security!
As a Staff SRE at Obsidian you will define and drive the company‑wide reliability vision for a complex, multi‑tenant SaaS platform serving enterprise and financial customers. You will operate as a strategic partner to DevOps and Platform Engineering leadership, shaping a unified reliability strategy that scales across the organization. Your core mandate: ensure Obsidian detects, diagnoses, and communicates system issues before customers are impacted—consistently and predictably. This is a hands‑on technical role that involves architecting and leading the implementation of systems that handle real‑world complexity, including upstream SaaS dependencies, sparse and noisy signals, and mission‑critical enterprise workloads.
Key Responsibilities
- Reliability Strategy & Architecture – Define and lead long‑term reliability strategy across services. Establish end‑to‑end system visibility frameworks and guide architecture for observability, detection, and resilience.
- Cross‑Org Leadership – Partner across teams to embed reliability, standardize SLI/SLOs, and serve as a technical escalation expert.
- Detection & Observability – Build intelligent detection systems (anomaly detection, connector health models) and enable self‑service observability.
- Incident Management – Define and evolve a tiered incident communication strategy, improve response practices, and lead post‑mortems to strengthen reliability and customer trust.
- Execution – Contribute hands‑on to system design, monitoring, and debugging across distributed systems and data pipelines.
Required Qualifications
- 5+ years in SRE, Production Engineering, or related roles
- 3+ years operating at a senior or technical leadership level (Staff or equivalent scope)
- Deep expertise in AWS and/or GCP, Kubernetes and Helm, and observability stacks (Prometheus, Grafana, or equivalent)
- Proven experience designing and scaling reliability systems for multi‑tenant SaaS platforms
- Strong debugging and systems thinking across distributed microservices and legacy systems
- Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience
- Hands‑on engineering approach with a track record of building—not just configuring—reliability systems
Preferred Qualifications
- Experience in B2B SaaS serving enterprise or financial customers
- Familiarity with third‑party SaaS connector architectures and ingestion patterns
- Experience building anomaly detection or intelligent alerting systems
- Experience designing customer‑facing status pages and incident communication frameworks
Why This Role
- Drive org‑wide reliability strategy
- Own and build new detection & observability systems
- Tackle complex distributed systems challenges
- Safeguard critical infrastructure for financial customers
What Success Looks Like
- Issues caught and resolved before customer impact
- Reliability is measurable and continuously improving
- Teams self‑serve observability with scalable tools
- Clear, proactive incident communication builds trust
- Reliability becomes a competitive advantage
Employee Benefits
Our competitive benefits packages are designed to support our employees' well‑being, both at work and at home. Our US‑based employees enjoy:
- Competitive compensation with equity and 401k
- Comprehensive healthcare with dental and vision coverage
- Flexible paid time off and paid holiday time off
- 12 weeks of new parent or family leave
- Personal and professional development resources
Pay Transparency
Please note that the base pay range is a guideline. For candidates who receive an offer, base pay will vary based on factors such as work location and the candidate's knowledge, skills, and experience. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation based on the role or function within the company.
Equal Employment Opportunity
At Obsidian, we are proud to be an equal‑opportunity employer. We value diversity and hire for talent, passion, and compassion. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization. If you have a need that requires accommodation, please contact accommodations@obsidiansecurity.com. Information collected and processed as part of any job application you choose to submit is subject to Obsidian’s Applicant Privacy Policy.
Staff Site Reliability Engineer in Manchester employer: Norwest Venture
Contact Detail:
Norwest Venture Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Staff Site Reliability Engineer role in Manchester
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, especially those at Obsidian. A friendly chat can open doors and give you insights that a job description just can't.
✨Tip Number 2
Show off your skills! If you've got a portfolio or any projects that highlight your SRE expertise, make sure to share them during interviews. Real-world examples speak volumes.
✨Tip Number 3
Prepare for technical challenges! Brush up on your debugging and systems thinking skills. Be ready to tackle some hands-on scenarios that showcase your problem-solving abilities.
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining the team at Obsidian.
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with reliability strategies and SaaS platforms. We want to see how your skills align with our mission at Obsidian!
Showcase Your Technical Skills: Don’t hold back on detailing your hands-on experience with AWS, GCP, Kubernetes, and observability stacks. We’re looking for someone who can dive deep into the technical aspects of the role, so let us know what you’ve built!
Highlight Leadership Experience: Since this is a senior role, it’s crucial to demonstrate your leadership capabilities. Share examples of how you've led initiatives or teams in improving system reliability and incident management.
Apply Through Our Website: We encourage you to submit your application directly through our website. It’s the best way for us to receive your details and ensure you’re considered for the Staff Site Reliability Engineer position!
How to prepare for a job interview at Norwest Venture
✨Know Your Stuff
Make sure you brush up on your technical skills, especially around AWS, GCP, and Kubernetes. Be ready to discuss your hands-on experience with reliability systems and how you've tackled complex distributed systems challenges in the past.
✨Showcase Your Leadership
As a Staff SRE, you'll need to demonstrate your ability to lead initiatives. Prepare examples of how you've partnered across teams to embed reliability and improve incident detection and response. Highlight any experience you have in shaping a reliability strategy.
✨Be Ready for Scenario Questions
Expect questions that test your problem-solving skills in real-world scenarios. Think about how you would handle system issues before they impact customers and be prepared to discuss your approach to incident management and post-mortems.
✨Communicate Clearly
Effective communication is key, especially when it comes to incident communication strategies. Practice explaining complex technical concepts in simple terms, as you'll need to build trust with both technical and non-technical stakeholders.