At a Glance
- Tasks: Ensure reliability and scalability of a cutting-edge SaaS platform while collaborating with diverse teams.
- Company: Join Obsidian Security, a fast-growing leader in SaaS security backed by top investors.
- Benefits: Enjoy competitive pay, flexible time off, and comprehensive healthcare benefits.
- Other info: Be part of a dynamic team with excellent growth opportunities and a focus on innovation.
- Why this job: Tackle real-world challenges in cloud infrastructure and make a difference for global enterprises.
- Qualifications: 2-5 years in Site Reliability Engineering or related fields, with strong troubleshooting skills.
The predicted salary is between 60000 - 80000 € per year.
Founded in 2017, Obsidian Security was created to close a critical gap: securing the SaaS applications where modern business happens—platforms like Microsoft 365, Salesforce, and hundreds more. Backed by top investors including Greylock, Norwest Venture Partners, and IVP, we’ve built a complete SaaS security platform to reduce risk, detect and respond to threats, and prevent breaches at the source. Our team includes leaders who helped define the categories of endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black.
Now, we’re transforming how SaaS is secured—in the era of agentic AI. Today, Obsidian is trusted by global enterprises like Snowflake, T‑Mobile, and Pure Storage. We protect more than 200 organizations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand—including many of the world’s largest Fortune 1000 and Global 2000 companies. With strong global momentum, a growing partner ecosystem including SentinelOne, Databricks, and Google Cloud, and a major fundraise on the horizon, we’re scaling quickly toward long‑term growth and IPO readiness. Join us as we define the future of SaaS security!
At Obsidian, our Site Reliability Engineers ensure the reliability, scalability, and operational excellence of a complex multi‑tenant SaaS platform serving enterprise and financial customers. As an SRE, you will work closely with DevOps, Platform Engineering, and product teams to improve system observability, incident response, and service resilience across the platform. This is a hands‑on engineering role focused on building operational excellence through monitoring, automation, debugging, and continuous improvement. You will help ensure that issues are detected and addressed quickly while contributing to systems that improve platform reliability at scale.
Key Responsibilities- Reliability Engineering: Improve the reliability, availability, and resiliency of Obsidian’s production systems and distributed services
- Detection & Observability: Build and maintain monitoring, alerting, dashboards, and observability tooling to enhance system visibility and reduce operational noise
- Incident Response & Operations: Support incident response, on‑call operations, troubleshooting, and post‑mortem processes to drive operational excellence
- Collaboration: Partner with engineering teams to implement SLI/SLO practices, operational standards, and reliability‑focused workflows
- Execution: Automate infrastructure operations, deployment workflows, and platform tooling across Kubernetes, cloud infrastructure, and data pipelines
- 2–5 years of experience in Site Reliability Engineering, DevOps, Production Engineering, or related roles
- Experience operating and supporting production systems in AWS and/or GCP
- Familiarity with Kubernetes and Helm in cloud‑native environments
- Experience with observability and monitoring tools such as Prometheus, Grafana, Datadog, or similar platforms
- Exposure to CI/CD systems such as GitLab CI/CD, GitHub Actions, ArgoCD, or equivalent
- Strong troubleshooting and debugging skills across distributed systems and microservices
- Experience writing automation or infrastructure tooling using scripting or programming languages
- Strong systems thinking and a collaborative engineering mindset
- Experience supporting SaaS platforms in production environments
- Familiarity with incident management and post‑mortem practices
- Exposure to infrastructure‑as‑code and GitOps workflows
- Understanding of SLI/SLO concepts and operational metrics
- Experience with enterprise‑scale monitoring or customer‑facing production systems
- Work on reliability challenges across a large‑scale distributed SaaS platform
- Build and improve observability and operational tooling used across engineering
- Gain hands‑on experience with cloud infrastructure, Kubernetes, and production systems
- Help safeguard critical services for enterprise and financial customers
- Production issues are detected and resolved quickly
- Monitoring and alerting provide clear, actionable operational insights
- Reliability metrics and operational practices improve over time
- Engineering teams can effectively troubleshoot and self‑serve observability
- Automation reduces operational toil and improves platform stability
Our competitive benefits packages are designed to support our employees' well‑being, both at work and at home. Our US based employees enjoy:
- Competitive compensation with equity and 401k
- Comprehensive healthcare with dental and vision coverage
- Flexible paid time off and paid holiday time off
- 12 weeks of new parent or family leave
- Personal and professional development resources
Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location, as well as the knowledge, skills and experience of the candidate. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation based on the role or function within the company.
Equal Opportunity EmployerAt Obsidian, we are proud to be an equal‑opportunity employer. We value diversity and hire for talent, passion, and compassion. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization. If you have a need that requires accommodation, please contact accommodations@obsidiansecurity.com. Information collected and processed as part of any job applications you choose to submit is subject to Obsidian’s Applicant Privacy Policy.
Site Reliability Engineer in Manchester employer: Obsidian Security
At Obsidian Security, we pride ourselves on fostering a dynamic and inclusive work culture that prioritises employee well-being and professional growth. As a Site Reliability Engineer, you will be part of a forward-thinking team dedicated to tackling complex challenges in SaaS security, with access to competitive benefits, flexible working arrangements, and opportunities for continuous learning and development. Join us in our UK office and contribute to shaping the future of enterprise security while enjoying a supportive environment that values diversity and innovation.
StudySmarter Expert Advice🤫
We think this is how you could land Site Reliability Engineer in Manchester
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, especially those at Obsidian. A friendly chat can open doors and give you insights that a job description just can't.
✨Tip Number 2
Show off your skills! If you've got a portfolio or GitHub with projects related to Site Reliability Engineering, make sure to highlight them. Real-world examples of your work can set you apart from the crowd.
✨Tip Number 3
Prepare for the interview by brushing up on your troubleshooting and debugging skills. Be ready to discuss how you've tackled production issues in the past—real-life scenarios are what they want to hear!
✨Tip Number 4
Don't forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining the team at Obsidian.
We think you need these skills to ace Site Reliability Engineer in Manchester
Some tips for your application 🫡
Tailor Your CV:Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with AWS, Kubernetes, and any relevant monitoring tools. We want to see how your skills align with what we're looking for!
Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Share your passion for SaaS security and how your background makes you a great fit for our team. Let us know why you're excited about the opportunity at Obsidian.
Showcase Your Problem-Solving Skills:In your application, don’t forget to mention specific examples of how you've tackled reliability challenges in the past. We love seeing how you approach problem-solving and improve operational excellence.
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy!
How to prepare for a job interview at Obsidian Security
✨Know Your Tech Stack
Familiarise yourself with the technologies mentioned in the job description, like AWS, Kubernetes, and monitoring tools such as Prometheus or Grafana. Be ready to discuss your hands-on experience with these platforms and how you've used them to improve system reliability.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of how you've tackled production issues in the past. Highlight your troubleshooting and debugging skills, especially in distributed systems, and be ready to explain your thought process during incident responses.
✨Understand SLI/SLO Concepts
Make sure you grasp the concepts of Service Level Indicators (SLIs) and Service Level Objectives (SLOs). Be prepared to discuss how you've implemented these practices in previous roles and how they contribute to operational excellence.
✨Ask Insightful Questions
Prepare thoughtful questions about the company's approach to reliability engineering and incident management. This shows your genuine interest in the role and helps you assess if the company’s culture aligns with your values.