Site Reliability Engineer in Salford

Salford | Full-Time | 85,000 - 103,000 € / year (est.) | Home office (partial)
Obsidian Security

At a Glance

  • Tasks: Ensure reliability and scalability of a cutting-edge SaaS platform while collaborating with diverse teams.
  • Company: Join Obsidian Security, a leader in SaaS security backed by top investors.
  • Benefits: Enjoy competitive pay, flexible time off, and comprehensive healthcare benefits.
  • Other info: Dynamic work environment with opportunities for personal and professional growth.
  • Why this job: Tackle real-world challenges in cloud infrastructure and make a difference for global enterprises.
  • Qualifications: 2-5 years in Site Reliability Engineering or related fields; experience with AWS/GCP and Kubernetes.

The predicted salary is between 85,000 and 103,000 € per year.

Founded in 2017, Obsidian Security was created to close a critical gap: securing the SaaS applications where modern business happens—platforms like Microsoft 365, Salesforce, and hundreds more. Our team includes leaders who helped define the categories of endpoint and identity security at CrowdStrike, Okta, Cylance, and Carbon Black. Now, we’re transforming how SaaS is secured—in the era of agentic AI. Today, Obsidian is trusted by global enterprises like Snowflake, T‑Mobile, and Pure Storage. We protect more than 200 organisations across North America, Europe, the Middle East, Southeast Asia, Australia, and New Zealand—including many of the world’s largest Fortune 1000 and Global 2000 companies.

At Obsidian, our Site Reliability Engineers ensure the reliability, scalability, and operational excellence of a complex multi‑tenant SaaS platform serving enterprise and financial customers. As an SRE, you will work closely with DevOps, Platform Engineering, and product teams to improve system observability, incident response, and service resilience across the platform. This is a hands‑on engineering role focused on building operational excellence through monitoring, automation, debugging, and continuous improvement.

Key Responsibilities

  • Reliability Engineering: Improve the reliability, availability, and resiliency of Obsidian’s production systems and distributed services
  • Detection & Observability: Build and maintain monitoring, alerting, dashboards, and observability tooling to enhance system visibility and reduce operational noise
  • Incident Response & Operations: Support incident response, on‑call operations, troubleshooting, and postmortem processes to drive operational excellence
  • Collaboration: Partner with engineering teams to implement SLI/SLO practices, operational standards, and reliability‑focused workflows
  • Execution: Automate infrastructure operations, deployment workflows, and platform tooling across Kubernetes, cloud infrastructure, and data pipelines

Required Qualifications

  • 2–5 years of experience in Site Reliability Engineering, DevOps, Production Engineering, or related roles
  • Experience operating and supporting production systems in AWS and/or GCP
  • Familiarity with Kubernetes and Helm in cloud‑native environments
  • Experience with observability and monitoring tools such as Prometheus, Grafana, Datadog, or similar platforms
  • Exposure to CI/CD systems such as GitLab CI/CD, GitHub Actions, ArgoCD, or equivalent
  • Strong troubleshooting and debugging skills across distributed systems and microservices
  • Experience writing automation or infrastructure tooling using scripting or programming languages
  • Strong systems thinking and a collaborative engineering mindset

Preferred Qualifications

  • AI Agent development experience
  • Experience supporting SaaS platforms in production environments
  • Familiarity with incident management and postmortem practices
  • Exposure to infrastructure‑as‑code and GitOps workflows
  • Understanding of SLI/SLO concepts and operational metrics
  • Experience with enterprise‑scale monitoring or customer‑facing production systems

Why This Role

  • Work on reliability challenges across a large‑scale distributed SaaS platform
  • Build and improve observability and operational tooling used across engineering
  • Gain hands‑on experience with cloud infrastructure, Kubernetes, and production systems
  • Help safeguard critical services for enterprise and financial customers

What Success Looks Like

  • Production issues are detected and resolved quickly
  • Monitoring and alerting provide clear, actionable operational insights
  • Reliability metrics and operational practices improve over time
  • Engineering teams can effectively troubleshoot and self‑serve observability
  • Automation reduces operational toil and improves platform stability

Employee Benefits

  • Competitive compensation with equity and 401(k)
  • Comprehensive healthcare with dental and vision coverage
  • Flexible paid time off and paid holiday time off
  • 12 weeks of new parent or family leave
  • Personal and professional development resources

Pay Transparency: The base pay range is a guideline. For candidates who receive an offer, base pay will vary based on factors such as work location and the candidate's knowledge, skills, and experience. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation, depending on the role or function within the company.

At Obsidian, we are proud to be an equal‑opportunity employer. We value diversity and hire for talent, passion, and compassion.

Employer: Obsidian Security

At Obsidian Security, we pride ourselves on being an exceptional employer that fosters a culture of innovation and collaboration. Our Site Reliability Engineers play a crucial role in ensuring the reliability of our cutting-edge SaaS platform, with ample opportunities for personal and professional growth in a dynamic environment. With competitive compensation, comprehensive healthcare benefits, and a commitment to diversity and inclusion, we empower our employees to thrive while safeguarding critical services for global enterprises.

Contact Details:

Obsidian Security Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land the Site Reliability Engineer role in Salford

Tip Number 1

Network like a pro! Reach out to current employees at Obsidian on LinkedIn or through mutual connections. A friendly chat can give you insider info and might just get your foot in the door.

Tip Number 2

Show off your skills! Prepare a mini-project or a case study that highlights your experience with reliability engineering, Kubernetes, or monitoring tools. This hands-on demonstration can really set you apart during interviews.

Tip Number 3

Be ready for technical challenges! Brush up on your troubleshooting and debugging skills, as you might face real-world scenarios during the interview process. Practice makes perfect!

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Obsidian team.

We think you need these skills to ace the Site Reliability Engineer role in Salford

Site Reliability Engineering
DevOps
Production Engineering
AWS
GCP
Kubernetes
Helm

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with AWS, GCP, and Kubernetes, and don’t forget to mention any relevant tools like Prometheus or Grafana. We want to see how your skills align with what we’re looking for!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about SaaS security and how your background makes you a great fit for our team. Keep it concise but engaging, and show us your personality!

Showcase Your Problem-Solving Skills: In your application, give examples of how you've tackled challenges in previous roles. Whether it’s improving system reliability or automating processes, we love to see how you think critically and act decisively.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, you’ll find all the details about the role and our company culture there!

How to prepare for a job interview at Obsidian Security

Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, like AWS, GCP, Kubernetes, and monitoring tools. Brush up on your experience with these platforms and be ready to discuss specific projects where you’ve used them.

Showcase Your Problem-Solving Skills

Prepare to share examples of how you've tackled production issues in the past. Think about incidents you've managed, what steps you took to resolve them, and how you improved processes afterwards. This will demonstrate your hands-on experience and troubleshooting abilities.

Understand SLI/SLO Concepts

Since this role focuses on reliability, make sure you can clearly explain SLIs (Service Level Indicators, the measured signals of service health) and SLOs (Service Level Objectives, the targets set for those signals). Be prepared to discuss how you’ve implemented these practices in previous roles and their impact on operational excellence.
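If it helps to have a concrete example in mind, an availability SLI and its error budget could be worked through along these lines. This is only a minimal sketch for interview practice: the names `evaluate_slo` and `SloReport`, the request counts, and the 99.9% target are all illustrative assumptions, not anything taken from the job description or from Obsidian's tooling.

```python
from dataclasses import dataclass


@dataclass
class SloReport:
    sli: float              # measured fraction of good requests
    allowed_bad: float      # bad requests the SLO tolerates in this window
    budget_consumed: float  # share of the error budget already used (1.0 = all of it)
    slo_met: bool           # True when the measured SLI is at or above the target


def evaluate_slo(good: int, total: int, slo_target: float) -> SloReport:
    """Compare a request-based availability SLI against an SLO target.

    `good` and `total` are request counts over the SLO window (hypothetical
    inputs); `slo_target` is a fraction such as 0.999 for "99.9%".
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= good <= total:
        raise ValueError("good must be between 0 and total")
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be strictly between 0 and 1")

    bad = total - good
    sli = good / total
    # The error budget is the fraction of requests allowed to fail.
    allowed_bad = (1 - slo_target) * total
    budget_consumed = bad / allowed_bad
    return SloReport(sli, allowed_bad, budget_consumed, sli >= slo_target)


if __name__ == "__main__":
    # Illustrative numbers: 400 failed requests out of one million.
    report = evaluate_slo(good=999_600, total=1_000_000, slo_target=0.999)
    print(f"SLI: {report.sli:.4%}")
    print(f"Error budget used: {report.budget_consumed:.0%}")
    print(f"SLO met: {report.slo_met}")
```

Being able to talk through a calculation like this, and what you would do as the budget runs down (slow releases, prioritise reliability work), tends to land better than reciting the definitions alone.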

Ask Insightful Questions

At the end of the interview, don’t forget to ask questions that show your interest in the company’s future and its approach to SaaS security. Inquire about their current challenges in reliability engineering or how they envision the role evolving with AI advancements.