Staff Site Reliability Engineer in Manchester
Staff Site Reliability Engineer

Staff Site Reliability Engineer in Manchester

Manchester Full-Time 124000 - 141000 ÂŁ / year (est.) Home office (partial)
Menlo Ventures

At a Glance

  • Tasks: Lead the reliability vision for a complex SaaS platform and ensure system issues are detected early.
  • Company: Join Obsidian, a forward-thinking tech company focused on reliability and innovation.
  • Benefits: Competitive salary, equity options, comprehensive healthcare, and flexible time off.
  • Other info: Dynamic work environment with opportunities for personal and professional growth.
  • Why this job: Shape the future of reliability in tech and safeguard critical infrastructure for enterprise clients.
  • Qualifications: 5+ years in SRE or Production Engineering with strong leadership and technical skills.

The predicted salary is between 124000 - 141000 ÂŁ per year.

As a Staff SRE at Obsidian, you will define and drive the company-wide reliability vision for a complex, multi-tenant SaaS platform serving enterprise and financial customers. You will operate as a strategic partner to DevOps and Platform Engineering leadership, shaping a unified reliability strategy that scales across the organization.

Your core mandate: ensure Obsidian detects, diagnoses, and communicates system issues before customers are impacted—consistently and predictably. This is a hands-on technical role that involves architecting and leading the implementation of systems that handle real-world complexity, including upstream SaaS dependencies, sparse and noisy signals, and mission-critical enterprise workloads.

Key Responsibilities
  • Reliability Strategy & Architecture - Define and lead long-term reliability strategy across services. Establish end-to-end system visibility frameworks and guide architecture for observability, detection, and resilience.
  • Cross-Org Leadership - Partner across teams to embed reliability, standardize SLI/SLOs, and serve as a technical escalation expert.
  • Detection & Observability - Build intelligent detection systems (anomaly detection, connector health models) and enable self-service observability.
  • Incident Management - Define and evolve a tiered incident communication strategy, improve response practices, and lead postmortems to strengthen reliability and customer trust.
  • Execution - Contribute hands‑on to system design, monitoring, and debugging across distributed systems and data pipelines.
Required Qualifications
  • 5+ years in SRE, Production Engineering, or related roles
  • 3+ years operating at a senior or technical leadership level (Staff or equivalent scope)
  • Deep expertise in:
  • AWS and/or GCP
  • Kubernetes and Helm
  • Observability stacks (Prometheus, Grafana, or equivalent)
  • CI/CD systems (GitLab CI/CD, ArgoCD, etc.)
  • Proven experience designing and scaling reliability systems for multi-tenant SaaS platforms
  • Strong debugging and systems thinking across distributed microservices and legacy systems
  • Demonstrated ability to lead initiatives that improve incident detection, response, and system resilience
  • Hands‑on engineering approach with a track record of building—not just configuring—reliability systems
  • Preferred Qualifications
    • Experience in B2B SaaS serving enterprise or financial customers
    • Familiarity with third‑party SaaS connector architectures and ingestion patterns
    • Experience building anomaly detection or intelligent alerting systems
    • Experience designing customer‑facing status pages and incident communication frameworks
    Why This Role
    • Drive org-wide reliability strategy
    • Own and build new detection & observability systems
    • Tackle complex distributed systems challenges
    • Safeguard critical infrastructure for financial customers
    What Success Looks Like
    • Issues caught and resolved before customer impact
    • Reliability is measurable and continuously improving
    • Teams self‑serve observability with scalable tools
    • Clear, proactive incident communication builds trust
    • Reliability becomes a competitive advantage
    Employee Benefits
    • Competitive compensation with equity and 401k
    • Comprehensive healthcare with dental and vision coverage
    • Flexible paid time off and paid holiday time off
    • 12 weeks of new parent or family leave
    • Personal and professional development resources

    For more details on our US benefits, or for information on our international benefits, please see here.

    Pay Transparency

    Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location, as well as the knowledge, skills and experience of the candidate. In addition to a competitive base salary, this position is eligible for equity awards and may be eligible for sales commission or incentive compensation based on the role or function within the company.

    Base Salary Range £124,000 — £141,000 GBP

    At Obsidian, we are proud to be an equal‑opportunity employer. We value diversity and hire for talent, passion, and compassion. In compliance with federal law, all persons hired will be required to submit satisfactory proof of identity and legal authorization. If you have a need that requires accommodation, please contact accommodations@obsidiansecurity.com.

    Information collected and processed as part of any job applications you choose to submit is subject to Obsidian’s Applicant Privacy Policy.

    Staff Site Reliability Engineer in Manchester employer: Menlo Ventures

    At Obsidian, we pride ourselves on fostering a dynamic and inclusive work culture that empowers our employees to thrive. As a Staff Site Reliability Engineer, you will not only play a pivotal role in shaping our reliability strategy but also benefit from competitive compensation, comprehensive healthcare, and ample opportunities for personal and professional development. Our commitment to innovation and collaboration ensures that you will be part of a team that values your contributions and supports your growth in a fast-paced, multi-tenant SaaS environment.
    Menlo Ventures

    Contact Detail:

    Menlo Ventures Recruiting Team

    StudySmarter Expert Advice 🤫

    We think this is how you could land Staff Site Reliability Engineer in Manchester

    ✨Tip Number 1

    Network like a pro! Reach out to folks in the industry, especially those already at Obsidian. A friendly chat can give you insights and maybe even a referral, which is always a bonus.

    ✨Tip Number 2

    Show off your skills! Prepare a portfolio or case studies that highlight your experience with reliability systems, especially in multi-tenant SaaS platforms. This will help us see how you tackle real-world challenges.

    ✨Tip Number 3

    Get ready for the interview! Brush up on your knowledge of AWS, GCP, Kubernetes, and observability stacks. We want to see your hands-on experience and how you approach problem-solving in complex systems.

    ✨Tip Number 4

    Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Obsidian.

    We think you need these skills to ace Staff Site Reliability Engineer in Manchester

    Reliability Strategy
    System Architecture
    Observability Frameworks
    Anomaly Detection
    Incident Management
    AWS
    GCP
    Kubernetes
    Helm
    Prometheus
    Grafana
    CI/CD Systems
    Debugging
    Systems Thinking
    Technical Leadership

    Some tips for your application 🫡

    Tailor Your Application: Make sure to customise your CV and cover letter for the Staff SRE role. Highlight your experience with AWS, GCP, and Kubernetes, as well as any relevant projects that showcase your reliability strategy skills.

    Showcase Your Technical Skills: We want to see your hands-on experience! Include specific examples of how you've built or improved reliability systems in previous roles. Mention tools like Prometheus and Grafana to demonstrate your expertise.

    Communicate Clearly: Your written application should reflect your ability to communicate complex ideas simply. Use clear language and structure your thoughts logically, just like you would when defining incident communication strategies.

    Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensure you’re considered for this exciting opportunity at Obsidian.

    How to prepare for a job interview at Menlo Ventures

    ✨Know Your Reliability Strategies

    Before the interview, brush up on your understanding of reliability strategies, especially in a multi-tenant SaaS environment. Be ready to discuss how you would define and implement a reliability vision at Obsidian, showcasing your experience with SLI/SLOs and incident management.

    ✨Showcase Your Technical Skills

    Prepare to dive deep into your technical expertise, particularly with AWS, GCP, Kubernetes, and observability stacks like Prometheus and Grafana. Have specific examples ready that demonstrate your hands-on experience in building and scaling reliability systems.

    ✨Cross-Org Collaboration is Key

    Highlight your experience in partnering with different teams to embed reliability practices. Think of examples where you've successfully led cross-functional initiatives, and be prepared to discuss how you can foster collaboration at Obsidian.

    ✨Communicate Clearly and Proactively

    Since incident communication is crucial for this role, practice articulating your thoughts clearly. Prepare to discuss how you would evolve incident communication strategies and build trust with customers through proactive updates during incidents.

    Staff Site Reliability Engineer in Manchester
    Menlo Ventures
    Location: Manchester

    Land your dream job quicker with Premium

    You’re marked as a top applicant with our partner companies
    Individual CV and cover letter feedback including tailoring to specific job roles
    Be among the first applications for new jobs with our AI application
    1:1 support and career advice from our career coaches
    Go Premium

    Money-back if you don't land a job in 6-months

    >