Senior Site Reliability Engineer
Senior Site Reliability Engineer

Senior Site Reliability Engineer

London Full-Time 43200 - 72000 £ / year (est.) No home office possible
R

At a Glance

  • Tasks: Join Reddit as a Senior Site Reliability Engineer, enhancing system reliability and performance.
  • Company: Reddit is a vibrant community platform with over 101M daily active users.
  • Benefits: Enjoy flexible vacation, private medical care, and a supportive work environment.
  • Why this job: Make a real impact on one of the internet's largest platforms while collaborating with innovative teams.
  • Qualifications: 5+ years in software or site reliability engineering; proficiency in Go or Python required.
  • Other info: Reddit values diversity and offers accommodations for individuals with disabilities.

The predicted salary is between 43200 - 72000 £ per year.

Reddit is a community of communities, built on shared interests, passion, and trust. It is home to the most open and authentic conversations on the internet, with over 100,000 active communities and approximately 101M+ daily active unique visitors. Reddit SRE is rapidly innovating, working to meet the needs of infrastructure and development teams as they evolve our product faster than ever before.

As a Senior Site Reliability Engineer on Reddit’s Infrastructure SRE team, you will use your knowledge of distributed systems and architecture to improve the reliability and performance of Reddit’s engineering platforms and services. This role involves close collaboration with the Compute, Traffic, and Observability infrastructure teams, owning a suite of tools for engineers to understand their creations, primarily based on open-source solutions at scale.

Your responsibilities will include:

  • Advise: Work closely with engineering teams in designing and developing resilient and highly performant systems, maintaining the foundational platform for running Reddit’s infrastructure.
  • Amplify: Identify and build capabilities into foundational Infrastructure and Platform services used by Reddit engineering teams. Deliver software to improve availability, scalability, latency, and efficiency of observability components.
  • Automate: Automate repetitive, manual, or risky tasks. Build tools and integrate systems to support Reddit’s evolution.
  • Diagnose: Use your knowledge of distributed systems to identify and fix network, system, and service-level issues. Practice sustainable incident response and drive structural improvement with blameless postmortems.
  • Optimize: Observe and improve performance, reduce costs, and enhance user experience. Contribute upstream changes to the open-source projects we use.

Qualifications:

  • 5+ years of experience in Software Engineering, Site Reliability Engineering, or a development-focused DevOps role.
  • Proficiency in one or more programming languages, predominantly Go and Python.
  • Experience with Kubernetes and Cloud systems.
  • Familiarity with distributed systems development, with bonus familiarity with tools like Prometheus, Thanos, Grafana, Vector, Clickhouse, Otel, Loki.
  • Experience with the development and operation of high-traffic backend systems.
  • Demonstrated ability to debug, fix, and optimize code.
  • Troubleshooting skills spanning applications, networking (TCP/IP), and systems.
  • Strong working knowledge of Linux and containers.
  • Excellent communication and collaborative skills.

Benefits:

  • Pension Scheme
  • Private Medical and Dental Scheme
  • Life Assurance, Income Protection
  • Workspace benefit for your home office
  • Family Planning Support
  • Flexible Vacation & Reddit Global Days Off

Reddit is proud to be an equal opportunity employer, committed to building a workforce representative of the diverse communities we serve. We provide reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures.

Senior Site Reliability Engineer employer: Reddit, Inc.

Reddit is an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration among its teams. As a Senior Site Reliability Engineer, you'll benefit from a comprehensive range of perks including a robust pension scheme, private medical and dental coverage, and flexible vacation options, all while working in a vibrant environment that values diversity and personal growth. With opportunities to contribute to open-source projects and the chance to make a significant impact on one of the internet's largest platforms, Reddit provides a fulfilling and rewarding career path for those passionate about technology and community.
R

Contact Detail:

Reddit, Inc. Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Senior Site Reliability Engineer

✨Tip Number 1

Familiarise yourself with the specific tools mentioned in the job description, such as Prometheus, Thanos, and Grafana. Having hands-on experience or contributing to these open-source projects can set you apart from other candidates.

✨Tip Number 2

Network with current or former Reddit employees on platforms like LinkedIn. Engaging in conversations about their experiences can provide valuable insights into the company culture and expectations, which you can leverage during interviews.

✨Tip Number 3

Prepare to discuss your experience with distributed systems and how you've tackled reliability issues in past roles. Be ready to share specific examples that demonstrate your problem-solving skills and ability to optimise performance.

✨Tip Number 4

Showcase your collaborative skills by highlighting any cross-functional projects you've worked on. Emphasising your ability to work closely with engineering teams will resonate well with the role's requirements.

We think you need these skills to ace Senior Site Reliability Engineer

Distributed Systems Knowledge
Proficiency in Go and Python
Kubernetes Expertise
Cloud Systems Familiarity
Experience with Prometheus, Thanos, Grafana, Vector
High-Traffic Backend Systems Development
Debugging and Code Optimization Skills
Networking Troubleshooting (TCP/IP)
Strong Linux Knowledge
Containerisation Skills
Incident Response Practices
Automation of Repetitive Tasks
Collaboration and Communication Skills
Risk Management and Mitigation

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in Software Engineering and Site Reliability Engineering. Emphasise your proficiency in programming languages like Go and Python, as well as your familiarity with tools such as Prometheus and Kubernetes.

Craft a Compelling Cover Letter: In your cover letter, express your passion for Reddit and its mission. Discuss how your skills align with the responsibilities of the Senior Site Reliability Engineer role, particularly in automating processes and optimising system performance.

Showcase Relevant Projects: Include specific examples of projects where you have improved system reliability or performance. Highlight any contributions to open-source projects, especially those related to the tools mentioned in the job description.

Prepare for Technical Questions: Be ready to discuss your experience with distributed systems and troubleshooting. Prepare to explain how you've handled incidents in the past and the steps you took to improve system resilience.

How to prepare for a job interview at Reddit, Inc.

✨Showcase Your Technical Skills

Be prepared to discuss your experience with distributed systems, Kubernetes, and the specific tools mentioned in the job description like Prometheus and Grafana. Highlight any projects where you've successfully implemented these technologies.

✨Demonstrate Problem-Solving Abilities

Expect to be asked about past challenges you've faced in site reliability or software engineering. Use the STAR method (Situation, Task, Action, Result) to structure your answers and showcase how you diagnosed and resolved issues.

✨Emphasise Collaboration

Since the role involves working closely with cross-functional teams, be ready to share examples of how you've collaborated with others in previous roles. Discuss how you communicate technical concepts to non-technical stakeholders.

✨Prepare for Cultural Fit Questions

Reddit values community and open conversations. Be ready to discuss how your personal values align with Reddit's mission and culture. Think about how you can contribute to a positive team environment and support the company's goals.

Senior Site Reliability Engineer
Reddit, Inc.
R
  • Senior Site Reliability Engineer

    London
    Full-Time
    43200 - 72000 £ / year (est.)

    Application deadline: 2027-05-25

  • R

    Reddit, Inc.

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>