Lead Site Reliability Engineer

London | Full-Time | £43,200 - £72,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Design and build tools to solve complex problems and ensure system reliability.
  • Company: Bumble Inc. connects people through dating, friendship, and professional networking with a focus on healthy relationships.
  • Benefits: Enjoy a diverse and inclusive workplace with opportunities for growth and collaboration.
  • Why this job: Join a team that values problem-solving and continuous learning in a supportive environment.
  • Qualifications: Proficiency in Python or Golang, experience with CI/CD, Kubernetes, and cloud architectures required.
  • Other info: We encourage applicants from all backgrounds and are committed to making reasonable adjustments.

The predicted salary is between £43,200 and £72,000 per year.

Inclusion at Bumble Inc.

Bumble Inc. is an equal opportunity employer and we strongly encourage people of all ages and colours, lesbian, gay, bisexual, transgender, queer and non-binary people, veterans, parents, people with disabilities, and neurodivergent people to apply. We’re happy to make any reasonable adjustments that will help you feel more confident throughout the process. Please don’t hesitate to let us know how we can help.

In your application, please feel free to note which pronouns you use (for example: she/her, he/him, they/them, etc.).

At Bumble, Site Reliability Engineers (SREs) are responsible for ensuring the reliability, scalability and performance of software systems while bridging the gap between development, security and operations.

We proactively manage, automate, and safeguard our infrastructure to deliver a robust foundation for the business and an exceptional experience for our stakeholders.

What you’ll be doing

  • Design and build new tools and services from the ground up to solve complex problems
  • Build automation frameworks to streamline repetitive tasks
  • Design and maintain scalable, highly available and fault-tolerant systems
  • Build and maintain observability tooling including logging, monitoring, tracing and alerting systems
  • Develop and maintain automation tooling to reduce manual intervention
  • Implement infrastructure as code (IaC) for infrastructure provisioning
  • Monitor system health and performance, identifying and fixing issues
  • Respond to system outages, troubleshooting root causes and implementing preventative measures
  • Collaborate with engineering teams and security engineers to improve system reliability, security and performance
  • Participate in on-call rotations
  • Create and maintain documentation to improve knowledge sharing across teams

About you

  • Excellent problem-solving and analytical skills
  • Strong communication and collaboration skills are a must
  • Proficiency in at least one of Python or Golang
  • Experience with CI/CD pipelines
  • Strong proficiency with Kubernetes architecture
  • Prior experience in SRE, system administration or DevOps roles
  • Strong proficiency with Linux/Unix operating systems, including hands-on experience in configuration and troubleshooting
  • Proficiency in using Puppet for configuration management, automation and system provisioning
  • Hands-on experience in monitoring and observability platforms such as Grafana, Prometheus, Elasticsearch, Jaeger
  • Experience with cloud architectures such as GCP or AWS
  • Familiarity with SQL databases and broker systems such as Kafka
  • You are a solution-oriented professional with a passion for problem-solving
  • You take pride in ensuring systems are performant, stable and efficient
  • You thrive in a collaborative environment
  • Continuous learning is important to you and you actively explore new tools and techniques
  • You are curiosity-driven and constantly seek new ways to improve processes and implement modern solutions
  • You are committed to ensuring quality is at the heart of every project

About Us

Bumble Inc. is the parent company of Bumble, Badoo, Fruitz and Official. The Bumble platform enables people to build healthy and equitable relationships, through kind connections. Founded by Whitney Wolfe Herd in 2014, Bumble was one of the first dating apps built with women at the centre and connects people across dating (Bumble Date), friendship (Bumble BFF) and professional networking (Bumble Bizz). Badoo, which was founded in 2006, is one of the pioneers of web and mobile dating products. Fruitz, founded in 2017, encourages open and honest communication of dating intentions through playful fruit metaphors. Official is an app for couples that promotes open and honest communication between partners and was founded in 2020.

Lead Site Reliability Engineer employer: Bumble

At Bumble Inc., we pride ourselves on fostering an inclusive and collaborative work environment where every voice is valued. As a Lead Site Reliability Engineer, you'll not only have the opportunity to work with cutting-edge technologies but also benefit from our commitment to continuous learning and professional growth. Our culture emphasizes innovation and teamwork, ensuring that you can thrive while contributing to meaningful projects that enhance the way people connect.

Contact Details:

Bumble Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Lead Site Reliability Engineer role

✨Tip Number 1

Familiarize yourself with the specific tools and technologies mentioned in the job description, such as Kubernetes, Python, and CI/CD pipelines. Having hands-on experience or projects that showcase your skills with these technologies can set you apart.

✨Tip Number 2

Highlight your problem-solving abilities by preparing examples of past challenges you've faced in SRE or DevOps roles. Be ready to discuss how you approached these issues and the impact of your solutions.

✨Tip Number 3

Since collaboration is key for this role, think about instances where you've successfully worked with cross-functional teams. Be prepared to share these experiences during discussions to demonstrate your strong communication skills.

✨Tip Number 4

Stay updated on the latest trends and best practices in site reliability engineering. Showing your commitment to continuous learning and curiosity about new tools can impress the hiring team and align with Bumble's values.

We think you need these skills to ace the Lead Site Reliability Engineer role

Problem Solving
Analytical Skills
Strong Communication Skills
Collaboration Skills
Proficiency in Python or Golang
Experience with CI/CD Pipelines
Strong Proficiency with Kubernetes
System Administration Experience
DevOps Experience
Proficiency with Linux/Unix
Hands-on Experience with Puppet
Monitoring and Observability Tools (Grafana, Prometheus, Elasticsearch, Jaeger)
Experience with Cloud Architectures (GCP, AWS)
Familiarity with SQL Databases
Knowledge of Broker Systems (Kafka)
Continuous Learning Mindset
Curiosity-Driven Approach
Commitment to Quality

Some tips for your application 🫡

Highlight Relevant Experience: Make sure to emphasize your experience in Site Reliability Engineering, DevOps, or System Administration. Mention specific projects where you designed and built tools or services, and detail your proficiency with Kubernetes, Python, or Golang.

Showcase Problem-Solving Skills: Bumble values problem-solving abilities. Include examples of complex issues you've resolved in previous roles, particularly those related to system performance, reliability, or automation.

Communicate Your Collaboration Skills: Since collaboration is key at Bumble, describe instances where you've worked effectively with engineering teams or security engineers. Highlight your communication skills and how they contributed to successful project outcomes.

Express Your Commitment to Continuous Learning: Mention any recent tools, techniques, or technologies you've explored or learned about. This shows your curiosity-driven mindset and commitment to staying updated in the field, which aligns with Bumble's values.

How to prepare for a job interview at Bumble

✨Showcase Your Problem-Solving Skills

As a Lead Site Reliability Engineer, you'll need to demonstrate your excellent problem-solving and analytical skills. Prepare examples from your past experiences where you successfully identified and resolved complex issues, particularly in system reliability or performance.

✨Highlight Your Collaboration Experience

Strong communication and collaboration skills are essential for this role. Be ready to discuss how you've worked with cross-functional teams, especially with engineering and security teams, to improve system reliability and performance.

✨Demonstrate Technical Proficiency

Make sure to highlight your proficiency in Python or Golang, as well as your experience with CI/CD pipelines and Kubernetes architecture. Be prepared to discuss specific projects where you utilized these technologies effectively.

✨Emphasize Continuous Learning

Bumble values curiosity and continuous learning. Share how you stay updated with new tools and techniques in the field of Site Reliability Engineering, and mention any recent projects or certifications that reflect your commitment to professional growth.
