Senior Site Reliability Engineer in London
Senior Site Reliability Engineer

Senior Site Reliability Engineer in London

London Full-Time 43200 - 72000 £ / year (est.) No home office possible
Go Premium
B

At a Glance

  • Tasks: Oversee service lifecycle, improve reliability, and lead a dynamic team.
  • Company: Blip, a leading tech company in sports entertainment under Flutter Entertainment.
  • Benefits: Inclusive culture, flexible work environment, and opportunities for professional growth.
  • Why this job: Join a passionate team and make a real impact in the tech world.
  • Qualifications: Experience in team management, SRE principles, and strong technical skills.
  • Other info: Diverse workplace encouraging unique perspectives and ideas.

The predicted salary is between 43200 - 72000 £ per year.

Blip is a leading tech company focused on software engineering solutions for sports entertainment. We operate at scale. As part of Flutter Entertainment, we play an essential role in the Group's goal of becoming the global leader in online sports betting and iGaming, developing innovative products and platforms for over 14 million monthly customers worldwide. We are serious about Tech. We are problem-solvers with big ambitions, keeping a people-first mindset at the core of our work. We prioritize flexibility as we strive to deliver the best technological products and tackle the greatest industry challenges. Recognizing that everyone brings their own strengths, backgrounds and new perspectives, we empower you to be yourself. That uniqueness shapes the culture of belonging we are so proud of.

The Role

We are seeking a motivated and experienced senior engineer to join our dynamic organisation. As a Senior Site Reliability Engineer in our UK&I division, you will be responsible for overseeing a group of employees, providing direction and support to ensure goals are met and operations run smoothly. If you have a strong background in team management and are ready to take on a new challenge, we want to hear from you. Come be a part of our team and make a positive impact on our organisation's success.

What You'll Be Doing

  • Engage in and improve the whole lifecycle of services—from design, deployment, operation, and refinement.
  • Take an active part in production problems root cause investigation, identification, and resolution (where necessary).
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
  • Be an active part of performance and capacity testing.
  • Optimize reliability monitoring & alerting.
  • Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity.
  • Iteratively perform Auditing of performance and reliability vulnerabilities.
  • Define and revise Service Level Indicators (SLIs).
  • Practice sustainable incident response and blameless postmortems.

What You'll Bring

  • Deep familiarity building and troubleshooting release and build pipelines (ex Jenkins, buildkite, GitHub actions).
  • Experience implementing creative approach in monitoring distributed systems while leveraging industry best practices (ex instrumenting tagging taxonomy across disparate systems).
  • Experience building, managing, and deploying an application utilizing containerized microservices, in a distributed infrastructure (ex AWS, GCP, self hosted cloud).
  • Experience leveraging new technologies when it best serves a business need.
  • Comprehensive understanding of incident management best practices.
  • Opinionated and knowledgeable approach for implementing industry best practices.
  • Demonstrated experience developing teams, encouraging growth, serving as a technical mentor and leader.
  • Shows strength and comprehension in at least one programming language (ex. Java, Python, Scala, Kotlin).
  • Experience making large directional technical decisions (ex. Deciding which technology, or pattern to create or leverage).
  • Experience being 'on-call' for a service, and familiarity with incident notification tooling (ex. Pagerduty, Opsgenie).
  • Comprehensive understanding of SRE principles (ex. Working knowledge of the Google SRE book).
  • Demonstrated strength in leading a project in an agile/scrum environment.
  • Thrives in a diverse work environment.

We'd Like You To Master In

  • Experience managing complex telemetry solutions which directly contributed to overall reliability.
  • Design greenfield solutions leveraging Configuration Management/Infrastructure as Code tools (ex. Chef, puppet, Terraform).
  • Create automated tooling that contributed to multiple teams' velocity.
  • Demonstrated experience with project management best practices.
  • Shows the ability to break down large technical concepts into effective communication with stakeholders from across the organization.
  • Extensive knowledge of networking best practices, tools, and observability.
  • Experience developing and deploying automated service configuration at the edge (ex. CDN configuration, certificate renewal).
  • Work consulting with a team being able to advise on their technology, workflows, dev tooling, monitoring, alerting best practices.
  • Identified need for and lead development of automation that significantly reduced toil (ex Deployment pipelines, distributed dev environments).
  • Built and maintained a system and culture that supported and implemented SLOs.
  • Has shown to be a thought leader contributing to the broader industry conversation about SRE principles and topics (ex. Speaking at conferences).

We are committed to creating a diverse and inclusive workplace. We strongly encourage people from all backgrounds, ways of thinking, and working to apply. We are committed to including everyone regardless of their race, disability, age, gender identity, sexual orientation, and religion. Everyone brings different perspectives and experiences; you don't have to meet all the requirements listed to apply for this role.

If you need any adjustments to apply for the position and to ensure this role aligns with your needs, please send an email to accommodations@blip.pt. We will only respond to inquiries related to disabilities.

Senior Site Reliability Engineer in London employer: Blip

Blip is an exceptional employer that prioritises a people-first mindset, fostering a culture of belonging and innovation within the tech industry. Located in the vibrant UK&I division, employees benefit from flexible working arrangements, opportunities for professional growth, and the chance to make a significant impact on the future of online sports betting and iGaming. With a commitment to diversity and inclusion, Blip empowers its team members to bring their unique perspectives to the table, ensuring a dynamic and supportive work environment.
B

Contact Detail:

Blip Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Senior Site Reliability Engineer in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with current Blip employees on LinkedIn. A personal introduction can make all the difference when it comes to landing that interview.

✨Tip Number 2

Show off your skills! Prepare a portfolio or a GitHub repository showcasing your projects, especially those related to site reliability engineering. This gives you a chance to demonstrate your expertise beyond just words on a CV.

✨Tip Number 3

Ace the interview by practising common SRE scenarios. Think about how you would handle incidents, improve system reliability, and lead a team. We want to see your problem-solving skills in action!

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining our awesome team at Blip.

We think you need these skills to ace Senior Site Reliability Engineer in London

Team Management
Root Cause Investigation
System Design Consulting
Capacity Planning
Performance Testing
Reliability Monitoring
Automation
Incident Management
Containerized Microservices
AWS
GCP
Programming Languages (Java, Python, Scala, Kotlin)
Agile/Scrum Methodologies
Configuration Management/Infrastructure as Code (Chef, Puppet, Terraform)
Project Management Best Practices
Networking Best Practices

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that match the Senior Site Reliability Engineer role. Highlight your experience with monitoring distributed systems and managing complex telemetry solutions, as these are key to what we’re looking for.

Craft a Compelling Cover Letter: Use your cover letter to tell us why you’re passionate about tech and how your unique background can contribute to our team. Share specific examples of your leadership in agile environments and your approach to incident management.

Showcase Your Technical Skills: Don’t hold back on showcasing your technical prowess! Mention your familiarity with tools like Jenkins, AWS, or Terraform, and any programming languages you excel in. We want to see how you can bring your expertise to our dynamic organisation.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen to join our team at Blip!

How to prepare for a job interview at Blip

✨Know Your Tech Inside Out

Make sure you’re well-versed in the technologies mentioned in the job description, like Jenkins, AWS, and containerized microservices. Brush up on your programming skills in languages like Java or Python, as you might be asked to demonstrate your knowledge during the interview.

✨Showcase Your Leadership Skills

As a Senior Site Reliability Engineer, you'll need to lead a team. Prepare examples of how you've successfully managed teams, mentored others, and made significant technical decisions. Highlight your experience in agile environments and how you’ve contributed to team growth.

✨Prepare for Problem-Solving Scenarios

Expect to tackle real-world problems during your interview. Be ready to discuss past incidents you've managed, how you approached root cause analysis, and the steps you took to resolve issues. This will showcase your incident management skills and your ability to think on your feet.

✨Emphasise Your People-First Mindset

Blip values a people-first approach, so be prepared to discuss how you foster a positive team culture. Share experiences where you’ve encouraged diversity and inclusion within your team, and how you’ve supported your colleagues in their professional development.

Senior Site Reliability Engineer in London
Blip
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

B
  • Senior Site Reliability Engineer in London

    London
    Full-Time
    43200 - 72000 £ / year (est.)
  • B

    Blip

    50-100
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>