Senior Site Reliability Engineer - Edge in London

Senior Site Reliability Engineer - Edge in London

London Full-Time 60000 - 84000 £ / year (est.) No working from home possible
O

At a Glance

  • Tasks: Own the architecture and performance of our global entry points, ensuring high availability and security.
  • Company: Join a dynamic tech company focused on innovation and athlete experience.
  • Benefits: Enjoy a supportive environment with opportunities for personal growth and well-being.
  • Other info: Collaborate with a global team dedicated to building the future of technology.
  • Why this job: Make a real impact by optimising cutting-edge technology for athletes worldwide.
  • Qualifications: Expertise in networking, CDN management, and modern security protocols required.

The predicted salary is between 60000 - 84000 £ per year.

In the dynamic landscape of On, our technology thrives much like a spirited runner: always moving, always improving. We are building the foundation that allows our engineering organization to scale, innovate, and deliver "Wow" to athletes worldwide. To power this mission, we are seeking a Senior Site Reliability Engineer (SRE) - Edge who understands that reliability, security, and performance start at the transport layer.

You won’t just manage a CDN; you will own the architecture and performance of our global entry points, including our Apollo GraphQL API Gateway. You will leverage expert-level knowledge of HTTP/S, TCP, and DNS to optimize for global throughput. This is a hands-on senior role where you will troubleshoot advanced network bottlenecks, design our future content delivery strategy, and act as the technical authority for our Web Application Firewall (WAF), bot mitigation, and standardized service authentication.

Your Mission

  • Edge Architecture & API Gateway: Ensure high availability (99.95%+ uptime) for On’s digital platforms and our central Apollo GraphQL Gateway. You will design the "front door" of our infrastructure to be elastic, handling the unique scaling demands of both static web assets and complex federated API traffic.
  • Traffic Engineering & Segmentation: Lead the strategic roadmap for our CDN (Cloudflare) and networking stack. You will distinguish between the needs of customer-facing web applications and internal service-to-service communication, implementing optimized routing for each.
  • Environment Isolation & Security: Implement and maintain robust guardrails to protect our internal ecosystem. You will be responsible for restricting pre-production environments (e.g., Staging, QA) from the public internet using Zero Trust models, IP-based access controls, or OIDC-integrated tunnels.
  • Standardized Auth & Access: Drive the standardization of authentication and authorization at the edge. You will ensure that every request entering our network is consistently validated, providing a secure and seamless identity layer for all microservices.
  • Advanced Troubleshooting: Serve as the organization's "Level 3" expert for complex network traffic analysis. You are the one who dives into packet captures, TLS handshakes, and Apollo query latencies to find the root cause of global performance regressions.
  • Shielding the Origin: Take full ownership of our WAF and Bot Management strategy. You will design and implement measures to protect our services from DDoS attacks and malicious actors without impacting the legitimate athlete experience.
  • Infrastructure as Code (IaC): Treat the network and the gateway as code. You will manage edge configurations and gateway routing using Terraform, ensuring our security rules and routing logic are versioned, tested, and automated.

Your story

  • Networking & Gateway Authority: You have a deep understanding of the OSI model and experience managing API Gateways (specifically Apollo GraphQL). You understand how to optimize the "supergraph" for performance at the edge.
  • Edge & Security Specialist: Proven experience managing high-traffic CDN architectures (Cloudflare preferred) and a strong grasp of modern security protocols like OIDC, OAuth2, and JWT for standardizing service access.
  • Infrastructure Security: You have experience implementing "Zero Trust" architectures and managing private network connectivity to isolate internal environments from public exposure.
  • Cloud Native: You are comfortable in modern cloud environments (GCP/AWS) and have experience with Kubernetes (GKE), service mesh networking, and ingress controllers.
  • Automation First: You believe that manual changes are technical debt. You are proficient in Terraform and familiar with CI/CD workflows (GitHub Actions) for deploying networking changes safely.
  • Collaborative Leader: You enjoy working across teams (Security, DevEx, and Product) to solve horizontal problems. You can translate complex networking and auth concepts into actionable insights for non-experts.

About the Team

You will be joining the Platform Foundations group, a high-impact collective of engineers dedicated to building the "Engine" of On's technology. We manage our cloud infrastructure, Developer Experience (DevEx), and the Edge. We are a global team that values a "lead-by-example" culture. You will work alongside Staff and Principal engineers to bridge the gap between infrastructure and product, ensuring our technical investments directly accelerate the velocity of On’s mission.

On is a place that is centered around growth and progress. We offer an environment designed to give people the tools to develop holistically – to stay active, to learn, explore and innovate. Our distinctive approach combines a supportive, team-oriented atmosphere, with access to personal self-care for both physical and mental well-being, so each person is led by purpose.

On is an Equal Opportunity Employer. We are committed to creating a work environment that is fair and inclusive, where all decisions related to recruitment, advancement, and retention are free of discrimination.

We want to set everyone up for success, so here’s the lowdown on how we hire. Our process is a two-way street – bringing you into our culture, while helping us learn how you think. Our full process can last about eight weeks from application to offer, because we care about getting it right. These steps explain how we usually do things.

Before you get started, feel free to consider if you want to work with us. Strange question? Well, we give people a lot of space to navigate their day-to-day and that style isn’t for everyone. We want you to be passionate about what you do and be sure this is the right fit. Because when skills and passion combine – it creates that 'Wow' moment.

Step One: It starts with you... You’ll start by submitting your application to a specific role. We try to keep this step as simple as possible. We do get a lot of applications, but we review them all. If you’re a good fit to the role, a recruiter will follow up with you directly. If you didn’t receive a reply, or were unsuccessful this time around, we encourage you to look for other possible matches at On.

Senior Site Reliability Engineer - Edge in London employer: ON.com

At On, we pride ourselves on fostering a vibrant and inclusive work culture that prioritises personal growth and well-being. As a Senior Site Reliability Engineer in London, you will be part of a dynamic team dedicated to innovation and excellence, with access to cutting-edge technology and opportunities for professional development. Our commitment to a supportive environment ensures that you can thrive both personally and professionally while contributing to our mission of delivering exceptional experiences to athletes worldwide.

O

Contact Details:

ON.com Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Senior Site Reliability Engineer - Edge in London

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those already at On. A friendly chat can open doors and give you insights that a job description just can't.

Tip Number 2

Show off your skills in real-time! If you get the chance, participate in tech meetups or hackathons. It’s a great way to demonstrate your expertise and passion for SRE while connecting with potential colleagues.

Tip Number 3

Prepare for the interview by diving deep into On's tech stack. Familiarise yourself with Apollo GraphQL, Cloudflare, and Zero Trust models. The more you know, the better you can showcase how you fit into their mission.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in being part of the On team.

We think you need these skills to ace Senior Site Reliability Engineer - Edge in London

Site Reliability Engineering
HTTP/S
TCP
DNS
Apollo GraphQL API Gateway
Content Delivery Network (CDN)
Traffic Engineering

Some tips for your application 🫡

Be Yourself:When you're writing your application, let your personality shine through! We want to see the real you, so don’t be afraid to show your passion for technology and how it aligns with our mission.

Tailor Your Application:Make sure to customise your application for the Senior Site Reliability Engineer role. Highlight your experience with CDN architectures, API Gateways, and any relevant projects that showcase your skills in reliability and performance.

Showcase Your Achievements:Don’t just list your responsibilities; share your accomplishments! Use specific examples of how you've optimised systems or solved complex problems in previous roles to demonstrate your expertise.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role you’re excited about. Plus, it makes the process smoother for everyone!

How to prepare for a job interview at ON.com

Know Your Tech Inside Out

Make sure you have a solid grasp of the technologies mentioned in the job description, especially HTTP/S, TCP, and DNS. Brush up on your knowledge of Apollo GraphQL and CDN management, particularly with Cloudflare. Being able to discuss these topics confidently will show that you're not just familiar but truly understand how they impact performance and reliability.

Demonstrate Problem-Solving Skills

Prepare to showcase your troubleshooting abilities. Think of specific examples where you've tackled complex network issues or optimised performance. Use the STAR method (Situation, Task, Action, Result) to structure your answers, making it easy for the interviewers to see your thought process and the impact of your actions.

Showcase Your Collaborative Spirit

Since this role involves working across teams, be ready to discuss how you've successfully collaborated with others in the past. Share experiences where you translated technical concepts for non-experts or worked with security and product teams to solve problems. This will highlight your ability to communicate effectively and work well in a team-oriented environment.

Ask Insightful Questions

Prepare thoughtful questions about the company's culture, the team dynamics, and the specific challenges they face in their edge architecture. This not only shows your genuine interest in the role but also gives you a chance to assess if the company aligns with your values and career goals.