At a Glance
- Tasks: Own the architecture and performance of our global entry points and optimise network traffic.
- Company: Join a dynamic tech company focused on innovation and athlete experience.
- Benefits: Enjoy a supportive environment with growth opportunities and a focus on well-being.
- Why this job: Make a real impact on global performance while working with cutting-edge technology.
- Qualifications: Expertise in networking, API gateways, and modern security protocols required.
- Other info: Collaborative culture with a commitment to inclusivity and personal development.
The predicted salary is between 70000 - 90000 ÂŁ per year.
In the dynamic landscape of On, our technology thrives much like a spirited runner: always moving, always improving. We are building the foundation that allows our engineering organization to scale, innovate, and deliver "Wow" to athletes worldwide. To power this mission, we are seeking a Senior Site Reliability Engineer (SRE) – Edge who understands that reliability, security, and performance start at the transport layer.
You won’t just manage a CDN; you will own the architecture and performance of our global entry points, including our Apollo GraphQL API Gateway. You will leverage expert‑level knowledge of HTTP/S, TCP, and DNS to optimize for global throughput. This is a hands‑on senior role where you will troubleshoot advanced network bottlenecks, design our future content delivery strategy, and act as the technical authority for our Web Application Firewall (WAF), bot mitigation, and standardized service authentication.
Your mission
- Edge Architecture & API Gateway: Ensure high availability (99.95%+ uptime) for On’s digital platforms and our central Apollo GraphQL Gateway. You will design the "front door" of our infrastructure to be elastic, handling the unique scaling demands of both static web assets and complex federated API traffic.
- Traffic Engineering & Segmentation: Lead the strategic roadmap for our CDN (Cloudflare) and networking stack. You will distinguish between the needs of customer‑facing web applications and internal service‑to‑service communication, implementing optimized routing for each.
- Environment Isolation & Security: Implement and maintain robust guardrails to protect our internal ecosystem. You will be responsible for restricting pre‑production environments (e.g., Staging, QA) from the public internet using Zero Trust models, IP‑based access controls, or OIDC‑integrated tunnels.
- Standardized Auth & Access: Drive the standardization of authentication and authorization at the edge. You will ensure that every request entering our network is consistently validated, providing a secure and seamless identity layer for all microservices.
- Advanced Troubleshooting: Serve as the organization’s "Level 3" expert for complex network traffic analysis. You are the one who dives into packet captures, TLS handshakes, and Apollo query latencies to find the root cause of global performance regressions.
- Shielding the Origin: Take full ownership of our WAF and Bot Management strategy. You will design and implement measures to protect our services from DDoS attacks and malicious actors without impacting the legitimate athlete experience.
- Infrastructure as Code (IaC): Treat the network and the gateway as code. You will manage edge configurations and gateway routing using Terraform, ensuring our security rules and routing logic are versioned, tested, and automated.
Your story
- Networking & Gateway Authority: You have a deep understanding of the OSI model and experience managing API Gateways (specifically Apollo GraphQL). You understand how to optimize the "supergraph" for performance at the edge.
- Edge & Security Specialist: Proven experience managing high‑traffic CDN architectures (Cloudflare preferred) and a strong grasp of modern security protocols like OIDC, OAuth2, and JWT for standardizing service access.
- Infrastructure Security: You have experience implementing "Zero Trust" architectures and managing private network connectivity to isolate internal environments from public exposure.
- Cloud Native: You are comfortable in modern cloud environments (GCP/AWS) and have experience with Kubernetes (GKE), service mesh networking, and ingress controllers.
- Automation First: You believe that manual changes are technical debt. You are proficient in Terraform and familiar with CI/CD workflows (GitHub Actions) for deploying networking changes safely.
- Collaborative Leader: You enjoy working across teams (Security, DevEx, and Product) to solve horizontal problems. You can translate complex networking and auth concepts into actionable insights for non‑experts.
What We Offer
On is a place that is centered around growth and progress. We offer an environment designed to give people the tools to develop holistically – to stay active, to learn, explore and innovate. Our distinctive approach combines a supportive, team-oriented atmosphere, with access to personal self‑care for both physical and mental well‑being, so each person is led by purpose. On is an Equal Opportunity Employer. We are committed to creating a work environment that is fair and inclusive, where all decisions related to recruitment, advancement, and retention are free of discrimination.
Senior Site Reliability Engineer - Edge employer: On
Contact Detail:
On Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior Site Reliability Engineer - Edge
✨Tip Number 1
Network like a pro! Attend industry meetups and tech conferences to connect with other SREs and potential employers. You never know who might be looking for someone with your skills!
✨Tip Number 2
Show off your expertise! Create a personal project or contribute to open-source that showcases your skills in managing CDN architectures or API Gateways. This can really make you stand out during interviews.
✨Tip Number 3
Practice makes perfect! Prepare for technical interviews by solving real-world problems related to network traffic analysis and security protocols. Use platforms like LeetCode or HackerRank to sharpen your skills.
✨Tip Number 4
Apply through our website! We love seeing candidates who are genuinely interested in joining our team. Tailor your application to highlight your experience with Cloudflare and Zero Trust models, and let us know why you're excited about the role!
We think you need these skills to ace Senior Site Reliability Engineer - Edge
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the Senior Site Reliability Engineer role. Highlight your expertise in HTTP/S, TCP, and DNS, and don’t forget to mention any hands-on experience with CDN management and API Gateways.
Craft a Compelling Cover Letter: Your cover letter is your chance to show us your personality and passion for the role. Share specific examples of how you've tackled challenges in network performance or security, and explain why you're excited about joining our team at On.
Showcase Your Problem-Solving Skills: In your application, give us a glimpse into your troubleshooting process. Describe a complex network issue you resolved and the steps you took to get there. We love seeing how you think and approach problems!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows us you’re keen on being part of our team!
How to prepare for a job interview at On
✨Know Your Tech Inside Out
Make sure you have a solid grasp of the technologies mentioned in the job description, especially HTTP/S, TCP, and DNS. Brush up on your knowledge of Apollo GraphQL and CDN management, particularly with Cloudflare. Being able to discuss these topics confidently will show that you're not just familiar but truly understand how they impact performance and reliability.
✨Demonstrate Problem-Solving Skills
Prepare to showcase your troubleshooting abilities. Think of specific examples where you've tackled complex network issues or optimised traffic routing. Be ready to dive into technical details, like packet captures or TLS handshakes, as this role requires a hands-on approach to advanced problems.
✨Showcase Your Collaborative Spirit
This role involves working across various teams, so highlight your experience in collaboration. Share stories about how you've translated complex concepts for non-technical stakeholders or worked with security and product teams to solve problems. This will demonstrate your ability to communicate effectively and lead cross-functional initiatives.
✨Emphasise Automation and Security
Since the role focuses on Infrastructure as Code and Zero Trust models, be prepared to discuss your experience with Terraform and CI/CD workflows. Talk about how you've automated processes in the past and implemented security measures to protect environments. This will show that you align with the company's commitment to innovation and security.