At a Glance
- Tasks: Join us as a Site Reliability Engineer to optimise and automate our global Cloud platform.
- Company: Tyk, a leading API Management platform with a mission to connect every system in the world.
- Benefits: Unlimited paid holidays, remote work, employee share scheme, and generous parental leave.
- Other info: Flexible hours, a culture of continuous improvement, and a commitment to diversity and inclusion.
- Why this job: Be part of a dynamic team that values innovation and empowers you to make a real impact.
- Qualifications: Experience in SRE roles, strong cloud tech knowledge, and excellent communication skills.
The predicted salary is between 60000 - 80000 £ per year.
Who are Tyk, and what do we do? The Tyk API Management platform is helping to drive the connected world and power new products and services. We’re changing the way that organisations connect any number of their systems and services. Whether internal, external, public or highly encrypted systems, Tyk helps businesses drive value across various industries including retail, finance, telecoms, healthcare, and media. Founded in 2015 with offices in London - UK, London - Ontario, Atlanta and Singapore, we have many thousands of users of our B2B platform across the globe.
Our Mission: Tyk is on a mission to connect every system in the world. We’ve started by building an API Management platform. Total flexibility, default remote, radical responsibility. We offer unlimited paid holidays and remote working from anywhere in the world for everyone. Tyk was founded on the principle of offering flexibility and autonomy to our employees, which lets them achieve their best results.
The role: At Tyk, we’re obsessed with building software that solves problems. We count on our Site Reliability Engineers (SREs) to empower users with a rich feature set, high availability, and stellar performance. Our customer base is growing, so we’re seeking an experienced Senior SRE to optimise, automate, and improve our performance, using insights from massive‑scale data in real time. We want an original thinker, a challenger, a technical legend, an opinionated collaborator who wants to make things better.
- Lead hands‑on maintenance and optimisation of our global Cloud platform within SL(A/I/O)s you'll help define
- Collaborate to shape SRE strategy, then translate into actionable technical plans coordinated through SCRUM
- Identify reliability issues, drive root‑cause analysis, and implement solutions alongside your squad
- Lead performance tuning and fault finding through analysis of OS and application metrics
- Design and implement automation for common operational tasks and cloud‑operations workflows
- Develop proactive alerting, monitoring roadmap, and relevant dashboards; define and track KPIs
- Participate in on‑call rotation, ensuring effective incident response and resolution within SLAs
- Conduct blame‑free post‑mortems, document findings, and maintain operational runbooks
- Drive multi‑region and multi‑cloud platform expansion with focus on scalability and automation
- Optimise infrastructure performance and cost efficiency without impacting service delivery
- Engage with commercial teams on growth plans and translate into technical SRE strategies
- Coordinate penetration testing through provider liaison, technical setup, and environment configuration
- Champion continuous improvement across processes, communication, and team practices
- Model excellence in software design and knowledge sharing
- Plan and execute software upgrades to enhance cloud services
Experience required:
- Experience in an SRE role
- Strong knowledge of cloud technologies and SLA / SLO / SLI management
- Excellent communication and leadership skills
- Ability to analyse and improve operational processes and performance metrics
- Experience in software design, automation, and root‑cause analysis
- On‑call support experience and customer‑focused mindset
- Collaborative attitude with commercial and technical teams
- Launching and operating production Kubernetes clusters
- Designing and operating infrastructure on AWS and other providers
- Operating MongoDB (or other document database) clusters
- Operating Redis (or other key‑value storage) clusters
- Administering Linux servers
- Operating Prometheus and Grafana
- Operating logging collection and analysis system
- Participating in the on‑call rotation (4:00am – 16:00pm UTC)
Skills:
- Kubernetes (administrator)
- Go and/or Python (advanced)
- AWS / EKS (advanced)
- Linux (advanced)
- Terraform and IaC in general (proficient)
- Helm (proficient)
- MongoDB (or similar)
- Redis (or similar)
- Monitoring – Prometheus, Grafana, Thanos (familiar)
- Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.)
- Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP)
- Proactive, energetic, innovative and change‑oriented
- A desire to lead/mentor a team
Here’s why you should join us:
- Everyone has unlimited paid holidays.
- We have total flexibility in hours.
- Employee share scheme
- Generous maternity and paternity leave
- Volunteering days
- Employee Wellbeing platform
We all share the same vision – we value authenticity, respect, responsibility, independence, honesty, diversity and inclusion and most importantly treating others how you wish to be treated. We look for like‑minded people who bring their personalities to work every day, strive to achieve their personal goals and who are willing to challenge the way we do things.
Our values:
- It’s ok to screw up!
- The only stupid idea is the untested one!
- Trust starts with you – make it count!
- Assume best intent!
- Make things better!
Tyk is an equal opportunities employer and we are determined to ensure that no applicant or employee receives less favourable treatment on the grounds of gender, age, disability, religion, belief, sexual orientation, marital status, or race, or is disadvantaged by conditions or requirements which cannot be shown to be justifiable.
Site Reliability Engineer - APAC employer: Tyk Technologies
At Tyk, we pride ourselves on being an exceptional employer, offering a unique blend of total flexibility and radical responsibility that empowers our Site Reliability Engineers to thrive. With unlimited paid holidays, generous parental leave, and a strong commitment to employee wellbeing, we foster a collaborative and inclusive work culture where innovation is celebrated and personal growth is encouraged. Join us in our mission to connect every system in the world while enjoying the benefits of remote work and a supportive team environment.
StudySmarter Expert Advice🤫
We think this is how you could land Site Reliability Engineer - APAC
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with current Tyk employees on LinkedIn. A friendly chat can sometimes lead to opportunities that aren’t even advertised!
✨Tip Number 2
Show off your skills! If you’ve got a GitHub or personal project that showcases your SRE expertise, make sure to highlight it during interviews. It’s a great way to demonstrate your hands-on experience and passion for the field.
✨Tip Number 3
Prepare for those technical interviews! Brush up on your cloud technologies, Kubernetes, and automation tools. Practising common SRE scenarios can help you feel more confident and ready to tackle any questions thrown your way.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Tyk team!
We think you need these skills to ace Site Reliability Engineer - APAC
Some tips for your application 🫡
Show Your Passion for SRE:When you're writing your application, let your enthusiasm for Site Reliability Engineering shine through! Share specific examples of how you've tackled challenges in the past and what excites you about optimising cloud platforms.
Tailor Your Application:Make sure to customise your application to reflect the skills and experiences that align with the job description. Highlight your expertise in cloud technologies, automation, and any relevant tools like Kubernetes or AWS to catch our eye!
Be Clear and Concise:We appreciate straightforward communication, so keep your application clear and to the point. Use bullet points where possible to make it easy for us to see your key achievements and skills at a glance.
Apply Through Our Website:Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, we love seeing applications come directly from our site!
How to prepare for a job interview at Tyk Technologies
✨Know Your Stuff
Make sure you brush up on your knowledge of cloud technologies, Kubernetes, and the specific tools mentioned in the job description. Tyk is looking for someone who can hit the ground running, so being able to discuss your experience with AWS, Prometheus, and Grafana will definitely give you an edge.
✨Show Your Problem-Solving Skills
Prepare to discuss past experiences where you've identified reliability issues and implemented solutions. Tyk values original thinkers, so be ready to share examples of how you've tackled challenges in your previous roles, especially in SRE contexts.
✨Communicate Clearly
Since excellent communication skills are a must for this role, practice articulating your thoughts clearly and concisely. Be prepared to explain complex technical concepts in a way that’s easy to understand, as you’ll need to collaborate with both technical and commercial teams.
✨Embrace the Culture
Familiarise yourself with Tyk's values and mission. They appreciate authenticity and a collaborative spirit, so think about how your personal values align with theirs. During the interview, don’t hesitate to express your enthusiasm for their flexible work culture and commitment to continuous improvement.