At a Glance
- Tasks: Manage and improve our global Tyk Cloud platform while solving reliability issues.
- Company: Join Tyk, a leading API Management platform with a mission to connect every system in the world.
- Benefits: Enjoy unlimited paid holidays, remote work, and a flexible schedule.
- Other info: Great career growth opportunities and a supportive, inclusive culture.
- Why this job: Be part of a dynamic team driving innovation in the tech industry.
- Qualifications: Experience with Kubernetes, AWS, and a passion for continuous improvement.
The predicted salary is between 60000 - 80000 £ per year.
The Tyk API Management platform is helping to drive the connected world and power new products and services. We’re changing the way that organisations connect any number of their systems and services. Whether internal, external, public or highly encrypted systems, Tyk helps businesses drive value across various industries including retail, finance, telecoms, healthcare, and media.
Founded in 2015 with offices in London – UK, London – Ontario, Atlanta, and Singapore, we have many thousands of users of our B2B platform across the globe. Our mission is to connect every system in the world by building an API Management platform.
Total flexibility, default remote, radical responsibility: We offer unlimited paid holidays and remote working from anywhere in the world for everyone. This principle allows our employees to achieve their best results and build the best possible team without barriers related to location or working hours.
The role: We’re looking for a Site Reliability Engineer to manage, maintain, improve, and provide support on our platform. You will be curious by nature, always looking for ways to improve, as we will look to you for new ideas, solutions, and metrics on how we can enhance the platform. You will also be our first line of incident management to our clients and will help define our response going forward.
Responsibilities:
- Maintaining global Tyk Cloud within SL(A/I/O)s and helping to define them.
- Identifying reliability issues and collaborating with your squad to solve them.
- Introducing new metrics and building relevant dashboards.
- Participating in the on-call rotation.
- Expanding multi-region and multi-cloud reach of the platform.
- Documenting operational knowledge.
- Conducting post-incident analysis.
- Contributing to our continuous improvement agenda.
- Ensuring the reliability of our new global Tyk Cloud platform.
- Automating operations and support.
- Writing and maintaining documentation on SRE processes and policies.
- Recommending and implementing ways to drive operational efficiency.
- Assisting in penetration testing for Cloud through liaising with our provider.
Experience:
- Launching and operating production scale Kubernetes clusters.
- Designing and operating infrastructure on AWS and other providers.
- Operating MongoDB (or other document database) clusters.
- Operating Redis (or other key-value storage) clusters.
- Operating Prometheus and Grafana.
- Operating logging collection and analysis systems.
- Participating in the on-call rotation (16:00pm – 4:00am UTC).
Skills:
- AWS / EKS (advanced).
- Terraform and IaC in general (proficient).
- Helm (proficient).
- MongoDB (or similar).
- Redis (or similar).
- Monitoring – Prometheus, Grafana, Thanos (familiar).
- Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.).
- Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP).
- Proactive, energetic, innovative, and change-oriented.
Nice to have:
- Bare metal infrastructure engineering.
- Familiarity with Rancher.
- CKA/CKAD/CKS certifications.
- Creating and delivering production software in Go language.
Here’s why you should join us:
- Unlimited paid holiday.
- Total flexibility in hours.
- Employee share scheme.
- Generous maternity and paternity leave.
- Company retreats.
Equal Opportunity Statement: Tyk is an equal opportunities employer and we are determined to ensure that no applicant or employee receives less favourable treatment on the grounds of gender, age, disability, religion, belief, sexual orientation, marital status, or race.
Remote Site Reliability Engineer — Global Cloud Platform employer: TYK TECHNOLOGIES LIMITED
At Tyk, we pride ourselves on being an exceptional employer, offering a unique blend of flexibility and autonomy that empowers our employees to thrive. With unlimited paid holidays, a fully remote work environment, and a strong commitment to continuous improvement, we foster a culture where innovation and creativity are at the forefront. Our diverse team enjoys generous parental leave, an employee share scheme, and opportunities for personal and professional growth, making Tyk a truly rewarding place to work.
StudySmarter Expert Advice🤫
We think this is how you could land Remote Site Reliability Engineer — Global Cloud Platform
✨Tip Number 1
Network like a pro! Reach out to current employees at Tyk on LinkedIn or other platforms. Ask them about their experiences and any tips they might have for your interview. It’s all about making connections!
✨Tip Number 2
Prepare for the technical side! Brush up on your Kubernetes, AWS, and monitoring tools like Prometheus and Grafana. Be ready to showcase your skills in real-time scenarios during the interview.
✨Tip Number 3
Show your curiosity! Tyk values innovative thinkers, so come prepared with ideas on how you can improve their platform. Think about metrics you’d introduce or reliability issues you’ve encountered before.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Tyk team.
We think you need these skills to ace Remote Site Reliability Engineer — Global Cloud Platform
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter for the Site Reliability Engineer role. Highlight your experience with Kubernetes, AWS, and any relevant projects that showcase your problem-solving skills. We want to see how you can contribute to our mission!
Show Your Curiosity:In your application, let us know about your curiosity and eagerness to improve systems. Share examples of how you've identified issues in the past and what steps you took to resolve them. We love candidates who are proactive and innovative!
Be Clear and Concise:When writing your application, keep it clear and to the point. Use bullet points where necessary to make it easy for us to read. We appreciate straightforward communication, especially when it comes to technical details.
Apply Through Our Website:Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team at Tyk!
How to prepare for a job interview at TYK TECHNOLOGIES LIMITED
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, especially AWS, Kubernetes, and Terraform. Brush up on your knowledge of MongoDB and Redis too, as these are crucial for the role.
✨Show Your Problem-Solving Skills
Prepare to discuss past incidents you've managed or resolved. Think about specific examples where you identified reliability issues and how you worked with your team to solve them. This will demonstrate your proactive approach and ability to contribute to continuous improvement.
✨Understand Tyk's Mission
Familiarise yourself with Tyk’s mission to connect every system in the world. Be ready to share your thoughts on how you can contribute to this goal, especially in terms of operational efficiency and driving down costs without impacting service.
✨Ask Insightful Questions
Prepare a few thoughtful questions about the company culture, the team you'll be working with, and the challenges they face. This shows your genuine interest in the role and helps you assess if Tyk is the right fit for you.