At a Glance
- Tasks: Design and maintain scalable Kubernetes environments while ensuring platform reliability.
- Company: Join a high-performing engineering team in a dynamic tech environment.
- Benefits: Competitive daily rate, travel expenses covered, and hybrid working model.
- Why this job: Make an impact on cutting-edge projects while travelling across Europe.
- Qualifications: Strong experience with Kubernetes, networking, and Linux administration.
- Other info: Exciting opportunity for career growth in a collaborative setting.
The predicted salary is between £48,000 and £72,000 per year.
Location: UK/Ireland/Netherlands with EU travel required 2-3 times per year (4-12 weeks at a time).
Rate: circa £400-405 per day outside of IR35. Travel expensed.
Start: ASAP
Duration: 12 months
You will need to hold an EU passport so that you can work and travel freely in Europe for long periods at a time.
We are looking for an experienced Site Reliability Engineer (SRE) with a background in networking to join a high-performing engineering team on a long-term contract project. The contract is hybrid: you will work remotely and then on site during deployments, with EU travel required 2-3 times per year for up to 8 weeks at a time.
Responsibilities
- Design, operate, and maintain highly available and scalable Kubernetes environments
- Ensure platform reliability through proactive monitoring, alerting, and incident response
- Implement and maintain secure networking, PKI, and service mesh solutions
- Troubleshoot complex issues across Linux, Windows, networking, and hardware layers
- Collaborate with engineering teams to improve system resilience and operational maturity
- Support infrastructure automation and observability initiatives
Required Skills & Experience (Must Have)
- Kubernetes - strong hands-on experience operating production clusters in on-prem environments
- Networking - deep understanding of TCP/IP, routing, DNS, load balancing, and network troubleshooting
- Linux - strong administration and troubleshooting skills
- Windows (Basics) - working knowledge of Windows systems
- PKI - certificates, TLS, and secure communications
- Hardware - understanding of server hardware and physical infrastructure
- (Service Mesh) - traffic management, security, and observability
Nice to Have
- Windows (Advanced) - in-depth administration and troubleshooting
- Ansible - configuration management and automation
- Prometheus - monitoring and alerting solutions
Optional / Beneficial Experience
- Storage Technologies & Hardware - enterprise storage systems and concepts
- Firewall Hardware - deployment and operational experience
- Cilium - eBPF-based networking and security
- Grafana / Loki - metrics visualization and log aggregation
- Virtualization in Kubernetes - running or integrating virtualized workloads within Kubernetes environments
Site Reliability Engineer in London employer: The Chrysalis Programme
Contact Details:
The Chrysalis Programme Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer in London
✨Tip Number 1
Network like a pro! Reach out to fellow SREs and tech enthusiasts on LinkedIn or at local meetups. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those involving Kubernetes and networking. This gives potential employers a taste of what you can do beyond your CV.
✨Tip Number 3
Prepare for interviews by brushing up on common SRE scenarios. Think about how you'd handle incidents or improve system reliability. Practising these responses will help you stand out during the interview process.
✨Tip Number 4
Don't forget to apply through our website! We've got some fantastic opportunities waiting for you, and applying directly can sometimes give you a leg up in the hiring process.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with Kubernetes and networking. We want to see how your skills match the job description, so don't be shy about showcasing your relevant projects!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're the perfect fit for our Site Reliability Engineer role. We love seeing your personality come through, so keep it engaging and relevant.
Showcase Your Problem-Solving Skills: In your application, include examples of how you've tackled complex issues in past roles. We're looking for someone who can troubleshoot effectively, so share those success stories!
Apply Through Our Website: We encourage you to apply directly through our website. It's the best way for us to receive your application and ensures you don't miss out on any important updates from our team!
How to prepare for a job interview at The Chrysalis Programme
✨Know Your Kubernetes Inside Out
Make sure you can talk confidently about your hands-on experience with Kubernetes. Be ready to discuss specific challenges you've faced while operating production clusters and how you overcame them. This will show that you're not just familiar with the technology, but that you can effectively manage it in real-world scenarios.
✨Brush Up on Networking Fundamentals
Since a deep understanding of networking is crucial for this role, review key concepts like TCP/IP, routing, and DNS. Prepare to answer questions about network troubleshooting and load balancing. You might even want to bring examples of past experiences where you resolved complex networking issues.
✨Demonstrate Your Problem-Solving Skills
Be prepared to tackle hypothetical scenarios or case studies during the interview. Think about how you would troubleshoot issues across different layers, including Linux and Windows. Show them your thought process and how you approach problem-solving, as this is vital for an SRE role.
✨Show Your Collaborative Spirit
Collaboration is key in engineering teams, so be ready to discuss how you've worked with others to improve system resilience and operational maturity. Share specific examples of projects where teamwork made a difference, and highlight your communication skills to demonstrate that you can work well in a hybrid environment.