At a Glance
- Tasks: Join a global team to ensure high availability and reliability of cloud infrastructure.
- Company: GBST, a leader in wealth management technology with a commitment to innovation.
- Benefits: Flexible working, health insurance, discounts, and professional development opportunities.
- Other info: Enjoy excellent career growth and a supportive team culture.
- Why this job: Make a real impact in a dynamic environment while enhancing your tech skills.
- Qualifications: Experience with Kubernetes, AWS, and strong problem-solving abilities.
The predicted salary is between £60,000 and £80,000 per year.
About GBST
At GBST, we’re inspiring wealth innovation for wealth management and advice organisations globally. Our commitment to product excellence, track record of continued and successful delivery, and hard work have earned us the trust and partnership of many of the world’s leading financial services organisations. We’ve invested heavily in transforming our technology stack to bring a truly immersive digital experience to the front and back office.
Opportunity
We’re now on the lookout for a Cloud Site Reliability Engineer. You’ll be joining a global, diverse team working with cross‑functional stakeholders. This is a permanent full‑time opportunity based in London.
Candidate Profile
- Ability to work on multiple tasks in parallel
- Problem solver
- Excellent communicator
- Desire to improve things
Required Skills
- Kubernetes
- Kubernetes and application troubleshooting
- Application deployment
- GitOps / ArgoCD
- K8s and application logging (Loki / Fluent Bit)
- Service Mesh (Linkerd preferred)
- Ingress Config / Troubleshooting (AWS LB Controller / Nginx)
- Autoscaling configuration (Karpenter)
- Certificate management (cert-manager)
- AWS Services: EKS, RDS, DMS, RDS Proxy, AWS Backup, API Gateway, RabbitMQ, AWS Transfer Family (SFTP / SFTP Connector), AWS NGFW, TGW, PrivateLink, AppStream, Lambda – Python, IAM, Kinesis, DynamoDB
- Terragrunt / Terraform
- Troubleshooting defects
- GitOps
- Helm / ArgoCD
- Observability Tooling: Grafana, Prometheus, Loki, CloudWatch configuration/dashboard creation
- CI/CD: Git / CodeDeploy / CodePipeline
Platform Operations
- Managing and optimising our infrastructure to ensure high availability and system reliability
- Delivering 24/7 support via an on‑call rotation for out‑of‑hours issues
Experience & Qualifications
- Strong knowledge of container orchestration tools like Kubernetes and Docker
- Familiarity with deploying infrastructure as code (IaC) with Terraform and CloudFormation
- Chaos engineering proficiency: understanding of how to implement resilience testing strategies
- Using chaos engineering tools such as AWS Fault Injection Service, Gremlin, Chaos Monkey, or LitmusChaos to design and execute fault injection experiments
- Knowledge of modern chaos engineering trends, such as adaptive resilience testing or AI‑driven fault detection
- Monitoring and observability: Experience with monitoring and observability tools such as Prometheus, ADOT, Grafana, Datadog, New Relic, Elastic Stack
- Strong understanding of instrumenting infrastructure with metrics, logging, and tracing
- Automation and scripting: Proficiency in scripting and automation languages such as Python, Go, Shell, Ruby, or Java
- Demonstrated ability to automate infrastructure and operational processes
- Incident management and root cause analysis: Participating in incident response processes, triage, mitigation, and communication
- Familiarity with incident management tools such as PagerDuty or Opsgenie
- Responding to production incidents, troubleshooting issues across the full stack, ensuring minimal downtime by driving root cause analysis and applying long‑term fixes
- Conducting blameless post‑mortems to identify root causes and derive actionable insights, ensuring continuous improvement
- Developing playbooks for common incidents, reducing MTTR
- Resilience and scalability design: Understanding of system design principles, scalability, and high‑availability architectures
- Practical experience with load testing and performance benchmarking tools such as JMeter, Locust, k6
- Designing and testing disaster recovery (DR) strategies to ensure minimal downtime and data integrity during failures
Benefits
- 2 days flexible/hybrid working arrangement
- Instant savings and discounts on major retailers across the country
- Private health insurance including dental and optical cover
- Non‑contributory pension scheme
- Salary sacrifice schemes – car, cycle to work and additional pension contributions
- Additional GBST & U day off every year
- Employee assistance programme (EAP)
- LinkedIn Learning
Cloud Site Reliability Engineer in London employer: GBST Holdings Ltd
At GBST, we pride ourselves on being an excellent employer, offering a dynamic work culture that fosters innovation and collaboration within our diverse global team. Based in London, our Cloud Site Reliability Engineers benefit from flexible working arrangements, comprehensive health insurance, and numerous professional development opportunities, including access to LinkedIn Learning. Join us to be part of a forward-thinking company that values your contributions and supports your growth in the ever-evolving tech landscape.
StudySmarter Expert Advice🤫
We think this is how you could land Cloud Site Reliability Engineer in London
✨Join the IT Consultancy Buzz
Get involved in local or virtual IT consultancy meetups and forums. This is where you can rub shoulders with industry professionals, get insights into what GBST Holdings Ltd values, and even spot unadvertised opportunities. Don't miss out on these chances to make a name for yourself in the IT world!
✨Show Off Your Skills
Create a personal project or case study relevant to the challenges GBST Holdings Ltd might face. Use platforms like GitHub or Medium to share your findings. This not only demonstrates your consulting skills but also shows a proactive attitude, making you stand out from the crowd when applying for that full-time gig.
✨Leverage LinkedIn for Connections
Follow and engage with the relevant thought leaders and influencers in IT consultancy on LinkedIn. Share insightful content and join discussions to gain visibility. A well-placed comment or shared article could catch the attention of someone at GBST Holdings Ltd!
✨Direct Apply to GBST Holdings Ltd
Don't forget to apply directly through the GBST Holdings Ltd website! Tailor your application to showcase your understanding of their consulting style and how you can contribute to their projects. A personalised approach can make a huge difference in landing that full-time position!
Some tips for your application 🫡
Showcase Your Problem-Solving Skills: In IT consulting, it's all about problem-solving, so make sure your CV highlights your analytical skills and any relevant projects you've tackled. Mention specific technologies or methodologies you've used to resolve issues or improve processes; this shows you can think critically and deliver results, which is vital for us at GBST Holdings Ltd.
Highlight Relevant Certifications: Certifications like ITIL, PMP, or even specific tech stack qualifications can really make you stand out. Make sure to include these in your CV, as they not only demonstrate your expertise but also your commitment to staying current in the field. We love seeing candidates who are proactive about their professional development!
Tailor Your Cover Letter: Your cover letter is your chance to connect personally with us at GBST Holdings Ltd. Share stories about your experiences in IT consulting, and how they shaped your desire to join our team. Mention why you’re excited about this particular role, and how you see yourself contributing to our projects.
Keep It Clear and Concise: We're all busy, so make sure your application is easy to read. Use bullet points for key achievements, and don’t overload us with jargon. A clean, professional layout goes a long way. Remember, the clearer your application, the more likely we are to invite you in for an interview!
How to prepare for a job interview at GBST Holdings Ltd
✨Brush Up on Your Technical Skills
For an IT consulting role, be ready to demonstrate your technical prowess. You might face questions on systems integration, cloud technologies, or even troubleshooting specific software. If you have experience with tools like AWS, Azure, or even specific programming languages, make sure you can talk about them fluently.
✨Showcase Your Problem-Solving Approach
IT consulting is all about solving problems for clients. Think about how you can illustrate your approach to a past challenge using the STAR method (Situation, Task, Action, Result). It's a great way to show how you tackle complex issues and come up with effective solutions.
✨Know the Business Impact of IT Solutions
When discussing your experiences, focus not just on the tech solutions you implemented, but also on their business impact. Employers want to see that you can connect IT with organisational goals. Prep examples that highlight how your tech contributions improved efficiency or reduced costs for past clients or projects.
✨Prepare for Behavioural Questions
Since IT consulting often involves teamwork and client interactions, expect behavioural questions that assess your interpersonal skills. Be prepared with examples that demonstrate your adaptability, communication skills, and how you handle client feedback. Before the interview, think of situations where you worked closely with clients to create effective IT strategies or changes.