At a Glance
- Tasks: Maintain cloud infrastructure and manage incidents for clients using Google Cloud Platform.
- Company: WALT Labs empowers businesses with tailored cloud technology solutions.
- Benefits: Enjoy 20 holiday days, private health insurance, and a supportive work environment.
- Why this job: Join a dynamic team, tackle exciting challenges, and enhance your skills in cloud technology.
- Qualifications: 8-10 years experience in cloud management, GCP expertise, and strong troubleshooting skills required.
- Other info: Full-time on-site role in Kings Cross, London, with opportunities for professional growth.
The predicted salary is between 43200 - 72000 £ per year.
WALT Labs, a leading managed service provider, is dedicated to empowering businesses by harnessing the power of cloud technology. Our team specializes in delivering customized solutions tailored to meet the unique needs of our clients, driving growth and operational efficiency across industries.
This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled Site Reliability Engineer with a strong focus on Google Cloud Platform (GCP) to join our dynamic team. In this role, you’ll be responsible for maintaining cloud infrastructure, managing incidents, and ensuring seamless operations for our clients. You’ll use tools like incident.io and JIRA to manage and resolve support requests efficiently.
Qualifications
- 8-10 years of experience managing applications and infrastructure performance.
- Proven experience with Google Cloud Platform (GCP) services.
- Familiarity with incident.io for incident tracking and management (or equivalent).
- Proficiency in using JIRA for task management and support workflows.
- Strong experience working with observability tools (Grafana).
- Strong troubleshooting and problem-solving skills in cloud environments.
- Understanding of cloud security and performance optimisation best practices.
- Knowledge of scripting or automation tools (e.g., Python, Terraform) is a plus.
- Excellent communication and customer service skills.
- Certifications in GCP (Professional certifications) are highly desirable.
- Ability to work under pressure and prioritise tasks effectively.
- Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience).
Responsibilities
- Provide technical support and resolve issues related to Google Cloud Platform (GCP) services and AWS.
- Manage and respond to cloud incidents using incident.io, ensuring timely resolution.
- Use JIRA to log, track, and prioritize support tickets and workflow tasks.
- Monitor and maintain cloud infrastructure for performance, reliability, and security.
- Collaborate with teams to identify and implement solutions to technical challenges.
- Assist in deploying, configuring, and optimising GCP resources.
- Create and maintain documentation for troubleshooting processes and best practices.
- Proactively identify opportunities to improve cloud environments and support processes.
- Support clients and stakeholders by providing clear communication and updates during incident resolution.
- Stay up-to-date with the latest GCP developments and contribute to team knowledge sharing.
Benefits
- 20 holiday days + bank holidays (earn 1.5 days every 3 years).
- Private health insurance.
Site Reliability Engineer (City of London) employer: WALT Labs
Contact Detail:
WALT Labs Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (City of London)
✨Tip Number 1
Familiarise yourself with Google Cloud Platform (GCP) services and tools, as this role heavily focuses on them. Consider taking online courses or obtaining certifications to demonstrate your expertise and commitment to potential employers.
✨Tip Number 2
Gain hands-on experience with incident management tools like incident.io and JIRA. If you haven't used these tools before, try to find similar platforms to practice on, as familiarity with these systems will be crucial in your day-to-day responsibilities.
✨Tip Number 3
Showcase your troubleshooting and problem-solving skills by preparing examples of past incidents you've managed. Be ready to discuss how you approached these challenges and the outcomes, as this will highlight your capability to handle pressure effectively.
✨Tip Number 4
Stay updated on the latest trends and developments in cloud technology, particularly GCP. Engaging in relevant forums, attending webinars, or following industry leaders on social media can provide insights that may set you apart from other candidates.
We think you need these skills to ace Site Reliability Engineer (City of London)
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with Google Cloud Platform (GCP) and any relevant tools like incident.io and JIRA. Use specific examples to demonstrate your skills in managing cloud infrastructure and resolving incidents.
Craft a Compelling Cover Letter: In your cover letter, express your enthusiasm for the Site Reliability Engineer role at WALT Labs. Mention how your background aligns with their focus on cloud technology and your ability to work under pressure while providing excellent customer service.
Showcase Relevant Experience: When detailing your work history, emphasise your 8-10 years of experience in managing applications and infrastructure performance. Highlight any certifications in GCP and your familiarity with observability tools like Grafana.
Prepare for Technical Questions: Anticipate technical questions related to cloud security, performance optimisation, and troubleshooting in cloud environments. Be ready to discuss your experience with scripting or automation tools, as this could set you apart from other candidates.
How to prepare for a job interview at WALT Labs
✨Showcase Your GCP Expertise
Make sure to highlight your experience with Google Cloud Platform during the interview. Be prepared to discuss specific projects where you've successfully implemented GCP services, as this will demonstrate your hands-on knowledge and problem-solving skills.
✨Familiarise Yourself with Incident Management Tools
Since the role involves using incident.io for managing incidents, it’s a good idea to familiarise yourself with this tool or similar ones. Be ready to explain how you would handle incidents and provide examples of past experiences where you effectively managed support requests.
✨Demonstrate Your Troubleshooting Skills
Prepare to discuss your approach to troubleshooting in cloud environments. Think of specific challenges you've faced and how you resolved them, especially in relation to performance optimisation and security best practices.
✨Communicate Clearly and Confidently
Excellent communication is key in this role. Practice articulating your thoughts clearly, especially when discussing technical concepts. Be ready to explain complex ideas in simple terms, as you may need to communicate with clients and stakeholders who are not as technically savvy.