Site Reliability Engineer in London

London | Full-Time | £36,000 - £60,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Ensure cloud services run smoothly and improve platform performance.
  • Company: Leading global provider of communication and collaboration solutions.
  • Benefits: Competitive salary, flexible working options, and opportunities for professional growth.
  • Why this job: Join a dynamic team and make a real impact on global cloud services.
  • Qualifications: Degree in IT or related field and 2 years of relevant experience.
  • Other info: Collaborative environment with strong focus on innovation and automation.

The predicted salary is between £36,000 and £60,000 per year.

About the Company

Our client is a leading global provider of communication and collaboration solutions, including cloud-based video conferencing and IP voice communication products. The company works with international partners and supports customers across multiple regions.

About the Role

We are looking for a Site Reliability Engineer (SRE) to support the operation and reliability of overseas cloud services. You will help ensure platform stability, handle incidents, support service requests and changes, and drive automation and continuous improvements to enhance service availability and performance.

Key Responsibilities

  • Operate and maintain overseas cloud services to ensure stable and reliable platform performance.
  • Monitor system health, identify performance bottlenecks, and implement improvements.
  • Manage operational activities including incident management, service request handling, problem management, and change management.
  • Perform software updates and deployments, and maintain core platform systems.
  • Respond to major and minor service disruptions, restore services, and conduct root cause analysis (RCA).
  • Develop and maintain automation tools/scripts to improve operational efficiency and reduce manual work.
  • Maintain clear documentation such as runbooks, SOPs, and technical procedures.
  • Participate in an on-call support roster to ensure timely response to production issues.

Requirements

  • Degree in Computer Science, Information Technology, Engineering, or a related discipline (or equivalent practical experience).
  • At least 2 years of relevant experience in SRE / DevOps / Cloud Operations / Platform Engineering or related roles.
  • Strong knowledge of Linux system administration and troubleshooting.
  • Hands-on experience with containers and Kubernetes.
  • Familiarity with common automation and configuration tools such as Ansible.
  • Experience with at least one scripting language such as Python and/or Shell.
  • Exposure to public cloud environments such as AWS and/or Azure is an advantage.
  • Good communication and stakeholder management skills, with the ability to work across teams.
  • Strong problem-solving skills and ability to work effectively in a production support environment.

Site Reliability Engineer in London employer: PDR GROUP (SEA) PTE. LTD.

Our client is an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration in the field of communication and cloud services. With a strong commitment to employee growth, they provide ample opportunities for professional development and skill enhancement, particularly in cutting-edge technologies like cloud operations and automation. Located in a vibrant area with access to international partners, employees enjoy a supportive environment that values work-life balance and encourages continuous improvement.

Contact Details:

PDR GROUP (SEA) PTE. LTD. Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer role in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with potential colleagues on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to cloud services and automation. This gives you a chance to demonstrate your expertise beyond just a CV.

✨Tip Number 3

Prepare for interviews by brushing up on common SRE scenarios and problem-solving questions. Practice articulating your thought process clearly, as communication is key in this role. We want to see how you tackle real-world challenges!

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team.

We think you need these skills to ace the Site Reliability Engineer role in London

Site Reliability Engineering (SRE)
Cloud Operations
Linux System Administration
Troubleshooting
Containers
Kubernetes
Ansible
Python
Shell Scripting
Public Cloud Environments (AWS, Azure)
Incident Management
Problem Management
Change Management
Automation Tools
Documentation Skills

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with cloud services, incident management, and automation tools. We want to see how your skills match what we're looking for!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about SRE and how your background makes you a great fit for our team. Keep it concise but engaging – we love a good story!

Show Off Your Technical Skills: Don’t hold back on showcasing your technical skills in your application. Mention your experience with Linux, containers, and any scripting languages you know. We’re keen to see how you can contribute to our platform's stability and performance.

Apply Through Our Website: We encourage you to apply through our website for a smoother process. It helps us keep track of your application and ensures you don’t miss out on any important updates. Plus, it’s super easy!

How to prepare for a job interview at PDR GROUP (SEA) PTE. LTD.

✨Know Your Tech Inside Out

Make sure you brush up on your Linux system administration and troubleshooting skills. Be ready to discuss your hands-on experience with containers and Kubernetes, as well as any automation tools you've used like Ansible. The more confident you are in your technical knowledge, the better you'll impress the interviewers.

✨Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've handled incidents or service disruptions in the past. Highlight your approach to root cause analysis (RCA) and any improvements you implemented afterwards. This will demonstrate your ability to think critically and act decisively under pressure.

✨Communicate Clearly and Effectively

Since good communication is key for this role, practice explaining complex technical concepts in simple terms. Be ready to discuss how you've collaborated with different teams and managed stakeholders in previous roles. Clear communication can set you apart from other candidates.

✨Demonstrate Your Passion for Automation

Talk about any automation tools or scripts you've developed to improve operational efficiency. If you have experience with scripting languages like Python or Shell, be prepared to discuss specific projects where you applied these skills. Showing your enthusiasm for continuous improvement will resonate well with the interviewers.
