Linux Site Reliability Engineer in Central

Linux Site Reliability Engineer in Central

Central Temporary Home office (partial)
Networking People (UK) Limited

At a Glance

  • Tasks: Join a top-notch team to enhance platform reliability and tackle hardware incidents.
  • Company: Dynamic tech company in Glasgow with a hybrid work model.
  • Benefits: Competitive day rate, flexible working, and opportunities for skill development.
  • Other info: Exciting chance to grow in a fast-paced, supportive environment.
  • Why this job: Make a real impact on critical systems while honing your Linux expertise.
  • Qualifications: Strong Linux skills and experience in server troubleshooting required.

We are looking for an experienced Linux Site Reliability Engineer (SRE) to join a high-performing infrastructure support team focused on maintaining and improving critical platform reliability within a large-scale enterprise environment. This position will focus on resolving hardware and platform-related incidents escalated from the L3 support team. The successful candidate will have strong Linux systems expertise, hands-on physical server troubleshooting experience, and a proactive approach to operational improvement, automation, and incident reduction.

Essential Skills / Requirements

  • Strong Linux administration and troubleshooting skills (process, networking basics, logs, package/service management).
  • Solid understanding of server hardware and peripherals (disks, RAID/HBA, NICs, firmware) and how failures present at OS level.
  • Experience with out-of-band management / lights-out technologies (e.g., iDRAC, iLO, IPMI/Redfish) for remote troubleshooting and recovery.
  • Proven ability to own incidents end-to-end: triage, identify mitigations/workarounds, coordinate with L3/engineering, communicate status, and drive to resolution.
  • Understanding of SRE operational practices and metrics (e.g., SLO/SLI concepts, error budgets, MTTD/MTTR) and a continuous-improvement mindset.
  • Strong communication skills (written and verbal): clear incident updates, customer/stakeholder management, and effective escalation and handoffs.
  • Strong documentation skills: writing clear runbooks/procedures, contributing to knowledge bases, and participating in post-incident reviews/root cause analysis.

Nice to Have / Desired Skills

  • Scripting and automation skills (e.g., Bash, Python) to build small tools, checks, and workflow automation that reduce toil.
  • Familiarity with virtualization and containerization concepts/operations (e.g., VMware/KVM, Docker, Kubernetes) and using automation to support these environments.
  • Experience with monitoring/observability and alerting workflows (dashboards, log analysis, alert tuning) and translating signals into actionable response steps.

Linux Site Reliability Engineer in Central employer: Networking People (UK) Limited

Join a dynamic and innovative team in Glasgow as a Linux Site Reliability Engineer, where you will play a crucial role in enhancing platform reliability within a large-scale enterprise environment. Our hybrid work culture promotes flexibility and collaboration, while our commitment to employee growth ensures you have access to continuous learning opportunities and the chance to make a meaningful impact on operational improvements. With a focus on teamwork and proactive problem-solving, we offer a supportive atmosphere that values your expertise and encourages professional development.
Networking People (UK) Limited

Contact Detail:

Networking People (UK) Limited Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Linux Site Reliability Engineer in Central

✨Tip Number 1

Network like a pro! Attend industry meetups or tech events in Glasgow to connect with other SREs and potential employers. You never know who might be looking for someone with your Linux expertise!

✨Tip Number 2

Show off your skills! Create a GitHub repository showcasing your scripting and automation projects. This is a great way to demonstrate your hands-on experience and proactive approach to operational improvement.

✨Tip Number 3

Prepare for interviews by brushing up on your incident management skills. Be ready to discuss how you've triaged incidents in the past and what metrics you used to measure success. We want to see that continuous-improvement mindset!

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets noticed. Plus, we love seeing candidates who are genuinely interested in joining our high-performing infrastructure support team.

We think you need these skills to ace Linux Site Reliability Engineer in Central

Linux Administration
Troubleshooting Skills
Server Hardware Knowledge
Out-of-Band Management
Incident Management
SRE Operational Practices
Communication Skills
Documentation Skills
Scripting Skills
Automation Skills
Virtualization Concepts
Containerization Concepts
Monitoring and Observability
Networking Basics

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your Linux administration skills and any hands-on server troubleshooting experience. We want to see how your background aligns with the role, so don’t be shy about showcasing relevant projects or achievements!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about Site Reliability Engineering and how your proactive approach can contribute to our team. Keep it concise but impactful – we love a good story!

Show Off Your Communication Skills: Since strong communication is key for this role, make sure your application reflects that. Whether it’s through clear incident updates or effective stakeholder management, we want to see how you convey complex information simply and effectively.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team at StudySmarter!

How to prepare for a job interview at Networking People (UK) Limited

✨Know Your Linux Inside Out

Make sure you brush up on your Linux administration skills. Be ready to discuss troubleshooting processes, networking basics, and how to manage packages and services. Practise explaining your thought process when resolving issues, as this will show your depth of knowledge.

✨Get Hands-On with Server Hardware

Familiarise yourself with server hardware components like disks, RAID, and NICs. Be prepared to talk about how failures manifest at the OS level and share any personal experiences you've had with physical server troubleshooting. This practical knowledge can really set you apart.

✨Master Incident Management

Understand the end-to-end incident management process. Be ready to discuss how you triage incidents, identify workarounds, and communicate effectively with stakeholders. Highlight any past experiences where you successfully drove an incident to resolution.

✨Show Off Your Automation Skills

If you have scripting experience, especially in Bash or Python, make sure to mention it! Talk about any small tools or automation workflows you've built to reduce toil. This shows that you not only understand the technical side but also have a proactive approach to improving operations.

Linux Site Reliability Engineer in Central
Networking People (UK) Limited
Location: Central

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>