At a Glance
- Tasks: Join a top-notch team to enhance platform reliability and tackle hardware incidents.
- Company: Dynamic tech company in Glasgow with a hybrid work model.
- Benefits: Competitive day rate, flexible working, and opportunities for skill development.
- Other info: Exciting chance to grow in a fast-paced environment with a focus on innovation.
- Why this job: Make a real impact on critical systems while honing your Linux expertise.
- Qualifications: Strong Linux skills and experience in server troubleshooting required.
We are looking for an experienced Linux Site Reliability Engineer (SRE) to join a high-performing infrastructure support team focused on maintaining and improving critical platform reliability within a large-scale enterprise environment.
This position will focus on resolving hardware and platform-related incidents escalated from the L3 support team. The successful candidate will have strong Linux systems expertise, hands-on physical server troubleshooting experience, and a proactive approach to operational improvement, automation, and incident reduction.
Essential Skills / Requirements- Strong Linux administration and troubleshooting skills (process, networking basics, logs, package/service management).
- Solid understanding of server hardware and peripherals (disks, RAID/HBA, NICs, firmware) and how failures present at OS level.
- Experience with out-of-band management / lights-out technologies (e.g., iDRAC, iLO, IPMI/Redfish) for remote troubleshooting and recovery.
- Proven ability to own incidents end-to-end: triage, identify mitigations/workarounds, coordinate with L3/engineering, communicate status, and drive to resolution.
- Understanding of SRE operational practices and metrics (e.g., SLO/SLI concepts, error budgets, MTTD/MTTR) and a continuous-improvement mindset.
- Strong communication skills (written and verbal): clear incident updates, customer/stakeholder management, and effective escalation and handoffs.
- Strong documentation skills: writing clear runbooks/procedures, contributing to knowledge bases, and participating in post-incident reviews/root cause analysis.
- Scripting and automation skills (e.g., Bash, Python) to build small tools, checks, and workflow automation that reduce toil.
- Familiarity with virtualization and containerization concepts/operations (e.g., VMware/KVM, Docker, Kubernetes) and using automation to support these environments.
- Experience with monitoring/observability and alerting workflows (dashboards, log analysis, alert tuning) and translating signals into actionable response steps.
Site Reliability Linux Engineer in Glasgow employer: Networking People (UK) Limited
Contact Detail:
Networking People (UK) Limited Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Linux Engineer in Glasgow
✨Tip Number 1
Network like a pro! Attend industry meetups or tech events in Glasgow to connect with other SREs and potential employers. You never know who might be looking for someone with your skills!
✨Tip Number 2
Show off your skills! Create a GitHub repository showcasing your scripting and automation projects. This gives you a chance to demonstrate your hands-on experience and problem-solving abilities to potential employers.
✨Tip Number 3
Prepare for interviews by brushing up on your incident management skills. Be ready to discuss how you've triaged incidents in the past and what steps you took to resolve them. Real-life examples will make you stand out!
✨Tip Number 4
Don’t forget to apply through our website! We’re always on the lookout for talented individuals like you, and applying directly can give you an edge in the hiring process.
We think you need these skills to ace Site Reliability Linux Engineer in Glasgow
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your Linux administration skills and any hands-on experience with server hardware. We want to see how your background aligns with the role, so don’t be shy about showcasing relevant projects or achievements!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about Site Reliability Engineering and how your proactive approach can contribute to our team. Keep it concise but impactful – we love a good story!
Show Off Your Communication Skills: Since strong communication is key for this role, make sure your application reflects that. Whether it’s through clear incident updates or effective stakeholder management, we want to see how you convey complex information simply and effectively.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about what we do at StudySmarter!
How to prepare for a job interview at Networking People (UK) Limited
✨Know Your Linux Inside Out
Make sure you brush up on your Linux administration skills. Be ready to discuss troubleshooting processes, networking basics, and how to manage packages and services. Practising common scenarios can help you articulate your thought process during the interview.
✨Get Familiar with Server Hardware
Understand the ins and outs of server hardware and peripherals. Be prepared to explain how different failures present at the OS level. Having hands-on experience will give you an edge, so if you can, set up a test environment to play around with RAID configurations and NICs.
✨Master Incident Management
Show that you can own incidents from start to finish. Prepare examples of past incidents where you triaged issues, coordinated with teams, and communicated effectively. Highlight your ability to drive incidents to resolution while keeping stakeholders informed.
✨Scripting Skills are a Plus
If you have experience with scripting in Bash or Python, be sure to mention it! Talk about any small tools or automation workflows you've built to reduce toil. This shows your proactive approach to operational improvement, which is key for a Site Reliability Engineer.