Linux Site Reliability Engineer in Milton

Milton | Full-Time | No home office possible
Networking People (UK) Limited

At a Glance

  • Tasks: Join a dynamic team to enhance platform reliability and troubleshoot server issues.
  • Company: High-performing infrastructure support team in a large-scale enterprise.
  • Benefits: Competitive day rate, hybrid work model, and opportunities for professional growth.
  • Other info: Exciting chance to work with cutting-edge technologies and improve operational practices.
  • Why this job: Make a real impact on critical systems while developing your skills in a supportive environment.
  • Qualifications: Strong Linux skills, troubleshooting experience, and a proactive mindset.

We are looking for an experienced Linux Site Reliability Engineer (SRE) to join a high-performing infrastructure support team that maintains and improves critical platform reliability within a large-scale enterprise environment. The role focuses on resolving hardware and platform-related incidents escalated from the L3 support team. The successful candidate will have strong Linux systems expertise, hands-on physical server troubleshooting experience, and a proactive approach to operational improvement, automation, and incident reduction.

Essential Skills / Requirements

  • Strong Linux administration and troubleshooting skills (processes, networking basics, logs, package/service management).
  • Solid understanding of server hardware and peripherals (disks, RAID/HBA, NICs, firmware) and how failures present at OS level.
  • Experience with out-of-band management / lights-out technologies (e.g., iDRAC, iLO, IPMI/Redfish) for remote troubleshooting and recovery.
  • Proven ability to own incidents end-to-end: triage, identify mitigations/workarounds, coordinate with L3/engineering, communicate status, and drive to resolution.
  • Understanding of SRE operational practices and metrics (e.g., SLO/SLI concepts, error budgets, MTTD/MTTR) and a continuous-improvement mindset.
  • Strong communication skills (written and verbal): clear incident updates, customer/stakeholder management, and effective escalation and handoffs.
  • Strong documentation skills: writing clear runbooks/procedures, contributing to knowledge bases, and participating in post-incident reviews/root cause analysis.

Nice to Have / Desired Skills

  • Scripting and automation skills (e.g., Bash, Python) to build small tools, checks, and workflow automation that reduce toil.
  • Familiarity with virtualization and containerization concepts/operations (e.g., VMware/KVM, Docker, Kubernetes) and using automation to support these environments.
  • Experience with monitoring/observability and alerting workflows (dashboards, log analysis, alert tuning) and translating signals into actionable response steps.

Linux Site Reliability Engineer in Milton employer: Networking People (UK) Limited

Join a dynamic and innovative team in Glasgow as a Linux Site Reliability Engineer, where you will play a crucial role in enhancing platform reliability within a large-scale enterprise environment. Our hybrid work culture promotes flexibility and collaboration, while our commitment to employee growth ensures you have access to continuous learning opportunities and the chance to make a meaningful impact. With a focus on operational improvement and automation, you'll thrive in an environment that values your expertise and encourages proactive problem-solving.

Contact Details:

Networking People (UK) Limited Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Linux Site Reliability Engineer role in Milton

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with other Linux enthusiasts. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a GitHub repository showcasing your projects, scripts, or any automation tools you've built. This gives potential employers a taste of what you can do beyond just your CV.

✨Tip Number 3

Prepare for those interviews! Brush up on your troubleshooting skills and be ready to discuss real-life scenarios where you've resolved incidents. Practice explaining your thought process clearly and confidently.

✨Tip Number 4

Apply through our website! We make it easy for you to find roles that match your skills. Plus, it shows you're genuinely interested in joining our team. Don't miss out on the chance to land that dream job!

We think you need these skills to ace the Linux Site Reliability Engineer role in Milton

Linux Administration
Troubleshooting Skills
Server Hardware Knowledge
Out-of-Band Management
Incident Management
SRE Operational Practices
Communication Skills
Documentation Skills
Scripting Skills
Automation Skills
Virtualization Concepts
Containerization Concepts
Monitoring and Observability
Log Analysis

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your Linux administration skills and any relevant experience with server hardware. We want to see how your background aligns with the role, so don’t be shy about showcasing your troubleshooting expertise!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about Site Reliability Engineering and how your proactive approach can contribute to our team. Keep it concise but impactful!

Show Off Your Documentation Skills: Since strong documentation is key for this role, consider including examples of runbooks or procedures you've written in the past. This will demonstrate your ability to communicate effectively and contribute to knowledge bases.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates. Good luck!

How to prepare for a job interview at Networking People (UK) Limited

✨Know Your Linux Inside Out

Make sure you brush up on your Linux administration skills. Be ready to discuss troubleshooting processes, networking basics, and how to manage packages and services. They’ll likely ask you about real-world scenarios, so think of examples where you've resolved issues effectively.

✨Get Familiar with Server Hardware

Understand the ins and outs of server hardware and peripherals. Be prepared to explain how different failures present at the OS level. It’s a good idea to have some hands-on experience or anecdotes about dealing with RAID, NICs, and firmware issues.

✨Show Off Your Incident Management Skills

Be ready to talk about your experience owning incidents from start to finish. Highlight your ability to triage, identify workarounds, and communicate effectively with teams. They’ll want to see that you can drive incidents to resolution while keeping stakeholders informed.

✨Demonstrate Your Continuous Improvement Mindset

Discuss your understanding of SRE practices and metrics like SLOs and error budgets. Share examples of how you've implemented automation or improvements in past roles. This shows you’re proactive and committed to enhancing platform reliability.
