SRE (Linux, Firmware & Server Infrastructure) in Milton

Job Board

Companies

Networking People (UK) Limited

SRE (Linux, Firmware & Server Infrastructure)

SRE (Linux, Firmware & Server Infrastructure) in Milton

Milton Temporary Home office (partial)

Apply Now

At a Glance

Tasks: Resolve complex platform and hardware incidents while managing firmware lifecycle and server configurations.
Company: Join a high-performing enterprise infrastructure team in Glasgow with a hybrid work model.
Benefits: Competitive day rate, flexible working, and opportunities for professional growth.
Other info: Collaborative culture with a focus on continuous improvement and operational excellence.
Why this job: Make a real impact on critical platforms and enhance your skills in a dynamic environment.
Qualifications: Strong Linux expertise, incident management experience, and excellent communication skills required.

Contract: Senior Platform Reliability Engineer (Linux, Firmware & Server Infrastructure)

Location: Glasgow (Hybrid - 3 days onsite)

Duration: 6 months

Day Rate: Negotiable (Inside IR35 via umbrella)

Reference: 20460

Overview

We are seeking a Senior Platform Reliability Engineer with deep Linux systems expertise and strong exposure to server hardware, firmware, and low-level infrastructure operations. This role sits within a high-performing enterprise infrastructure team responsible for maintaining and improving the reliability of critical platforms at scale.

The position is heavily focused on resolving complex platform and hardware-related incidents, particularly those escalated from L3 support, with an emphasis on firmware lifecycle management, disk encryption, logging, and server configuration (BIOS-level controls) across multi-vendor environments. This is a hands-off hardware role, requiring strong remote troubleshooting capabilities, excellent communication skills, and the ability to work closely with internal teams and external vendors to drive issues through to resolution.

Key Responsibilities

Own and manage end-to-end incident resolution for platform and hardware-related issues, including triage, mitigation, escalation, and post-incident review
Diagnose and troubleshoot Linux OS-level issues arising from hardware faults, firmware changes, or configuration inconsistencies
Manage and support firmware lifecycle processes, including upgrades, validation, and issue remediation
Work with disk encryption technologies and logging frameworks, ensuring system integrity and auditability
Maintain and troubleshoot server configuration settings, including BIOS-level parameters across multiple hardware vendors (strong Dell focus)
Utilize out-of-band management tools (e.g., iDRAC, iLO, RACADM, Redfish APIs) for remote diagnostics and recovery
Analyse vendor logs, support bundles, and telemetry data to identify root causes and remediation paths
Engage directly with hardware vendors and engineering teams, managing escalations and driving timely resolutions
Contribute to continuous improvement initiatives, reducing incident recurrence and operational toil
Produce and maintain high-quality documentation, including runbooks, troubleshooting guides, and knowledge base articles
Participate in post-incident reviews (RCA) and support improvements in reliability metrics (MTTR, MTTD, SLOs)

Essential Skills & Experience

Strong Linux administration and troubleshooting expertise, including:

Process and service management
System logs and diagnostics
Networking fundamentals
Package and configuration management

Solid understanding of server hardware and infrastructure, including:

Disks, RAID/HBA controllers
NICs and firmware interactions
Hardware failure modes and OS-level symptoms

Proven experience with:

Firmware management and upgrades
Disk encryption and secure configurations
BIOS/server configuration management

Hands-on experience with remote management and lights-out technologies, such as:

iDRAC, iLO
RACADM
Redfish or similar APIs

Strong track record of incident ownership, including:

Triage and mitigation
Cross-team coordination
Stakeholder communication
Driving issues through to resolution

Experience working with:

Vendor diagnostics, logs, and support bundles
Vendor escalation processes and engineering engagement

Excellent communication skills (written and verbal), with the ability to clearly articulate technical issues to both technical and non-technical stakeholders
Strong documentation skills, including creation of runbooks, procedures, and RCA reports

Desirable Skills

Scripting and automation experience (e.g., Python, Bash, Ansible)
Familiarity with configuration management and automation frameworks
Exposure to virtualisation and containerisation technologies (VMware, KVM, Docker, Kubernetes)
Experience with monitoring, observability, and alerting systems, including log analysis and alert tuning
Understanding of SRE principles and metrics, including SLOs, SLIs, error budgets, MTTR/MTTD

Key Attributes

Methodical and detail-oriented approach to troubleshooting
Strong sense of ownership and accountability
Comfortable working in high-pressure, incident-driven environments
Collaborative mindset with the ability to work across global teams and vendors
Proactive approach to continuous improvement and operational excellence

SRE (Linux, Firmware & Server Infrastructure) in Milton employer: Networking People (UK) Limited

At Networking People, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters collaboration and innovation. Our Glasgow location provides a hybrid working model, allowing for flexibility while being part of a high-performing team dedicated to maintaining critical infrastructure. We are committed to employee growth, providing opportunities for continuous learning and development in a supportive environment, making us an ideal choice for those seeking meaningful and rewarding careers.

Contact Detail:

Networking People (UK) Limited Recruiting Team

View Networking People (UK) Limited Profile

StudySmarter Expert Advice🤫

We think this is how you could land SRE (Linux, Firmware & Server Infrastructure) in Milton

✨Tip Number 1

Get your networking game on! Reach out to folks in the industry, especially those already working at companies you're eyeing. A friendly chat can sometimes lead to insider info or even a referral, which can give you a leg up in the application process.

✨Tip Number 2

Prepare for interviews like it's a big exam. Brush up on your Linux skills and be ready to tackle real-world scenarios they might throw at you. Practising troubleshooting on the spot will show them you’re the go-to person for resolving complex issues.

✨Tip Number 3

Don’t just wait for job postings to pop up! Keep an eye on our website and apply as soon as you see something that fits. The quicker you act, the better your chances are of standing out in a sea of applicants.

✨Tip Number 4

Show off your documentation skills! Bring along examples of runbooks or troubleshooting guides you've created. This not only highlights your expertise but also demonstrates your commitment to improving processes and sharing knowledge with the team.

We think you need these skills to ace SRE (Linux, Firmware & Server Infrastructure) in Milton

Linux Administration

Troubleshooting Expertise

Server Hardware Knowledge

Firmware Management

Disk Encryption Technologies

BIOS Configuration Management

Remote Management Tools (iDRAC, iLO, RACADM, Redfish)

Incident Resolution Ownership

Cross-Team Coordination

Vendor Diagnostics and Support

Documentation Skills

Scripting and Automation (Python, Bash, Ansible)

Virtualisation and Containerisation Technologies (VMware, KVM, Docker, Kubernetes)

Monitoring and Observability Systems

Understanding of SRE Principles and Metrics

Some tips for your application 🫡

Tailor Your CV:Make sure your CV highlights your Linux expertise and experience with server hardware. We want to see how your skills match the job description, so don’t be shy about showcasing relevant projects or roles you've had.

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you’re the perfect fit for this role. We love seeing enthusiasm and a clear understanding of the responsibilities, so make it personal and engaging.

Show Off Your Troubleshooting Skills:Since this role is all about resolving complex incidents, include examples of how you've tackled similar challenges in the past. We want to know how you approach problem-solving and what tools you use!

Apply Through Our Website:Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity. Plus, we can’t wait to hear from you!

How to prepare for a job interview at Networking People (UK) Limited

✨Know Your Linux Inside Out

Make sure you brush up on your Linux administration skills. Be prepared to discuss your troubleshooting techniques and how you've resolved OS-level issues in the past. They’ll want to hear about specific incidents where you diagnosed problems and what steps you took to fix them.

✨Familiarise Yourself with Hardware and Firmware

Since this role involves a lot of hardware interaction, it’s crucial to understand server components and firmware management. Review common hardware failure modes and be ready to explain how you’ve handled firmware upgrades or issues in previous roles.

✨Showcase Your Communication Skills

This position requires excellent communication, especially when dealing with vendors and internal teams. Prepare examples of how you’ve effectively communicated technical issues to both technical and non-technical stakeholders. Think about times when your communication made a difference in resolving an incident.

✨Prepare for Incident Management Scenarios

Expect questions around incident ownership and resolution processes. Be ready to walk through your approach to triaging incidents, coordinating with teams, and conducting post-incident reviews. Highlight any continuous improvement initiatives you’ve contributed to that reduced incident recurrence.

SRE (Linux, Firmware & Server Infrastructure) in Milton

Networking People (UK) Limited

Location: Milton

Apply Now

SRE (Linux, Firmware & Server Infrastructure) in Milton

At a Glance

SRE (Linux, Firmware & Server Infrastructure) in Milton employer: Networking People (UK) Limited

StudySmarter Expert Advice🤫

We think you need these skills to ace SRE (Linux, Firmware & Server Infrastructure) in Milton

Some tips for your application 🫡

How to prepare for a job interview at Networking People (UK) Limited

Company

Product

Help