At a Glance
- Tasks: Ensure reliability and performance of critical services in a secure environment.
- Company: Join a leading organisation supporting national infrastructure and defence.
- Benefits: Competitive salary, project allowances, and opportunities for career development.
- Other info: Collaborative team culture with long-term stability and growth opportunities.
- Why this job: Tackle complex engineering challenges and make a real impact on national security.
- Qualifications: Experience in secure environments and strong troubleshooting skills required.
The predicted salary is between 36000 - 60000 £ per year.
Location: Manchester (on-site / secure environments)
Clearance: SC required | DV preferred
Employment: Permanent or Contract
Salary/Rate: Competitive + project allowances (DOE)
Overview
We are seeking a Site Reliability Engineer (SRE) to support mission-critical platforms within a secure, high-assurance environment. This role focuses on reliability, scalability, automation, and operational resilience across complex infrastructure and cloud-enabled services. You will work within a collaborative engineering team ensuring systems remain secure, performant, and highly available to support critical national infrastructure and defence programmes.
Key Responsibilities
- Maintain and improve reliability, availability, and performance of critical services
- Implement monitoring, alerting, and observability solutions
- Automate infrastructure provisioning and operational workflows
- Support incident response, root cause analysis, and post-incident reviews
- Improve system resilience through fault tolerance and self-healing design
- Collaborate with DevOps, platform, and security teams to enhance service stability
- Maintain documentation, runbooks, and operational procedures
- Ensure systems meet security and compliance requirements
Technical Environment
- Infrastructure & Cloud
- Linux systems administration
- AWS, Azure, or private cloud environments
- Virtualisation and container platforms
- Terraform, Ansible, Puppet, or similar
- CI/CD tooling (GitLab CI, Jenkins, Azure DevOps)
- Docker & Kubernetes
- Container security & runtime reliability
- Prometheus, Grafana, ELK stack, Splunk, or similar
- Logging, metrics, tracing & alerting strategies
- High availability design & scaling strategies
- Load balancing & traffic management
- Performance tuning & capacity planning
Essential Requirements
- Active SC clearance (minimum) or eligibility
- DV clearance highly desirable
- Experience supporting production environments in secure or regulated sectors
- Strong troubleshooting and incident management skills
- Ability to work on-site within secure facilities
Desirable Experience
- Experience in defence, government, or critical infrastructure environments
- Knowledge of security hardening & compliance frameworks
- Scripting skills (Python, Bash, PowerShell)
- Understanding of Zero Trust and secure architecture principles
Why Join?
- Work on nationally significant, high-impact programmes
- Access to complex engineering challenges in secure environments
- Collaborative teams focused on engineering excellence
- Long-term programme stability and career development
Apply now to support secure, mission-critical systems that underpin national capability.
Site Reliability Engineer in Manchester employer: Tektora
Contact Detail:
Tektora Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer in Manchester
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with other Site Reliability Engineers. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those involving automation, cloud services, or incident management. This gives potential employers a taste of what you can bring to the table.
✨Tip Number 3
Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Be ready to discuss your experience with monitoring tools, cloud environments, and how you've tackled reliability challenges in the past.
✨Tip Number 4
Don't forget to apply through our website! We’ve got loads of opportunities waiting for talented SREs like you. Plus, it’s a great way to ensure your application gets the attention it deserves.
We think you need these skills to ace Site Reliability Engineer in Manchester
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with Linux systems, cloud environments, and automation tools like Terraform or Ansible. We want to see how your skills match our needs!
Showcase Your Projects: Include any relevant projects you've worked on that demonstrate your ability to maintain and improve system reliability. If you've tackled incident response or implemented monitoring solutions, let us know! We love seeing real-world examples.
Be Clear and Concise: When writing your application, keep it clear and to the point. Use bullet points for key achievements and avoid jargon unless it's relevant. We appreciate straightforward communication that gets to the heart of your experience.
Apply Through Our Website: Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen to join our team at StudySmarter!
How to prepare for a job interview at Tektora
✨Know Your Tech Inside Out
Make sure you’re well-versed in the technical environment mentioned in the job description. Brush up on your Linux systems administration, AWS or Azure knowledge, and be ready to discuss your experience with automation tools like Terraform or Ansible. Being able to talk confidently about these technologies will show that you’re a strong candidate.
✨Demonstrate Problem-Solving Skills
Prepare to share specific examples of how you've tackled incidents in production environments. Think about times when you performed root cause analysis or improved system resilience. This will highlight your troubleshooting skills and your ability to work under pressure, which are crucial for a Site Reliability Engineer.
✨Show Your Collaborative Spirit
Since this role involves working closely with DevOps, platform, and security teams, be ready to discuss your experience in collaborative settings. Share examples of how you’ve worked with others to enhance service stability or improve operational workflows. This will demonstrate that you can thrive in a team-oriented environment.
✨Understand Security and Compliance
Given the nature of the role, it’s essential to have a solid grasp of security hardening and compliance frameworks. Familiarise yourself with Zero Trust principles and be prepared to discuss how you’ve implemented security measures in past roles. This will show that you take security seriously and understand its importance in maintaining reliable systems.