Site Reliability Engineer
Site Reliability Engineer

Site Reliability Engineer

Ledbury Full-Time 60000 - 84000 £ / year (est.) No home office possible
T

At a Glance

  • Tasks: Join us as a Site Reliability Engineer to ensure our services are reliable and efficient.
  • Company: TwinStream is a tech company formed by engineers with expertise in defense and security.
  • Benefits: Enjoy hybrid working, competitive day rates, and the chance to work on impactful projects.
  • Why this job: Be part of a growing team that values technical excellence and offers diverse learning opportunities.
  • Qualifications: Experience with configuration management, cloud services, and monitoring tools is essential.
  • Other info: Security clearance is required; we celebrate diversity and welcome all qualified candidates.

The predicted salary is between 60000 - 84000 £ per year.

Who are we:

In 2019, our founders were working as engineers solving complex cross domain problems in defence and security organisations.

TwinStream was formed to consolidate their collective expertise and experience into one business, providing technical excellence and exceptional service to their clients. We have teams working both on-site with clients and remotely from home.

Day Rate: £500 – £600

Location: Hybrid working near Ledbury with possible 24/7 call out when on rota

Security Clearance: Eligible for DV Clearance

About the role:

Our cross-domain services are used in high profile government organisations. The demand for these services continues to grow in both scope and scale. We are seeking an experienced Site Reliability Engineer to help satisfy that demand. As an SRE you will be responsible for ensuring the availability, performance and cost effectiveness of these services. You will be working with multiple feature development teams and the BAU/Support team to define and evolve our cloud & on-prem infrastructure & delivery pipelines, improving system observability, demonstrating performance and capacity improvements and proactively identifying and mitigating reliability risks.

Key Responsibilities of the Site Reliability Engineer:

  1. Collaborate with Software Engineers to improve reliability and performance in their subsystems
  2. Partner with System Administrators in automating toil and eliminating alerts
  3. Evolve observability and monitoring capabilities to identify and solve problems before they impact the business
  4. Support development environments to help us achieve our delivery and quality goals
  5. Research and evaluate technologies, tools and services to influence buy-vs-build decisions
  6. Develop expertise in diverse technical and business domains
  7. Expand your knowledge of the technical stacks used

Skills & Experience Required:

  1. Experience using modern configuration management tools (such as Ansible, Chef or similar)
  2. Experience working with Terraform
  3. Experience working with docker containers & container orchestration tools (such as Kubernetes, OpenShift or Docker Swarm)
  4. Experience both using and maintaining CI / CD tools (such as Jenkins or similar)
  5. Experience with monitoring tools such as InfluxDB, Prometheus or Grafana.
  6. Experience of event-driven integration with MQ messaging (RabbitMQ or similar AMQP solution)
  7. Good understanding of relational databases and SQL
  8. Linux command line, administration and shell scripting
  9. Working knowledge of network security protocols
  10. Experience using, developing with and maintaining cloud hosting services (ideally AWS EC2, RDS, S3, Lambda)

Desirable Skills:

  1. Industry experience writing well-tested code in one of our platform languages (Java, Go, Python or similar)
  2. Knowledge of cross domain principles & technologies
  3. Experience of working in a service management environment
  4. Practical applications of using observability patterns in previous systems
  5. Creating and monitoring system availability metrics and using those to drive work that reduces downtime

Further Information:

To meet the security requirements of certain clients and industries we serve, any job offer will be contingent upon the successful completion of a security screening process.

At TwinStream, we take pride in being an equal opportunity employer. We celebrate diversity and are committed to fostering an inclusive environment where all individuals are valued and respected. We welcome applications from qualified candidates regardless of race, religion, disability, age, sexual orientation, or gender.

#J-18808-Ljbffr

Site Reliability Engineer employer: Twinstream

At TwinStream, we pride ourselves on being an exceptional employer, offering a dynamic hybrid work environment near Ledbury that fosters collaboration and innovation. Our commitment to employee growth is evident through continuous learning opportunities and the chance to work on high-profile projects within government organizations. With a focus on diversity and inclusion, we ensure that every team member is valued, respected, and empowered to make a meaningful impact in their role.
T

Contact Detail:

Twinstream Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer

✨Tip Number 1

Familiarize yourself with the specific tools and technologies mentioned in the job description, such as Terraform, Docker, and CI/CD tools like Jenkins. Having hands-on experience or projects showcasing these skills can set you apart during the interview process.

✨Tip Number 2

Highlight your experience in collaborating with software engineers and system administrators. Be prepared to discuss specific examples where you improved reliability and performance in previous roles, as this aligns closely with the responsibilities of the Site Reliability Engineer position.

✨Tip Number 3

Research TwinStream and their work in high-profile government organizations. Understanding their mission and the challenges they face will help you tailor your responses and demonstrate your genuine interest in contributing to their success.

✨Tip Number 4

Prepare to discuss your approach to observability and monitoring. Be ready to share how you've previously identified and solved problems before they impacted the business, as this is a key aspect of the role you're applying for.

We think you need these skills to ace Site Reliability Engineer

Configuration Management Tools (Ansible, Chef)
Terraform
Docker Containers
Kubernetes
OpenShift
Docker Swarm
CI/CD Tools (Jenkins)
Monitoring Tools (InfluxDB, Prometheus, Grafana)
Event-Driven Integration (RabbitMQ, AMQP)
Relational Databases and SQL
Linux Command Line and Administration
Shell Scripting
Network Security Protocols
Cloud Hosting Services (AWS EC2, RDS, S3, Lambda)
Programming Languages (Java, Go, Python)
Observability Patterns
System Availability Metrics

Some tips for your application 🫡

Understand the Role: Make sure you fully understand the responsibilities and requirements of a Site Reliability Engineer. Tailor your application to highlight your relevant experience with configuration management tools, CI/CD processes, and cloud services.

Highlight Relevant Experience: In your CV and cover letter, emphasize your experience with tools like Terraform, Docker, and monitoring solutions such as Prometheus or Grafana. Provide specific examples of how you've improved system reliability and performance in previous roles.

Showcase Collaboration Skills: Since the role involves working closely with software engineers and system administrators, mention any past experiences where you successfully collaborated with cross-functional teams to achieve common goals.

Tailor Your Cover Letter: Write a personalized cover letter that reflects your understanding of TwinStream's mission and values. Discuss how your skills align with their needs and express your enthusiasm for contributing to their projects.

How to prepare for a job interview at Twinstream

✨Showcase Your Technical Skills

Be prepared to discuss your experience with configuration management tools like Ansible or Chef, and demonstrate your knowledge of Terraform. Highlight specific projects where you've successfully implemented these technologies.

✨Emphasize Collaboration

Since the role involves working closely with software engineers and system administrators, share examples of how you've collaborated in past roles to improve system reliability and performance. This will show that you can work well in a team environment.

✨Demonstrate Problem-Solving Abilities

Prepare to discuss how you've identified and mitigated reliability risks in previous positions. Use specific examples to illustrate your proactive approach to problem-solving and improving system observability.

✨Understand the Business Impact

Be ready to explain how your technical decisions have positively impacted business outcomes. Discuss how you've used monitoring tools like Prometheus or Grafana to drive improvements in system availability and performance.

Site Reliability Engineer
Twinstream
T
  • Site Reliability Engineer

    Ledbury
    Full-Time
    60000 - 84000 £ / year (est.)

    Application deadline: 2027-03-29

  • T

    Twinstream

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>