Site Reliability Engineer

Site Reliability Engineer

Oxford Full-Time 43200 - 72000 £ / year (est.) No home office possible
J

At a Glance

  • Tasks: Join us as a Site Reliability Engineer to ensure our systems are reliable and scalable.
  • Company: Be part of an innovative PaaS company focused on remote monitoring and network management.
  • Benefits: Enjoy flexible work options, competitive salary, and opportunities for professional growth.
  • Why this job: Make a real impact by optimising systems for millions of users while collaborating with diverse teams.
  • Qualifications: You need a degree in a tech field and 7+ years in Site Reliability Engineering or related roles.
  • Other info: Experience with AWS, Linux, and CI/CD is essential; bring your scripting skills to the table!

The predicted salary is between 43200 - 72000 £ per year.

Our partner, an innovative PaaS company specializing in remote monitoring and network management solutions, is looking for a Site Reliability Engineer to help ensure the critical infrastructure and applications' reliability, scalability, and performance. In this role, you’ll build and maintain highly available systems, support and optimize CI/CD pipelines, and determine optimal solutions for the company’s products. You’ll collaborate closely with development, DevOps, and other teams to maintain high uptime, security, and user experience standards for millions of endpoints.

Experience and Education:

  • Bachelor's or higher degree in Computer Science, Information Systems, Information Technology, or a related technical field/experience.
  • 7+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or related roles.
  • Deep understanding of AWS and its various modules and services.
  • Strong background in Linux administration and troubleshooting.
  • Proven experience in implementing and managing CI/CD pipelines and Infrastructure as Code (IAC) solutions.
  • Proven experience in monitoring and observability tools to proactively manage system health.

Skills and Strengths:

  • AWS (Amazon Web Services)
  • Auto Scaling
  • Fargate
  • Route53
  • Observability tools (New Relic, DataDog, Splunk)
  • Scripting (Ansible, Bash, Python, GO)
  • CI/CD

Primary Job Responsibilities:

  • Design and support EC2/ECS/EKS/Fargate environments for high availability and fault tolerance.
  • Implement advanced AWS features (Route53, ALB/NLB, multi-region setups) to ensure global reliability.
  • Maintain and optimize the existing CI/CD pipelines and deployment processes to streamline software delivery, reduce risks, and ensure seamless integration of new features.
  • Collaborate with Development, QA, and DevOps teams to integrate best practices into build and release processes.
  • Implement, manage, and enhance monitoring tools to proactively detect and resolve system issues.
  • Administer and optimize Linux-based servers and applications, ensuring stability, performance, and security.
  • Implement and manage containerization solutions to improve scalability and efficiency.
  • Implement security best practices across AWS environments, ensuring compliance with industry standards and safeguarding cloud infrastructure.
  • Develop automated incident response mechanisms and self-healing solutions to minimize downtime and enhance fault tolerance.
  • Diagnose and resolve infrastructure, networking, and application-related performance issues to ensure operational efficiency.
  • Ensure business continuity by designing and maintaining robust backups, failover strategies, and disaster recovery solutions.
  • Identify, diagnose, and resolve infrastructure or application performance bottlenecks.
  • Create real-time monitoring dashboards and alerting systems to track system health, capacity, and performance trends.
  • Work closely with development teams to fine-tune infrastructure for cost efficiency while maintaining high performance.
  • Ensure business continuity by designing and maintaining robust backup, failover, and disaster recovery solutions.

Site Reliability Engineer employer: JR United Kingdom

Join an innovative PaaS company in the Oxford district, where as a Site Reliability Engineer, you will be part of a dynamic team dedicated to ensuring the reliability and performance of critical infrastructure. With a strong focus on employee growth, we offer opportunities for professional development and collaboration across various teams, fostering a culture of innovation and excellence. Enjoy the unique advantage of working in a vibrant location that combines a rich history with modern technological advancements, making it an ideal place for meaningful and rewarding employment.
J

Contact Detail:

JR United Kingdom Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer

✨Tip Number 1

Familiarise yourself with the specific AWS services mentioned in the job description, such as EC2, ECS, and Route53. Having hands-on experience or projects that showcase your skills with these services can set you apart during discussions.

✨Tip Number 2

Demonstrate your understanding of CI/CD pipelines by preparing to discuss any relevant tools you've used, like Jenkins or GitLab CI. Be ready to share examples of how you've optimised these processes in previous roles.

✨Tip Number 3

Showcase your problem-solving skills by preparing to discuss specific incidents where you diagnosed and resolved performance issues. Use metrics or outcomes to illustrate the impact of your solutions.

✨Tip Number 4

Network with current Site Reliability Engineers or professionals in similar roles through platforms like LinkedIn. Engaging with them can provide insights into the company culture and expectations, which can be beneficial during your application process.

We think you need these skills to ace Site Reliability Engineer

AWS (Amazon Web Services)
Linux Administration
CI/CD Pipeline Management
Infrastructure as Code (IAC)
Monitoring and Observability Tools
Scripting (Ansible, Bash, Python, GO)
Containerization Solutions
Network Management
Security Best Practices
Incident Response Mechanisms
Disaster Recovery Solutions
Performance Tuning
Collaboration with Development and DevOps Teams
Problem-Solving Skills
Attention to Detail

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in Site Reliability Engineering, DevOps, and AWS. Use specific examples of projects where you've implemented CI/CD pipelines or managed infrastructure to demonstrate your skills.

Craft a Compelling Cover Letter: Write a cover letter that showcases your passion for reliability and performance in systems. Mention your experience with monitoring tools and how you can contribute to the company's goals. Personalise it to reflect your understanding of their needs.

Highlight Technical Skills: Clearly list your technical skills related to the job description, such as AWS services, Linux administration, and scripting languages. Use bullet points for clarity and ensure they align with the requirements mentioned in the job posting.

Showcase Problem-Solving Abilities: Include examples of how you've diagnosed and resolved performance issues in past roles. This could be through case studies or specific incidents where your actions led to improved system reliability or efficiency.

How to prepare for a job interview at JR United Kingdom

✨Showcase Your Technical Skills

Be prepared to discuss your experience with AWS, Linux administration, and CI/CD pipelines in detail. Highlight specific projects where you've implemented these technologies, as this will demonstrate your hands-on expertise.

✨Understand the Company’s Products

Research the company’s PaaS solutions and their approach to remote monitoring and network management. Being able to discuss how your skills can enhance their products will show your genuine interest and alignment with their goals.

✨Prepare for Scenario-Based Questions

Expect questions that assess your problem-solving abilities in real-world scenarios. Think of examples where you’ve had to troubleshoot system issues or optimise performance, and be ready to explain your thought process.

✨Emphasise Collaboration Skills

Since the role involves working closely with development, QA, and DevOps teams, be sure to highlight your teamwork experiences. Share examples of how you’ve successfully collaborated on projects to achieve common goals.

Site Reliability Engineer
JR United Kingdom
J
  • Site Reliability Engineer

    Oxford
    Full-Time
    43200 - 72000 £ / year (est.)

    Application deadline: 2027-06-18

  • J

    JR United Kingdom

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>