Site Reliability Engineer (Applications)

Belfast Full-Time No home office possible

At a Glance

Tasks: Join a dynamic team as a Site Reliability Engineer, enhancing application reliability and performance.
Company: A leading global investment firm with $63 billion AUM, focused on tech-driven solutions.
Benefits: Enjoy free meals, wellness perks, generous bonuses, and a supportive work environment.
Why this job: Make a real impact in a fast-paced culture that values innovation and continuous learning.
Qualifications: Expert coding skills, knowledge of observability systems, and experience with cloud technologies required.
Other info: Flexible caregiver leave, sabbaticals, and a commitment to sustainability and community involvement.

An amazing Global Investment Client of ours located in Central London is looking for a Site Reliability Engineer to join their team on a permanent basis. This is a rare opportunity and the package offered for this role is up to £300k depending on skills and experience.

ABOUT THE COMPANY

The company is a leading provider of alternative investment solutions with approximately $63 billion of assets under management (“AUM”) and over 550 employees worldwide including London, New York, Singapore, and Hong Kong. One of their founding beliefs is that technology and data are at the core of the business allowing them to build and maintain cutting-edge hardware and software solutions. The technology team is lean and has a culture that encourages interaction across all areas of the business on a global scale. Their aim is to use the best tool for the job; therefore, there is the opportunity to be constantly learning and use modern technologies. Their teams strive to push boundaries and think innovatively creating an environment that is fast-paced, dynamic, and successful.

ABOUT THE ROLE

They are looking for an enthusiastic Site Reliability Engineer to join the SRE team in London. Their team is central to the business as they are responsible for the technology that underpins everything they do; therefore, you will have a direct impact on the success of the company. From scaling for the huge volumes of data that drive their research process to improving the reliability and speed of a rapidly evolving application estate, there is always a relentless focus on automation and efficiency at scale. The company's engineers own their varied technology stack, end-to-end, and are in constant search of incremental improvements, new technologies, and ways of working to evolve their platform and give them a competitive edge. They are looking for people who want to find unique solutions for optimising efficiency and performance in a context where they are key enablers. The ideal candidate will be passionate about improving reliability and removing toil by identifying opportunities for automation and building platforms to make the systems more 'reliable by default'.

Responsibilities:

Evangelise the SRE mindset and implement best practices across the environment.
Understand the business and find ways to measure and enhance resilience across the application estate.
Eliminate the toil that emerges with complex, distributed systems by automating where possible.
Work as both an individual contributor and collaboratively to find new ways of improving the reliability, availability, security, and performance of the infrastructure.
Accelerate the migration strategy to more cloud-native, distributed applications.
Improve productivity and developer experience through automation and interface improvements in local tool chains, IDEs, CI/CD.

Requirements:

Expert level scripting / coding skills in one or more languages (Python / Golang etc.).
Expert knowledge of observability systems (Prometheus / ELK / Jaeger / Opentelemetry / Service Meshes etc.).
Experience with configuration management tools (Ansible / Puppet / Kapitan / Terraform).
Experience with distributed data platforms (Kafka / Flink / Airflow).
Comfortable using cloud-native and containerisation technologies (Kubernetes / Docker).
Good Linux systems knowledge (experience with RHEL desirable).
Broad knowledge across network technologies, server virtualisation, and storage.
Self-starter, able to quickly pick up concepts, implement new ideas and think outside the box.
Focused on improving system reliability, availability, security, and performance through testing, automation, and standardisation.
Ability to simply articulate the 'why' behind best practices.
Ability to build positive and collaborative relationships with colleagues across teams and geographies.

PERKS & BENEFITS

Food & Beverage: Complimentary breakfast and lunch for all employees plus on-site coffee bars and a wide variety of healthy snacks.
Annual Discretionary Bonuses: Reflecting firm and individual performance.
Cycle to Work Initiative: Green loan scheme which employees are able to use for the purchase of bicycles.
Employee Referral Programme: Bonus for each successful hire in the month your referral joins the company.
Global Office Design: They aim to create a cohesive environment, regardless of region.
Pension Scheme: Generous pension and retirement savings plans.
Carbon Offset Programme: The company offsets its CO2 emissions annually and aims to sustainably source all office materials.
Physical and Mental Fitness: Health and wellness benefits include an onsite gym & classes (LDN and NYC), gym subsidies in other regions, access to mental health support, and subscriptions to mindfulness platforms.
Charity Donation Matching: Generous charity matching scheme and ample opportunities to become involved in the community.
Enhanced Caregiver Leave: Enhanced, flexible primary and secondary caregiver leave.
Sabbatical: Generous sabbatical after you’ve been with the company for 8 years and every 4 years after that.
Annual Training Allowance: Encourage personal and professional development.
Health and Life Insurance: Range of healthcare benefits to help you manage your personal, physical, and emotional wellbeing.

Site Reliability Engineer (Applications) employer: H&R Talent

Join a leading global investment firm in Central London that prioritises technology and innovation, offering a dynamic work culture where your contributions directly impact the company's success. With a focus on continuous learning and modern technologies, employees enjoy exceptional benefits including complimentary meals, wellness programmes, and generous training allowances, all within a collaborative environment that fosters personal and professional growth.

Contact Detail:

H&R Talent Recruiting Team

View H&R Talent Profile

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer (Applications)

✨Tip Number 1

Familiarise yourself with the specific technologies mentioned in the job description, such as Python, Golang, and Kubernetes. Being able to discuss your experience with these tools during interviews will demonstrate your technical fit for the role.

✨Tip Number 2

Showcase your problem-solving skills by preparing examples of how you've improved system reliability or automated processes in previous roles. This will help you illustrate your ability to contribute to the company's focus on efficiency and performance.

✨Tip Number 3

Research the company’s culture and values, particularly their emphasis on technology and data. Be ready to discuss how your personal values align with theirs, as cultural fit is often just as important as technical skills.

✨Tip Number 4

Network with current or former employees of the company through platforms like LinkedIn. Engaging with them can provide insights into the interview process and company dynamics, which can be invaluable when preparing for your application.

We think you need these skills to ace Site Reliability Engineer (Applications)

Expert level scripting skills in Python or Golang

Knowledge of observability systems (Prometheus, ELK, Jaeger, OpenTelemetry)

Experience with configuration management tools (Ansible, Puppet, Terraform)

Familiarity with distributed data platforms (Kafka, Flink, Airflow)

Proficiency in cloud-native and containerisation technologies (Kubernetes, Docker)

Strong Linux systems knowledge (RHEL experience desirable)

Broad understanding of network technologies, server virtualisation, and storage

Ability to automate processes to improve system reliability and performance

Strong problem-solving skills and innovative thinking

Excellent communication skills for articulating best practices

Ability to build collaborative relationships across teams and geographies

Focus on enhancing developer experience through automation and tool improvements

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience and skills that align with the Site Reliability Engineer role. Focus on your expertise in scripting, observability systems, and cloud-native technologies.

Craft a Compelling Cover Letter: Write a cover letter that showcases your passion for improving system reliability and automation. Mention specific projects or experiences where you've successfully implemented best practices or innovative solutions.

Showcase Your Technical Skills: In your application, clearly outline your technical skills, especially in Python, Golang, and any configuration management tools you’ve used. Provide examples of how you've applied these skills in previous roles.

Demonstrate Cultural Fit: Research the company's culture and values. In your application, express how your personal values align with their focus on innovation, collaboration, and continuous learning.

How to prepare for a job interview at H&R Talent

✨Showcase Your Technical Skills

Be prepared to discuss your expert-level scripting and coding skills, particularly in Python or Golang. Bring examples of past projects where you've implemented observability systems or worked with configuration management tools.

✨Demonstrate Your Problem-Solving Ability

Highlight your experience in automating processes and eliminating toil in complex systems. Be ready to share specific instances where you identified opportunities for improvement and successfully implemented solutions.

✨Understand the Company Culture

Research the company's focus on technology and data as core business elements. Be ready to discuss how your values align with their innovative and fast-paced environment, and how you can contribute to their goals.

✨Prepare Questions About the Role

Think of insightful questions that show your interest in the role and the company. Ask about their current challenges in reliability and performance, and how the SRE team collaborates across different regions.

Site Reliability Engineer (Applications)

Belfast

Full-Time

Application deadline: 2027-06-23
H&R Talent

View H&R Talent Profile

Similar positions in other companies

UK’s top job board for Gen Z

Discover now