Site Reliability Engineer

Full-time. No home office possible.

Role Overview

The Global Analytics team develops and maintains the Price Discovery solutions that the Front Office uses to generate and disseminate market information to clients. This data and the associated financial calculations are integrated into a range of applications across the firm. As the Site Reliability Engineer, you will play a critical role in ensuring the availability, reliability, and performance of our production applications, bridging the gap between the software engineering and operations engineering teams.

Role Responsibilities

Ensure uptime, availability, and performance of Global Analytics services

Define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs)

Respond to incidents and outages, working with the Software and Operations engineering teams to resolve them quickly

Respond to application and infrastructure alerts to prevent service disruption

Work with the Software Engineering team to reduce repetitive tasks such as deployments and monitoring

Build and maintain internal tools to improve developer productivity

Implement and maintain logging, metrics, and tracing systems in line with Global Architecture best practices

Plan capacity scaling and forecast future infrastructure needs

Ensure compliance with departmental policies (e.g. change management, IT security standards, release management, incident management)

Collaborate with the Software Engineering team to maintain and improve continuous integration and deployment pipelines

Collaborate with the QA team to ensure safe and reliable software releases

Ensure that systems are secure and comply with industry standards and regulatory requirements

Experience / Competences

Educated to degree level or equivalent combination of education and experience

Solid experience working with financial trading systems

Good understanding of high-level networking systems (e.g. firewalls, load balancers)

Experience working with cloud platforms, preferably AWS, with Kubernetes and Docker

Experience working with monitoring and observability tools such as Grafana and Prometheus

Knowledge of CI/CD pipeline tools such as GitLab and Infrastructure as Code (IaC) tools such as Terraform

Scripting and automation experience, ideally with Python and PowerShell

Experience with application performance profiling tools

Highly analytical, with a focus on long-term results and delivery

Job Band & Level

Professional / Level 5


Contact Detail:

TP ICAP Recruiting Team
