Site Reliability Engineer

Full-Time No home office possible

Role Overview

The Global Analytics team is responsible for developing and maintaining Price Discovery solutions used by the Front Office to generate and disseminate market information to clients. This data and associated financial calculations are integrated into a range of applications across the firm. As the Site Reliability Engineer, you will play a critical role in ensuring the availability, reliability, and performance of our production environment applications bridging the gap between the software and operations engineering teams.

Role Responsibilities

Ensure uptime, availability, and performance of Global Analytics services

Define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs)

Respond to incidents and outages working with the Software and Operations engineering teams to quickly resolve

Respond to application and infrastructure alerts to prevent service disruption

Work with the Software Engineering team to reduce repetitive tasks such as deployments and monitoring

Build and maintain internal tools to improve developer productivity

Implement and maintain logging, metrics and tracing systems with alignment to Global Architecture best practices

Plan for scaling capacity, forecasting future infrastructure needs

Ensure compliance with departmental policies (i.e. change management, IT security standards, release management, incident management)

Collaborate with Software Engineering team to maintain and improve continuous integration and deployment pipelines

Collaborate with QA team to ensure safe and reliable software releases

Ensure that systems are secure and satisfy compliance requirements to meet industry standards and regulatory requirements

Experience / Competences

Educated to degree level or equivalent combination of education and experience

Solid experience working with financial trading systems

Good understanding of high-level Networking systems (e.g. firewalls, load-balancers, etc.)

Experience working with cloud platforms, preferably AWS, with Kubernetes and Docker

Experience working with monitoring and observability tools such as Grafana and Prometheus

Knowledge of CI / CD pipeline tools such as Gitlab and Infrastructure as Code (IaC) tools like Terraform

Scripting and Automation experience, ideally with Python and PowerShell

Experience of application performance profiling tools

Highly analytical, focus on long-term results and delivery

Job Band & Level

Professional / Level 5

#J-18808-Ljbffr

Contact Detail:

TP ICAP Recruiting Team

View TP ICAP Profile

Site Reliability Engineer

TP ICAP

Site Reliability Engineer

Full-Time
TP ICAP

1000-5000

View TP ICAP Profile

Similar positions in other companies

UK’s top job board for Gen Z

Discover now

Site Reliability Engineer

Role Overview

Role Responsibilities

Experience / Competences

Job Band & Level

Site Reliability Engineer

Land your dream job quicker with Premium

Similar positions in other companies

UK’s top job board for Gen Z