Engineer - Site Reliability in London

Engineer - Site Reliability in London

London Full-Time 60000 - 80000 £ / year (est.) No working from home possible
Cedar Cares, Inc

At a Glance

  • Tasks: Support high-availability trading platforms and troubleshoot complex technical issues.
  • Company: Join Cboe, a leading global markets company with a dynamic team.
  • Benefits: Competitive salary, flexible working hours, and opportunities for professional growth.
  • Other info: Collaborative environment with global exposure and excellent career advancement potential.
  • Why this job: Make a real impact in the fast-paced world of trading technology.
  • Qualifications: Bachelor's degree in a tech-related field and experience in software or systems engineering.

The predicted salary is between 60000 - 80000 £ per year.

Role Overview

The Site Reliability Engineer (London) is a role served by experienced technologists with a diverse set of skills ranging from software development to systems, network, application, and database management. The Cboe Site Reliability Engineering team is a highly skilled unit responsible for platform engineering, configuration management, implementation, capacity planning, performance tuning, analysis, troubleshooting, reporting, and process automation. This position is instrumental in support of both Cboe’s European markets and Cboe's follow-the-sun support model for its US Global Trading Hours (GTH) markets, providing critical overnight and early‑session coverage from London that ensures continuous, high‑availability operations across Cboe's real‑time low‑latency trading platforms.

The London‑based SRE provides technical support to Cboe Trade Desk and Operations Support Center staff across time zones, and works closely with Software Engineering, Systems Engineering, and Network Engineering teams to troubleshoot complex issues and coordinate platform configuration updates. A Site Reliability Engineer must be able to work independently with little to no direct supervision in performing their duties.

Major Job Duties

  • Platform Configuration Management: Provide configuration management of new and existing trading platforms and support implementation of new features and functionality based on new business requirements. Monitor development activities, change management tickets, and evaluate their impact on Cboe Operations. Execute daily change tickets assigned to Site Reliability Engineering in support of updates to production, disaster recovery, and certification systems. While the primary focus of this role involves support of bare‑metal on‑premises infrastructure, experience with cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes) is desirable.
  • Incident Response & Technical Troubleshooting: Serve as a technical responder for production incidents occurring during US GTH market hours covered from the London time zone. Participate in incident triage, root cause analysis, and resolution in coordination with globally distributed engineering and operations teams. Provide timely, precise communication to stakeholders during active incidents and contribute to post‑incident reviews and remediation tracking to drive long‑term platform stability.
  • System Availability & Technical Support: Provide technical support and operational oversight to sustain resiliency and high availability of critical business operations. Monitor production, disaster recovery, and certification systems for issues. Analyze and optimize performance of real‑time trading platforms. Operate and maintain low‑latency bare‑metal infrastructure, including hardware health, Linux OS tuning, and kernel‑by‑pass networking stacks such as Solarflare/Onload. Investigate software defects and assist the build team to resolve build/deployment issues.
  • Reporting & Data Analysis: Create and improve upon existing reports related to Operations management. Analyze technical data sets (e.g., order entry, market data, matching engine logs) to troubleshoot or explain perceived issues. Execute SQL queries against a database to perform data analysis for customers and associates. Service and support historical data product requests.
  • Capacity Planning: Drive capacity planning decisions for Cboe Exchanges and support capacity planning needs of various Cboe business units. Participate in Capacity Planning meetings with engineering and technical operations management staff.
  • Automation & Process Improvement: Support task and system health automation efforts through development, testing, and maintenance of Python tools. Leverage AI to maximize efficiency.
  • On‑Call & Weekend Testing: Participate in weekend testing (e.g., capacity testing, fail‑over, etc.) and provide follow‑the‑sun on‑call technical support as part of Cboe's global Operations team.

Ideal Candidate

  • Bachelor's Degree: Computer Science, Computer Engineering, Software Engineering, or a related discipline.
  • Area of Expertise and/or Skills: Technical Operations, Unix Shell, Software Engineering OR Systems, Network, or Database Administration; SQL, Python, C++, or other programming language.

Additional Requirements

  • Communication & Language Skills: Fluency in English—both written and spoken—is required. This role demands clear, precise, and unambiguous communication at all times. As the operational bridge between Cboe’s European and APAC SRE teams and its US‑based leadership, the ability to communicate with clarity across time zones, cultures, and technical disciplines is fundamental to the success of this role.
  • Global Markets Awareness: Familiarity with European markets as well as US equities, options, and/or futures market structures is strongly preferred. This role directly supports both Cboe’s European exchange operations and its US GTH market coverage, requiring situational awareness across multiple market sessions and regulatory environments.

Cboe Global Markets is an Equal Opportunity Employer.

Engineer - Site Reliability in London employer: Cedar Cares, Inc

Cboe Global Markets is an exceptional employer, offering a dynamic work environment in London that fosters innovation and collaboration among skilled technologists. With a strong emphasis on employee growth, Cboe provides opportunities for professional development through hands-on experience with cutting-edge technologies and a supportive culture that values clear communication and teamwork across global teams. Employees benefit from a competitive compensation package, flexible working arrangements, and the chance to contribute to high-availability operations in the fast-paced world of trading.

Cedar Cares, Inc

Contact Details:

Cedar Cares, Inc Recruitment Team

We think you need these skills to ace Engineer - Site Reliability in London

SQL
Communication Skills
Problem-Solving Skills
Python
Automation
Data Engineering
ETL/ELT Processes