Site Reliability Engineer III (Tue - Sat) in Belfast

Belfast · Full-Time · No working from home possible
CME Group

As the world's leading and most diverse derivatives marketplace, CME Group (www.cmegroup.com) is seeking a Site Reliability Engineer III (Tue - Sat) to lead the design, development, and operation of high‑performance systems in our Markets portfolio. In this role you will work closely with senior engineers on complex projects, own key reliability initiatives, and mentor junior colleagues.

Key Responsibilities

  • Own Observability: Design, build, and refine monitoring, alerting, and observability solutions. Drive continuous improvement of our SLIs & SLOs to enable faster issue detection and resolution.
  • Drive Reliability Projects: Take ownership of reliability‑focused projects from design to implementation, collaborating with product teams to ensure new features are scalable, resilient, and safe.
  • Lead Technical Solutions: Lead technical discussions for your work, presenting solution options and proposals with clear trade‑offs.
  • Automate Intelligently: Proactively identify and eliminate toil through robust automation, improving both system reliability and team velocity.
  • Manage Incidents: Take a leading role in incident response, owning the resolution of significant incidents, ensuring rapid system recovery, and driving meaningful action from blameless post‑mortems.
  • Mentor & Coach: Act as a technical mentor and point of escalation for L1 and L2 SREs, fostering their growth through code reviews and paired work.
  • Architect for the Future: Contribute your own ideas to the product backlog and play an active role in the architectural design for the migration to Google Cloud Platform (GCP).

What We're Looking For

  • 3–5+ years of professional experience in a Site Reliability, DevOps, Software, or Systems Engineering role.
  • Strong, hands‑on experience administering and troubleshooting Linux‑based production systems.
  • Proficiency in a programming language such as Python or Go, with a track record of automating complex operational tasks.
  • Proven ability to lead technical initiatives and solve complex problems with a high degree of autonomy.
  • Excellent communication skills, with the ability to articulate complex technical concepts to diverse audiences.
  • A proactive and ownership‑oriented mindset.
  • Cloud Platforms: Deep experience with Google Cloud Platform (GCP), especially GCE, GKE, and cloud networking.
  • Monitoring Tools: Expertise in designing and managing monitoring stacks (e.g., Prometheus, Grafana, OpenTelemetry).
  • Distributed Systems: Strong practical knowledge of building and maintaining large‑scale distributed systems.
  • Containerisation: Advanced experience with Kubernetes and Docker in a production environment.
  • Networking: Solid understanding of networking protocols (HTTP, TCP/UDP, IP) and network architecture.
  • Domain Knowledge: Experience in financial markets, low‑latency systems, or with message‑oriented middleware.

Benefits

  • Bonus Programme
  • Equity Programme
  • Employee Stock Purchase Plan (ESPP)
  • Private Medical and Dental coverage
  • Income Protection
  • Life Assurance
  • Cycle To Work
  • Family Leave
  • Education Assistance – MBA/Advanced Degree/Bachelor's Degree
  • Ongoing Employee Development Training/Certification
  • Hybrid Working

As an equal‑opportunity employer, we consider all potential employees without regard to any protected characteristic.

Contact Details:

CME Group Recruitment Team