At a Glance
- Tasks: Lead reliability improvements and shape product direction in a fast-paced trading environment.
- Company: Join CME Group, a global leader in financial services technology.
- Benefits: Competitive salary, benefits, and opportunities for career advancement.
- Why this job: Work with cutting-edge tech and make a real impact on market stability.
- Qualifications: Experience in SRE, strong communication skills, and a growth mindset.
- Other info: Collaborative culture with a focus on innovation and continuous improvement.
The predicted salary is between £36,000 and £60,000 per year.
CME Group is seeking a Staff SRE to help build, operate and scale systems in our Markets portfolio. Markets SREs work on products and applications related to CME’s Globex trading platform. Our systems deliver an exceptional combination of low-latency performance and rock-solid reliability to seamlessly handle the world’s busiest trading days. The successful candidate will have a strong understanding of SRE principles and practices, enjoy the cut-and-thrust of operating Production systems, be a strong communicator, and may have previously worked in an SRE role, a software engineering role, a DevOps role or a systems engineering role.
As a Staff SRE you will lead Product direction for improving reliability. You will shape our roadmap and architecture, and drive high-impact changes across teams.
Key responsibilities:
- Serve as the technical leader for Product reliability - defining a Product Reliability Roadmap and influencing decisions on direction and prioritisation.
- Define, design and lead the implementation of Service-Level Indicators (SLIs) and Service-Level Objectives (SLOs) that truly reflect customer experience, alongside appropriate observability and monitoring.
- Work alongside lead product engineers to design testing for reliability, performance, capacity and DR.
- Lead reliability delivery for the team, assuming accountability while managing risks and dependencies, and ensuring Product leadership are proactively updated.
- Participate in on-call and act as an escalation point for others; step in as Operational Lead during major incident response, demonstrating urgency while remaining calm and considered.
- Lead post-incident analyses and work with stakeholders to prioritise both tactical and strategic improvements.
- Apply a continuous improvement mindset, identify reliability process improvements and work with Product leaders to influence change and adoption of team process.
- Improve reliability, quality, and time-to-market by removing toil and seizing opportunities to shift left.
- Participate in DR testing and continuously improve it.
- Lead Production review meetings based on SLOs, error budgets and incident data and ensure outcomes are decided and prioritised.
- Represent SRE in architecture decisions with reliability and resiliency a priority.
- Mentor other engineers in SRE principles, championing a culture of “SRE as a practice”.
- Map usage to capacity and capacity to cost, while ensuring no impact on reliability.
- Support the migration of markets applications to Google Cloud Platform (GCP), ensuring a seamless transition.
- Stay informed on emerging technologies & latest industry trends, and recognise opportunities for CME Group.
- Develop POCs which can be adopted and reused across the organisation.
Skills and experience:
- A highly accountable and collaborative person with demonstrated ability to influence change and a proven track record of leading large-scale changes.
- Excellent communication and teamwork skills; someone with strong stakeholder management who is able to communicate effectively across disciplines and across regions.
- Experience working in an SRE role.
- Experience with Linux-based systems and with cloud-based platforms; Google Cloud Platform, GCE, and/or GKE a bonus.
- Strong knowledge of application architectures, messaging middleware, and network protocols.
- Experience with monitoring and observability tools such as OpenTelemetry, Splunk, Prometheus, Grafana, etc.
- Experience automating CI/CD processes and solutions.
- A growth mindset; eagerness to learn and adapt in a fast-paced trading environment.
- Understanding of current and emerging technologies.
- Experience working on financial applications and trading platforms in capital markets is a bonus.
- Experience working on an ultra-low-latency (ULL) platform is a bonus.
- Experience working in an agile environment.
Why this job:
- Be part of a global leader in financial services technology.
- Work on cutting-edge technology in a collaborative and innovative culture.
- Competitive compensation and benefits package.
- Opportunity to grow and advance your career in SRE with an organisation that is transforming to this approach.
- Join CME Group and play a crucial role in ensuring the stability and performance of our Markets applications while contributing to the migration to Google Cloud Platform.
Apply now to be a part of our dynamic SRE team!
Staff Site Reliability Engineer in Derry employer: CME Group
Contact Detail:
CME Group Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Staff Site Reliability Engineer role in Derry
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with current employees at CME Group. A friendly chat can sometimes lead to opportunities that aren’t even advertised!
✨Tip Number 2
Show off your skills! Prepare a portfolio or GitHub repository showcasing your SRE projects, automation scripts, or any relevant work. This gives you a chance to demonstrate your expertise beyond just words.
✨Tip Number 3
Practice makes perfect! Get ready for those technical interviews by brushing up on SRE principles, incident management, and cloud technologies. Mock interviews with friends can help you feel more confident.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the CME Group team!
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with SRE principles and practices. We want to see how your skills align with the role, so don’t hold back on showcasing your relevant projects!
Showcase Your Communication Skills: Since this role involves a lot of collaboration, it’s important to demonstrate your communication prowess. Use clear and concise language in your application, and maybe even share examples of how you've effectively communicated across teams in the past.
Highlight Your Technical Expertise: Don’t forget to mention your experience with Linux-based systems, cloud platforms like GCP, and any monitoring tools you’ve used. We’re keen to see how your technical background can contribute to our team’s success!
Apply Through Our Website: We encourage you to apply directly through our website for the best chance of getting noticed. It’s the easiest way for us to keep track of your application and ensure it reaches the right people!
How to prepare for a job interview at CME Group
✨Know Your SRE Principles
Make sure you brush up on your Site Reliability Engineering principles before the interview. Be ready to discuss how you've applied SLIs and SLOs in past roles, as well as your experience with incident response and post-incident analyses.
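As a refresher for the SLI and SLO discussion, here is a minimal Python sketch of the error-budget arithmetic that often comes up in SRE interviews. The function name, parameters and traffic figures are hypothetical and chosen only for illustration; they are not taken from CME Group's systems or from the job description.

```python
def error_budget_report(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarise availability against an SLO for one reporting window.

    All inputs are illustrative assumptions: slo_target is a fraction
    (for example 0.999), and the request counts cover a single window.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction between 0 and 1")
    if total_requests <= 0 or not 0 <= failed_requests <= total_requests:
        raise ValueError("request counts are inconsistent")

    # SLI: the fraction of requests that succeeded in the window.
    sli = (total_requests - failed_requests) / total_requests
    # Error budget: the number of failures the SLO tolerates in the window.
    allowed_failures = (1.0 - slo_target) * total_requests
    # Share of the budget already spent (above 1.0 means the SLO is breached).
    budget_consumed = failed_requests / allowed_failures

    return {
        "sli": sli,
        "allowed_failures": allowed_failures,
        "budget_consumed": budget_consumed,
        "slo_met": sli >= slo_target,
    }


# Hypothetical window: one million requests, 400 failures, 99.9% SLO.
report = error_budget_report(slo_target=0.999, total_requests=1_000_000, failed_requests=400)
print(report)
```

Being able to walk through a calculation like this, and to explain what a team should do when the budget is nearly spent, is one way to show how you have applied SLOs in practice.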
✨Showcase Your Technical Skills
Prepare to talk about your experience with Linux-based systems and cloud platforms, especially Google Cloud Platform. Bring examples of how you've automated CI/CD processes or improved reliability in previous projects to demonstrate your technical prowess.
✨Communicate Effectively
Since strong communication is key for this role, practice articulating your thoughts clearly. Think about how you can convey complex technical concepts to non-technical stakeholders, and be ready to share examples of successful collaboration across teams.
✨Emphasise Continuous Improvement
Be prepared to discuss your mindset around continuous improvement. Share specific instances where you've identified process improvements or led initiatives that enhanced reliability and performance, showing that you're proactive and results-driven.