Senior, Site Reliability Engineer (Infrastructure operations) in Dublin

Senior, Site Reliability Engineer (Infrastructure operations) in Dublin

Dublin Full-Time 60000 - 80000 £ / year (est.) No working from home possible
M

At a Glance

  • Tasks: Lead the reliability of Mastercard's critical payment systems and enhance service quality.
  • Company: Join Mastercard, a global leader in digital payments and innovation.
  • Benefits: Competitive salary, inclusive culture, and opportunities for professional growth.
  • Other info: Dynamic team environment with on-call responsibilities and continuous learning opportunities.
  • Why this job: Make a real impact on global transactions while working with cutting-edge technology.
  • Qualifications: 5-10 years in SRE or related roles, strong troubleshooting skills, and experience with automation tools.

The predicted salary is between 60000 - 80000 £ per year.

Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.

About the Role: Mastercard’s Program aligned Site Reliability Engineering (SRE) teams are dedicated to delivering a seamless experience for our customers. We achieve this by maintaining every aspect of our Programs infrastructure and technology ecosystem to the highest standards, ensuring compliance with rigorous security requirements. Within Mastercard, SRE focuses on the reliability and performance of core infrastructure, networks, and foundational services that power our applications. Our mission is to ensure these components operate with excellence, enabling applications to deliver an outstanding customer experience. In this role, you will join our Payments Network SRE team and take ownership of continuously assessing and elevating the end to end service quality of our platform. You will leverage data to drive root cause analysis and deliver strategic insights to key stakeholders on resource utilization, capacity forecasting, and performance trends—ensuring the availability, scalability, and resilience of our network.

Key Responsibilities:

  • Lead continuous assessments of the application infrastructure supporting critical Mastercard applications, focusing on health, performance, monitoring and alerting, and capacity analysis.
  • Collaborate with Product and Development teams to forecast growth requirements and ensure scalability and resiliency.
  • Champion observability as a core principle for infrastructure services by assessing environments and technologies to uncover gaps in monitoring and alerting.
  • Design and implement strategies to close these gaps, ensuring all infrastructure telemetry is integrated into a unified, single-pane-of-glass view.
  • Build custom dashboards to investigate and perform root cause analysis on complex issues.
  • Lead regular incident reviews with internal support teams to ensure root causes are identified.
  • When patterns of failure or compatibility issues between software and infrastructure emerge, develop and implement strategies to remediate or mitigate risks.
  • Leverage automation and AI technologies to enhance proactive issue detection, enable self-healing capabilities, reducing Mean Time to Detect (MTTD) and Mean Time to Mitigate (MTTM).
  • Develop testing and validation plans for new environment builds, disaster recovery exercises and post-maintenance activities to certify environment readiness before customer traffic is routed to it.
  • Champion continuous learning, development, and knowledge sharing across networking and other infrastructure disciplines to strengthen multi-disciplinary SRE team capabilities.
  • Lead training initiatives for team members and Product and Development on networking aspects of the platforms.
  • Evaluate vendor hardware, firmware, and software upgrade roadmaps, and conduct proof-of-concept (POC) testing to identify potential risks and opportunities for improvement in upcoming releases.

All about you:

  • 5–10 years of experience in an SRE or SRE related operations role, including 3+ years supporting e-commerce, financial services, or large scale SaaS platforms.
  • Excellent infrastructure troubleshooting and analytical problem solving skills.
  • Strong hands on experience with observability and monitoring tools such as Splunk, Dynatrace, or equivalent, with a proven ability to triage and investigate complex issues.
  • Familiarity with network telemetry tools such as SolarWinds and NetScout.
  • Proficiency in packet level debugging, including capturing traffic with tools like tcpdump and analyzing packets using Wireshark.
  • Broad understanding of end to end infrastructure supporting payment platforms—spanning platform services, networking, databases, and storage.
  • Experience with automation and Infrastructure as Code tools such as Chef, Ansible, and Terraform, as well as structured data formats (JSON/YAML).
  • Excellent communication skills with the ability to coordinate cross functional troubleshooting efforts and lead RCA processes to closure.
  • Demonstrated ability to troubleshoot complex production issues, perform root cause analysis, and drive long term corrective actions.
  • Experience partnering with development teams to shape architecture, define SLIs/SLOs, and embed reliability into services from design through operation.
  • Strong understanding of monitoring and observability ecosystems, including Prometheus, Grafana, ELK/EFK, Splunk, Dyantrace, and OpenTelemetry.
  • Effective incident management skills with a structured, analytical approach to problem solving.

The Payments Network SRE team is responsible for the runtime availability of some of Mastercard’s most critical core payment systems, which support national infrastructure and operate 24/7 year-round. As a result, this role will include periodic on-call responsibilities when required.

Corporate Security Responsibility

All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:

  • Abide by Mastercard’s security policies and practices;
  • Ensure the confidentiality and integrity of the information being accessed;
  • Report any suspected information security violation or breach, and
  • Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.

Senior, Site Reliability Engineer (Infrastructure operations) in Dublin employer: Mastercard

Mastercard is an exceptional employer that fosters a culture of innovation and collaboration, empowering employees to drive meaningful change in the digital economy. With a commitment to professional growth, employees benefit from continuous learning opportunities and a supportive environment that values diversity and inclusion. Located in a dynamic industry, Mastercard offers competitive benefits and the chance to work on critical infrastructure that impacts millions globally, making it a rewarding place to advance your career.

M

Contact Details:

Mastercard Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Senior, Site Reliability Engineer (Infrastructure operations) in Dublin

Tip Number 1

Network with current employees at Mastercard! Reach out on LinkedIn or attend industry events. Having an insider's perspective can give you a leg up and help you understand the company culture better.

Tip Number 2

Prepare for technical interviews by brushing up on your SRE skills. Practice troubleshooting scenarios and be ready to discuss your experience with observability tools like Splunk or Dynatrace. We want to see how you think on your feet!

Tip Number 3

Showcase your problem-solving skills during interviews. Be ready to share specific examples of how you've tackled complex issues in the past, especially in high-pressure environments. This is your chance to shine!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining the Mastercard team.

We think you need these skills to ace Senior, Site Reliability Engineer (Infrastructure operations) in Dublin

Site Reliability Engineering (SRE)
Infrastructure Operations
Observability and Monitoring Tools
Splunk
Dynatrace
Network Telemetry Tools
SolarWinds

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Senior Site Reliability Engineer role. Highlight your experience with infrastructure operations, observability tools, and any relevant projects that showcase your skills in maintaining high-performance systems.

Craft a Compelling Cover Letter:Your cover letter should tell us why you're passionate about this role at Mastercard. Share specific examples of how you've improved system reliability or performance in previous positions, and connect your experiences to our mission of building a sustainable economy.

Showcase Your Problem-Solving Skills:In your application, emphasise your analytical problem-solving skills. We want to see how you've tackled complex issues in the past, especially in high-pressure environments. Use concrete examples to illustrate your approach to troubleshooting and root cause analysis.

Apply Through Our Website:Don't forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team at Mastercard!

How to prepare for a job interview at Mastercard

Know Your Infrastructure Inside Out

Before the interview, make sure you have a solid understanding of the infrastructure that supports payment platforms. Brush up on your knowledge of observability tools like Splunk and Dynatrace, as well as network telemetry tools such as SolarWinds. Being able to discuss specific examples of how you've used these tools in past roles will impress the interviewers.

Showcase Your Problem-Solving Skills

Prepare to discuss complex production issues you've encountered and how you approached root cause analysis. Use the STAR method (Situation, Task, Action, Result) to structure your answers. This will demonstrate your analytical problem-solving skills and your ability to drive long-term corrective actions.

Emphasise Collaboration and Communication

Since this role involves working closely with Product and Development teams, be ready to share examples of how you've successfully collaborated in the past. Highlight your communication skills and how you've coordinated cross-functional troubleshooting efforts. This will show that you're not just technically proficient but also a team player.

Prepare for Incident Management Scenarios

Expect questions about incident management and your approach to handling outages or performance issues. Be prepared to discuss your structured, analytical approach to problem-solving and how you ensure that incidents are managed effectively. This will demonstrate your readiness for the on-call responsibilities that come with the role.