Site Reliability Engineer

London | Full-Time | 48000 - 84000 £ / year (est.) | No home office possible

At a Glance

  • Tasks: Be the superhero ensuring our platform's reliability, scalability, and performance.
  • Company: Join Paymentology, a global leader in payment processing technology.
  • Benefits: Enjoy full-time remote work, flexible hours, and a supportive, diverse environment.
  • Why this job: Work on cutting-edge tech projects that make a real difference globally.
  • Qualifications: Bachelor's in Computer Science; 3 years SRE experience; strong cloud and programming skills.
  • Other info: Be part of a collaborative team with opportunities for continuous learning.

The predicted salary is between £48,000 and £84,000 per year.

Paymentology is the first truly global issuer-processor, giving banks and fintechs the technology, team and experience to rapidly issue and process Mastercard, Visa and UnionPay cards across more than 50 countries, at scale. Our advanced multi-cloud platform, offering both shared and dedicated processing instances, vast global presence and richer real-time data set us apart as the leader in payments.

We're on the hunt for an exceptional Site Reliability Engineer (SRE) to join our dedicated team. As an SRE at Paymentology, you'll be the superhero responsible for maintaining, improving and ensuring the high availability, scalability and performance of our platform.

What you get to do:

Platform Reliability and Scalability:

  • Build software that enhances the scalability and reliability of Paymentology services.
  • Ensure platform services meet required uptime and service quality levels.
  • Contribute to the design of reliable cloud infrastructure and implement reusable cloud-uptime components as code.
  • Regularly review and optimise SRE practices, tools and methodologies to enhance overall system reliability and team efficiency.

Observability and Automation:

  • Contribute to the design, implementation and maintenance of observability and monitoring solutions that track the platform's health, cost-effectiveness, reliability and scalability, and that identify potential issues which can be fed back to product and platform engineering in a continuous improvement loop.
  • Develop and implement automation scripts and tools to streamline operations and reduce manual interventions.
  • Enable product teams to self-serve by participating in the development of a developer platform.

Production Issue Resolution:

  • Play an active role with the incident response teams, diagnosing and resolving production issues quickly to minimise downtime.

Standards Compliance:

  • Support product teams in building services that adhere to our security and quality standards.

Cross-team Collaboration:

  • Work closely with engineering, operations and product teams to ensure reliability is considered throughout the end-to-end software development lifecycle. We seek to achieve this through advocacy and developing a culture of reliability.

What you can look forward to:

At Paymentology, we value making a difference to the lives of the people who work for us and who live in the communities where we operate. You can look forward to working with a diverse, global team where Paymentologists at all levels play an important part in our global mission to advance the world through payments and make a difference on a global scale.

Travel: < 5%

Requirements

What it takes to succeed:

  • Bachelor's degree in Computer Science, Information Technology or a related field.
  • A minimum of 3 years in a dedicated SRE role, as well as 5 years of prior software development experience.
  • Comprehensive understanding of large-scale distributed platform architecture.
  • Extensive hands-on cloud experience, particularly with AWS.
  • Proven experience developing scalable, modular infrastructure-as-code projects using tools such as Terraform, CloudFormation, Puppet and Ansible.
  • Practical experience with Docker and container orchestrators, including AWS ECS & EKS and Kubernetes.
  • Experience in administering or integrating identity management systems for SSO, including AWS IAM, Okta and Active Directory.
  • Experience with disaster recovery and redundancy strategies in both cloud and on-premises environments.
  • Proficiency with leading monitoring tools such as Datadog, Honeycomb.io, Splunk, Prometheus, Grafana, ELK Stack and New Relic.
  • Programming expertise, especially in systems programming languages (e.g. Java, Kotlin, Scala) and databases (e.g. SQL Server, PostgreSQL).
  • Familiarity with industry-leading CI/CD tools such as Jenkins, GitHub Actions, GitLab CI, CodePipeline, CircleCI and ArgoCD.
  • Track record of achieving platform-level and end-to-end SLIs, SLOs and SLAs, and fostering accountability.
  • Ability to navigate complex situations and lead effective post-incident reviews (PIRs).
  • Knowledge of implementing solutions to reduce Mean Time to Identify (MTTI) and Mean Time to Resolve (MTTR).
  • Expertise in implementing best practices for load balancing, fault tolerance and resource allocation to maintain service quality and efficiency at scale.
  • Understanding of security best practices within cloud environments.

You'll also need to bring a collaborative mindset, working seamlessly across teams to drive innovative solutions. And, of course, your exceptional communication skills in English will allow you to clearly convey your ideas and recommendations.

As a key member of our technical team, you will be expected to maintain high availability and be ready to address critical incidents, ensuring the continuous performance of our systems. This includes being part of an on-call schedule to support 24/7 operations.

Why Paymentology?

  • Full-time remote position with flexible hours.
  • An inclusive and supportive work environment that values diversity.
  • A chance to work on cutting-edge technology projects that make a difference.
  • Opportunities for continuous learning and development.

Ready to Join Us?

If you're a gadget guru who thrives on optimizing infrastructure, automating all the things and delivering sky-high availability and performance, we want to hear from you! Apply now and be part of a company that values your skills and fosters your growth.

Remote Work: Yes
Employment Type: Independent Contractor

Site Reliability Engineer employer: Paymentology

At Paymentology, we pride ourselves on being an exceptional employer that champions innovation and inclusivity. As a Site Reliability Engineer, you'll enjoy the flexibility of a full-time remote position with adaptable hours, allowing you to balance work and life while contributing to cutting-edge technology projects. Our supportive work culture fosters continuous learning and development, ensuring that you have ample opportunities to grow your skills and make a meaningful impact in the global payments landscape.

Contact Details:

Paymentology Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer role

✨Tip Number 1

Familiarize yourself with the specific tools and technologies mentioned in the job description, such as AWS, Terraform, and Docker. Having hands-on experience with these will not only boost your confidence but also demonstrate your readiness for the role.

✨Tip Number 2

Engage with the SRE community online. Participate in forums or groups where you can discuss best practices and challenges related to site reliability engineering. This will help you stay updated on industry trends and show your passion for the field.

✨Tip Number 3

Prepare to discuss real-world scenarios where you've successfully resolved production issues or improved system reliability. Being able to share specific examples will highlight your problem-solving skills and experience.

✨Tip Number 4

Showcase your collaborative mindset by thinking of ways you can contribute to cross-team efforts. Be ready to discuss how you can work with product and engineering teams to enhance reliability throughout the software development lifecycle.

We think you need these skills to ace the Site Reliability Engineer role

Cloud Infrastructure Design
AWS Expertise
Infrastructure as Code (IaC)
Terraform
CloudFormation
Puppet
Ansible
Docker
Kubernetes
AWS ECS
AWS EKS
Identity Management Systems
AWS IAM
Okta
Active Directory
Disaster Recovery Strategies
Monitoring Tools (e.g., Datadog, Prometheus, Grafana)
Systems Programming Languages (e.g., Java, Kotlin, Scala)
SQL Databases (e.g., SQL Server, PostgreSQL)
CI/CD Tools (e.g., Jenkins, GitHub Actions)
SLIs, SLOs, and SLAs Management
Post-Incident Review (PIR) Leadership
Mean Time to Identify (MTTI) Reduction
Mean Time to Resolve (MTTR) Reduction
Load Balancing Best Practices
Security Best Practices in Cloud Environments
Collaboration Across Teams
Exceptional Communication Skills in English

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience in Site Reliability Engineering and software development. Focus on your hands-on cloud experience, particularly with AWS, and any relevant projects that demonstrate your skills in infrastructure as code.

Craft a Strong Cover Letter: In your cover letter, express your passion for optimizing infrastructure and automating processes. Mention specific tools and technologies you have worked with, such as Terraform, Docker, and monitoring tools like Datadog or Prometheus, to showcase your expertise.

Showcase Problem-Solving Skills: Provide examples of how you've diagnosed and resolved production issues in the past. Highlight your ability to minimize downtime and your experience with post-incident reviews to demonstrate your problem-solving capabilities.

Highlight Collaboration Experience: Emphasize your experience working across teams, especially with engineering, operations, and product teams. Discuss how you advocate for reliability and contribute to a culture of collaboration within the organization.

How to prepare for a job interview at Paymentology

✨Showcase Your Technical Expertise

Be prepared to discuss your hands-on experience with cloud platforms, especially AWS. Highlight specific projects where you've implemented infrastructure as code using tools like Terraform or CloudFormation.

✨Demonstrate Problem-Solving Skills

Prepare examples of how you've diagnosed and resolved production issues in the past. Discuss your approach to minimizing downtime and improving system reliability during incidents.

✨Emphasize Collaboration

Since cross-team collaboration is key, share experiences where you've worked closely with engineering, operations, and product teams. Highlight how you advocated for reliability throughout the software development lifecycle.

✨Communicate Clearly

Your communication skills are crucial. Practice explaining complex technical concepts in simple terms, as you'll need to convey ideas effectively to both technical and non-technical stakeholders.

Application deadline: 2026-12-29
