Site Reliability Engineering Manager (Data Infra)

Full-Time | £60,000 - £80,000 / year (est.) | Hybrid (two office days a week)
ComplyAdvantage

At a Glance

  • Tasks: Lead a team of SREs to enhance system reliability and performance.
  • Company: Join an innovative tech company fighting financial crime.
  • Benefits: Equity participation, unlimited time off, hybrid work, and annual learning budget.
  • Other info: Collaborative environment with excellent career development opportunities.
  • Why this job: Shape the future of tech while making a real-world impact.
  • Qualifications: Experience in managing engineering teams and cloud-native architectures.

We are looking for a driven and experienced Site Reliability Engineering Manager to join our innovative Tech Team. You will lead and empower a team of SREs, partnering closely with Engineering, Product, and Security to ensure our platforms are resilient, scalable, and secure. You will play a key role in shaping reliability strategy, improving system performance, and embedding best practices in observability, incident management, and automation to support the delivery of high-impact solutions in the fight against financial crime.

Responsibilities

  • Take ownership of your team: you are responsible for the growth and development of current team members, and for hiring and onboarding new ones.
  • Create a positive environment in which your team members thrive and deliver the best outcomes and innovations.
  • Be a role model for your team, mentoring and coaching them while keeping a learning mindset yourself and staying open to new ideas and technologies.
  • Within the context of our broader technology vision, set the direction for your team and take accountability for tech decisions.
  • Use your specific experience of working with cloud systems to inform technical decision‑making.
  • Work with other stakeholders across engineering to ensure the systems and services your team provides meet the needs of your internal customers.
  • Collaborate both within your team and across the tribe to ensure your team’s implementation meets industry standards.

The role reports to the Director of Infrastructure. You’ll be managing a team of engineers focused on the provision and support of the stateful/data-layer technologies that power all of our services, both in development and production. The main technologies we use are YugabyteDB (sharded Postgres), Kafka (via Strimzi), Elasticsearch (via ECK), Redis, and Spark and data warehousing on GCP and AWS using their PaaS offerings. As this technology stack underpins all other engineering work, a collaborative mindset is a must.

Tech Stack

ComplyAdvantage is fully cloud-based, with a modern Kubernetes-focused tech stack. All compute workloads run in Kubernetes, with clusters in multiple regions to support the needs of our global client base. Our production services are multi-cloud by design and are currently hosted in AWS and GCP. We make heavy use of Terraform and Helm to define our infrastructure and services, and lean heavily on GitOps paradigms: production and non‑production environments are defined in Git, and changes to these environments (both cloud infrastructure and Kubernetes applications) are managed via Git. ArgoCD is our tool of choice for controlling our deployments and, paired with our Istio mesh, allows our development teams to use advanced deployment patterns such as progressive rollouts. Our observability stack consists of Grafana Cloud, along with some on‑prem Mimir, amongst others. We focus on OpenTelemetry for application metrics, with SLO- and metric-driven alerting at all levels, from cloud infrastructure through to application performance.

Qualifications

  • Have experience of managing and growing high-performing engineering teams.
  • Have experience with Kubernetes and Terraform.
  • Have experience hosting microservices‑based architectures.
  • Have experience of working with cloud-native architectures (AWS and GCP are preferred).
  • Have good communication and writing skills, including experience writing technical documentation.

Nice to Haves

  • Have experience of working in a start‑up/scale‑up environment.
  • Have experience managing observability platforms, whether self-hosted or third‑party, e.g. Grafana stack, Datadog, New Relic.
  • Have experience managing pipeline tools, whether self-hosted or third‑party, e.g. CircleCI, ArgoCD, Harness.

Benefits

  • Equity participation in our innovative mission to combat financial crime.
  • Unlimited Time Off Policy to promote work‑life balance and well‑being.
  • A hybrid working approach with two days a week in the office, which we strongly believe fosters collaboration and helps build meaningful relationships.
  • Opportunities for collaboration and career development with smart, like‑minded professionals.
  • Annual learning budget to support professional growth.
  • A home office budget to support working from home.
  • Enhanced parental leave and childcare benefits.
  • Life insurance and medical coverage through BUPA, including cover for pre‑existing conditions.
  • Pension contribution through The People’s Pension.

Site Reliability Engineering Manager (Data Infra) employer: ComplyAdvantage

At ComplyAdvantage, we pride ourselves on being an exceptional employer that champions innovation and collaboration within our Tech Team. Our commitment to employee growth is evident through our unlimited time off policy, annual learning budget, and a hybrid work model that fosters meaningful relationships while ensuring work-life balance. Join us in our mission to combat financial crime, where you will lead a talented team in a supportive environment that values your contributions and encourages continuous learning.

We think you need these skills to ace the Site Reliability Engineering Manager (Data Infra) role

Team Leadership
Cloud Systems Experience
Kubernetes
Terraform
Microservices Architecture
AWS
GCP