At a Glance
- Tasks: Lead the evolution to scalable microservices and enhance system reliability.
- Company: Join StarCompliance, a leader in simplifying compliance for financial institutions.
- Benefits: Competitive salary, flexible working options, and opportunities for professional growth.
- Why this job: Make a real impact on cloud-native systems and drive innovation in reliability practices.
- Qualifications: 5+ years in SRE or DevOps, with strong cloud platform experience.
- Other info: Collaborative environment focused on empowering teams and fostering a culture of reliability.
The predicted salary is between 36000 - 60000 ÂŁ per year.
About StarCompliance
StarCompliance is on a mission to make compliance simple and easy. Trusted globally by enterprise financial institutions, the user-friendly STAR platform empowers organizations to achieve regulatory compliance while safeguarding their integrity and business reputations. Through a customizable, 360-degree view of employee activity, the STAR software enables firms to automate the detection and resolution of potential areas of conflict while streamlining daily workflows and increasing efficiency.
Location: Candidates MUST be UK based and have right to work.
We are seeking a highly skilled and pragmatic Site Reliability Engineer (SRE) to help lead our evolution from legacy single-tenant monoliths to modern, scalable, multi-tenant microservices. This is a pivotal role for our business, enabling faster delivery, improved reliability, and real scalability across our SaaS portfolio.
While we’ve got a solid handle on infrastructure monitoring, we’re still in the early innings when it comes to application-level observability, autoscaling, and progressive delivery strategies (e.g., canary releases, blue/green deployments). That’s where you come in.
You’ll work closely with Infrastructure, Architecture, Engineering, and Support teams to design, build, and evangelize the next generation of SRE practices and tools that ensure uptime, resiliency, and customer trust.
Responsibilities
- Champion Reliability by Design: Collaborate with architects and engineers to build resilient, fault-tolerant systems across our evolving cloud-native stack.
- Observability Overhaul: Lead the charge on full-stack observability, leveraging modern APM tooling, meaningful SLOs/SLIs, and actionable alerts.
- Scaling Systems: Develop and implement auto-scaling strategies, load testing plans, and capacity forecasting for multi-tenant environments.
- Progressive Delivery: Help implement and automate deployment strategies such as canary releases, feature flags, and blue/green rollouts.
- Incident Response: Create and refine on-call processes, incident response playbooks, and blameless post-mortem routines.
- Monitoring & Tooling: Own and evolve our monitoring infrastructure, integrating metrics, logs, and traces into a cohesive ecosystem.
- Developer Empowerment: Build reusable templates, dashboards, and platform tooling to empower dev teams to “shift left” on reliability.
- Cross-functional Collaboration: Work hand-in-hand with Infrastructure, Architecture, Support, and Engineering teams to drive shared accountability for uptime and performance.
Skills
- 5+ years in SRE, DevOps, or Production Engineering roles, ideally within a SaaS or cloud-native environment.
- Deep experience with cloud platforms (preferably Azure or AWS), and Infrastructure-as-Code tools (e.g. Terraform).
- Hands-on experience with Azure DevOps is strongly preferred, as our CI/CD and project workflows are fully built around it.
- Proficiency with observability tools such as New Relic, Datadog, Prometheus, or similar.
- Strong understanding of software deployment strategies, CI/CD pipelines, and release engineering.
- Ability to code in at least one modern scripting or systems language (e.g., Python, PowerShell, Go, Bash).
- Experience operating multi-tenant environments with an emphasis on security, performance, and cost optimization.
- Excellent communicator who thrives in cross-functional settings and can influence engineering culture around reliability.
Desirable Skills
- Experience in regulated industries (e.g., financial services, healthcare).
- Background with service mesh architectures, distributed tracing, and gRPC/GraphQL.
- Familiarity with incident management platforms (e.g., PagerDuty, OpsGenie).
- Contributions to open-source SRE tooling or frameworks.
StarCompliance Background Checks
All positions require pre-employment screening due to employees potentially having access to highly sensitive and confidential information involving finance and compliance; candidates must be trustworthy and have a heightened sensitivity to protecting confidential financial, professional information. To be eligible for employment with StarCompliance, candidates must undergo a rigorous background investigation with checks including, but not limited to, criminal record history, consumer credit, employment history, qualifications, and education checks.
Equal Opportunity Employer Statement
We prohibit discrimination and harassment of any kind based on race, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, gender identity or expression, marital/civil union/domestic partnership status, veteran status or any other protected characteristic as outlined by country, state, or local laws. This policy applies to all employment practices within our organisation, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. StarCompliance makes hiring decisions based solely on qualifications, merit, and business needs at the time. For more information, please request a copy of our Equal Opportunities Policy.
Site Reliability Engineer UK in London employer: StarCompliance, LLC
Contact Detail:
StarCompliance, LLC Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer UK in London
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with current employees at StarCompliance. A friendly chat can sometimes lead to opportunities that aren’t even advertised!
✨Tip Number 2
Show off your skills! If you’ve got a portfolio or GitHub with projects related to SRE, make sure to highlight them during interviews. It’s a great way to demonstrate your hands-on experience and passion for the field.
✨Tip Number 3
Prepare for technical interviews by brushing up on your knowledge of cloud platforms and observability tools. Practice common SRE scenarios and be ready to discuss how you’d tackle real-world problems at StarCompliance.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the team at StarCompliance.
We think you need these skills to ace Site Reliability Engineer UK in London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with cloud platforms, observability tools, and any relevant projects that showcase your skills in SRE practices.
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about reliability engineering and how your background aligns with our mission at StarCompliance. Keep it concise but impactful!
Showcase Your Technical Skills: Don’t forget to mention your hands-on experience with tools like Azure DevOps and Infrastructure-as-Code. We want to see your technical prowess, so be specific about the technologies you've worked with and the results you've achieved.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!
How to prepare for a job interview at StarCompliance, LLC
✨Know Your Tech Stack
Make sure you’re well-versed in the cloud platforms mentioned in the job description, especially Azure or AWS. Brush up on Infrastructure-as-Code tools like Terraform and be ready to discuss your hands-on experience with them.
✨Demonstrate Observability Knowledge
Familiarise yourself with observability tools such as New Relic, Datadog, or Prometheus. Be prepared to share examples of how you've implemented full-stack observability in past roles and how it improved system reliability.
✨Showcase Your Coding Skills
Since coding is a key part of the role, practice coding in at least one modern scripting language like Python or Go. You might be asked to solve a problem on the spot, so brush up on your coding skills and be ready to demonstrate your thought process.
✨Prepare for Cross-Functional Collaboration
This role requires working closely with various teams, so think of examples where you’ve successfully collaborated across departments. Highlight your communication skills and how you’ve influenced engineering culture around reliability in previous positions.