At a Glance
- Tasks: Operate and improve a leading fraud protection SaaS platform in a dynamic team.
- Company: Join Visa, a global leader in payments technology, dedicated to making a difference.
- Benefits: Hybrid work model, competitive salary, and opportunities for professional growth.
- Other info: Collaborative environment with a focus on learning and career advancement.
- Why this job: Make a real impact by tackling financial crime with innovative technology.
- Qualifications: Experience with cloud infrastructure, coding skills, and a passion for problem-solving.
The predicted salary is between 50000 - 70000 £ per year.
About Us
Visa is a world leader in payments technology, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories, dedicated to uplifting everyone, everywhere by being the best way to pay and be paid. At Visa, you'll have the opportunity to create impact at scale - tackling meaningful challenges, growing your skills and seeing your contributions impact lives around the world. Join Visa and do work that matters - to you, to your community, and to the world. Progress starts with you.
At Featurespace (a Visa company), we strive to be the world's best software company at protecting our clients and their customers from fraud attacks and fighting financial crime. We do that with personality, heart and professionalism - cultivating an innovative, fun and positive team atmosphere where everybody can contribute to solving our clients' problems in new, innovative ways.
The Opportunity
As a Site Reliability Engineer (Cloud Ops), you will help operate and continuously improve Featurespace's world-leading product, ARIC Risk Hub, delivered as a robust cloud-based SaaS solution. You will work as part of the Cloud Operations / SRE team to ensure our platform is reliable, scalable, measurable, repeatable, secure, and cost-effective. You will participate in designing, developing, deploying, monitoring, supporting, documenting, and troubleshooting our SaaS platform, collaborating closely with engineering, data science, internal stakeholders, external vendors, and customers to deliver excellent service outcomes.
Responsibilities
- Operate and support production deployments of ARIC Risk Hub SaaS, including deploying, maintaining, monitoring, upgrading, and troubleshooting platform and application components.
- Build software and systems to manage platform infrastructure and applications.
- Continuously evaluate and improve technology and operational processes to increase quality, reduce costs, and improve time-to-market.
- Participate in service resilience and failure testing, including predictable and unpredictable failure scenarios.
- Provide second-line operational support for SaaS customers, ensuring timely and high-quality issue resolution.
- Gather service performance data and generate reports and insights to guide reliability and scalability improvements.
- Develop, maintain, and document internal processes and operational runbooks.
- Collaborate with engineering and data science teams to drive new and improved ARIC Risk Hub capabilities.
- Participate in an on-call roster, including out-of-hours support as required.
This is a hybrid position. Expectation of days in office will be confirmed by your Hiring Manager.
Qualifications
Core Skills- Experience administering cloud infrastructure or supporting cloud applications (preferably AWS).
- Working knowledge of Linux, shell scripting, and command-line tools.
- Ability to write or maintain code in at least one high-level programming language (e.g., Python).
- Understanding of networking fundamentals (e.g., DNS, routing, firewalls).
- Familiarity with source control systems (e.g., Git).
- Exposure to CI/CD concepts and pipelines.
- Familiarity with monitoring, metrics, and alerting systems.
- Experience operating and supporting production-grade services.
- Ability to write clear technical documentation and follow defined operational processes.
- Infrastructure as Code and configuration management experience (e.g., Terraform, SaltStack, Ansible).
- Experience with containerization (Docker) and Kubernetes (deploying or operating services).
- Exposure to service mesh technologies (e.g., Istio).
- Experience building or operating cloud-native or serverless applications.
- Familiarity with observability and data platforms such as Prometheus, Grafana, MongoDB, Elasticsearch, Kafka, and HashiCorp Vault.
- Understanding of application and data security fundamentals (authentication, authorization, encryption, TLS).
- Awareness of regulated standards (e.g., PCIDSS, SOC2, ISO27001).
Relevant industry experience supporting cloud-based SaaS platforms in production environments.
Excellent interpersonal and communication skills, with the ability to collaborate across teams and organizations.
Strong attention to detail and a proactive, best-practice driven approach to work.
Passion for learning new skills and technologies and staying current with industry developments.
Curiosity, innovation, and enthusiasm for solving complex problems.
Strong time-management skills and the ability to prioritise effectively.
Site Reliability Engineer (Cambridge) employer: Visa
Contact Detail:
Visa Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (Cambridge)
✨Tip Number 1
Network like a pro! Reach out to current employees at Featurespace or Visa on LinkedIn. A friendly chat can give you insider info and maybe even a referral, which can really boost your chances.
✨Tip Number 2
Prepare for the interview by brushing up on your technical skills. Make sure you can talk confidently about cloud infrastructure, Linux, and any programming languages you know. We want to see your passion and expertise shine through!
✨Tip Number 3
Show off your problem-solving skills! Be ready to discuss past experiences where you tackled complex issues, especially in cloud environments. We love candidates who can think on their feet and come up with innovative solutions.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Featurespace.
We think you need these skills to ace Site Reliability Engineer (Cambridge)
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter for the Site Reliability Engineer role. Highlight your experience with cloud infrastructure, coding skills, and any relevant projects that showcase your problem-solving abilities.
Show Your Passion: Let us see your enthusiasm for technology and learning! Mention any recent courses, certifications, or personal projects that demonstrate your commitment to staying current in the field.
Be Clear and Concise: When writing your application, keep it straightforward. Use clear language and avoid jargon unless it's relevant. We want to understand your skills and experiences without wading through unnecessary fluff.
Apply Through Our Website: Don’t forget to submit your application through our official website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it’s super easy!
How to prepare for a job interview at Visa
✨Know Your Cloud Basics
Make sure you brush up on your cloud infrastructure knowledge, especially if you're familiar with AWS. Be ready to discuss how you've administered cloud applications or managed deployments in the past.
✨Show Off Your Scripting Skills
Since you'll be working with Linux and shell scripting, it’s a good idea to prepare examples of scripts you've written. Highlight how these scripts improved efficiency or solved specific problems in your previous roles.
✨Demonstrate Problem-Solving Prowess
Be prepared to talk about a time you faced a challenging issue in a production environment. Discuss how you approached troubleshooting and what steps you took to resolve the problem, showcasing your analytical skills.
✨Communicate Clearly
As collaboration is key in this role, practice explaining technical concepts in simple terms. This will help demonstrate your communication skills and ability to work across teams, which is crucial for a Site Reliability Engineer.