At a Glance
- Tasks: Design and maintain automation for a cloud-native data platform on AWS.
- Company: Major international financial services organisation with a focus on innovation.
- Benefits: Competitive contract rate, flexible working, and opportunities for skill development.
- Why this job: Join a dynamic team to enhance reliability and resilience in a cutting-edge data environment.
- Qualifications: Experience in SRE principles, AWS, and automation tools like Terraform.
- Other info: Contract role with potential for growth until January 2027.
The predicted salary is between £36,000 and £60,000 per year.
We are recruiting an AWS Site Reliability Engineer (SRE) to support a cloud-native data platform for a major international financial services organisation. The platform is built on AWS, with core components including Snowflake and Databricks, and underpins critical analytics and data services used across the business. This role focuses on reliability engineering, automation, observability, and resilience. You will work closely with data engineering and platform teams to ensure the platform is scalable, highly available, and operationally robust in a regulated environment.
Key Responsibilities
- Design, build, and maintain automation for infrastructure provisioning, platform operations, and incident response using Infrastructure as Code (IaC) and CI/CD.
- Lead resiliency and disaster recovery (DR) planning, including DR testing, failure scenarios, and recovery validation across AWS and data platform services.
- Define and manage SLIs, SLOs, and SLAs for critical data pipelines and platform services, using error budgets to drive reliability improvements.
- Build and operate comprehensive observability solutions (metrics, logs, traces, alerting) across AWS, Snowflake, and Databricks workloads.
- Partner with data engineering and platform teams to embed reliability-by-design into architecture and delivery.
- Perform root cause analysis (RCA) on incidents and drive continuous improvement to reduce operational toil.
- Own and drive resolution of incidents and service requests raised by platform consumers, identifying recurring issues and automating fixes to improve reliability and user experience.
Required Skills & Experience
- Strong practical experience applying Site Reliability Engineering (SRE) principles, including SLO/SLI/SLA design and error budgets.
- Proven production experience with AWS (e.g. EC2, S3, IAM, VPC, CloudWatch).
- Hands-on experience with automation and Infrastructure as Code (Terraform, CloudFormation, or CDK).
- Experience building and operating observability and monitoring solutions.
- Scripting experience in Python and/or Bash.
- Exposure to data platforms such as Snowflake and/or Databricks.
Site Reliability Engineer in Glasgow. Employer: Paritas Recruitment
Contact details:
Paritas Recruitment Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Site Reliability Engineer role in Glasgow
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to AWS, automation, and observability. This gives potential employers a taste of what you can do beyond your CV.
✨Tip Number 3
Prepare for interviews by brushing up on SRE principles and AWS services. Practice common interview questions and scenarios related to reliability engineering and incident response. Confidence is key!
✨Tip Number 4
Don’t forget to apply through our website! We’ve got some fantastic opportunities waiting for you, and applying directly can sometimes give you an edge. Let’s get you that dream job!
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with AWS, automation, and observability. We want to see how your skills align with our needs!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about SRE and how you can contribute to our cloud-native data platform. Keep it engaging and relevant to the job description.
Showcase Your Projects: If you've worked on any relevant projects, make sure to mention them! Whether it's automation scripts or monitoring solutions, we love seeing practical examples of your work that demonstrate your skills.
Apply Through Our Website: We encourage you to apply through our website for a smoother process. It helps us keep track of applications and ensures you get all the updates directly from us. Plus, it’s super easy!
How to prepare for a job interview at Paritas Recruitment
✨Know Your AWS Inside Out
Make sure you brush up on your AWS knowledge, especially services like EC2, S3, and CloudWatch. Be ready to discuss how you've used these in past projects, as well as any challenges you faced and how you overcame them.
✨Showcase Your Automation Skills
Prepare examples of how you've implemented Infrastructure as Code using tools like Terraform or CloudFormation. Highlight specific scenarios where your automation efforts improved efficiency or reliability in a production environment.
✨Understand Reliability Metrics
Familiarise yourself with SLIs, SLOs, and SLAs. Be prepared to explain how you've defined and managed these metrics in previous roles, and how they contributed to the overall reliability of the systems you worked on.
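If it helps to have the arithmetic at your fingertips, here is a minimal Python sketch of how an error budget can be derived from an SLO. It is an illustration only, not something taken from the employer's platform: the function names, the 99.9% target, the 30-day window, and the request counts are all assumptions chosen for the example.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction, e.g. 0.999 for "99.9% available".
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent for an event-based SLI.

    1.0 means nothing has been spent; 0.0 means the budget is exhausted;
    a negative value means the SLO has been breached.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be a fraction between 0 and 1")
    if total_events <= 0 or not 0 <= good_events <= total_events:
        raise ValueError("need 0 <= good_events <= total_events and total_events > 0")
    allowed_bad = (1.0 - slo_target) * total_events
    actual_bad = total_events - good_events
    return 1.0 - actual_bad / allowed_bad


if __name__ == "__main__":
    # Hypothetical figures: a 99.9% SLO over 30 days, and a pipeline that
    # served 1,000,000 requests of which 999,500 met the SLI.
    print(round(error_budget_minutes(0.999), 1))                    # 43.2
    print(round(budget_remaining(0.999, 999_500, 1_000_000), 2))    # 0.5
```

Being able to say "a 99.9% monthly SLO leaves roughly 43 minutes of downtime, and we had spent half of it" is the kind of concrete detail that makes an answer about reliability metrics land.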
✨Demonstrate Problem-Solving Abilities
Think of a few incidents you've dealt with in the past and be ready to walk through your root cause analysis process. Discuss how you identified recurring issues and what steps you took to automate fixes and improve user experience.