At a Glance
- Tasks: Enhance reliability and performance of critical systems using AWS and DevOps practices.
- Company: Join Experian, a global leader in data and technology, making a real impact.
- Benefits: Competitive salary, bonus, healthcare, generous leave, and volunteering days.
- Other info: Opportunities for growth in a dynamic, supportive environment.
- Why this job: Be part of a diverse team driving innovation and shaping the future of data.
- Qualifications: Experience in cloud operations, DevOps, and scripting for automation.
The predicted salary is between 60000 - 80000 £ per year.
Experian is a global data and technology company, powering opportunities for people and businesses around the world. We help to redefine lending practices, uncover and prevent fraud, simplify healthcare, create marketing solutions, and gain deeper insights into the automotive market, all using our unique combination of data, analytics and software. We also assist millions of people to realize their financial goals and help them save time and money. We invest in people and new advanced technologies to unlock the power of data.
We are looking for a Site Reliability Engineer (SRE) to improve the reliability and performance of business-critical systems. You will focus on AWS cloud infrastructure, DevOps tooling, and core SRE practices within a distributed, production environment. Reporting to our Lead, you will work with development, platform, and operations teams to ensure systems are stable, scalable, well-monitored and meet defined reliability targets.
Main Responsibilities- Reliability and Operations: Support high availability, scalability and performance of production systems; Work with defined SLIs, SLOs and SLAs, ensuring services meet agreed reliability targets; Identify and reduce operational toil through automation and process improvement; Contribute to the design and implementation of fault-tolerant and resilient systems; Participate in resilience and failure testing activities to validate system behaviour under fault conditions and improve recovery.
- AWS & Cloud Operations: Manage and operate systems hosted on AWS (EC2, EKS/ECS, RDS, S3, Lambda, CloudWatch, IAM, and VPC); Support cloud deployments and infrastructure changes following best practices; Help with backup, disaster recovery and resiliency planning.
- DevOps & Automation: Work with CI/CD pipelines and DevOps practices to ensure reliable and repeatable deployments, including build, test and release automation processes; Use Infrastructure as Code tools such as Terraform or CloudFormation to manage and provision infrastructure; Develop automation using scripting languages (Python, Bash or similar) to reduce operational toil and improve efficiency; Participate in production incident response, troubleshooting, and service restoration; Perform root cause analysis (RCA) and contribute to post-incident reviews; Help implement preventive actions to avoid incident recurrence.
- Observability: Configure and maintain monitoring, logging, and alerting using tools like CloudWatch, Prometheus, Grafana, Splunk, or Dynatrace; Develop dashboards to track system and platform health and reliability metrics across the user journey; Improve alert quality to reduce noise and improve response times; Work with application and engineering teams to embed reliability into system design; Collaborate within a globally distributed team, using clear handovers to ensure continuity; Share knowledge and contribute to team-wide best practices; Communicate with all kinds of stakeholders, influencing decisions through reliability-focused insights.
- Experience in production support, DevOps, SRE, cloud operations, or systems engineering.
- Cloud Expertise: Hands-on experience with AWS cloud services, including compute, container and serverless workloads; Practical experience with CI/CD pipelines and DevOps practices, including Git-based version control, pull request workflows, code reviews, and deployment automation; Experience with SRE principles, monitoring, and reliability engineering practices; Proficiency in scripting (Python, Bash, or similar) for automation and operational tooling; Experience with Linux systems and troubleshooting production issues.
- Exposure to data platforms and data pipelines; Understanding of data reliability concepts; Experience supporting or operating complex distributed systems.
Benefits package includes great compensation and discretionary bonus. Core benefits include pension, Bupa healthcare, Sharesave scheme and more. 25 days annual leave with 8 bank holidays and 3 volunteering days. You can purchase additional annual leave. Experian is proud to be an Equal Opportunity and Affirmative Action employer. Innovation is an important part of Experian's DNA and practices, and our diverse workforce drives our success. Everyone can succeed at Experian and bring their whole self to work, irrespective of their gender, ethnicity, religion, colour, sexuality, physical ability or age. If you have a disability or special need that requires accommodation, please let us know at the earliest opportunity.
Senior Site Reliability Engineer (SRE) in Nottingham employer: Experian Health
Contact Detail:
Experian Health Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior Site Reliability Engineer (SRE) in Nottingham
✨Tip Number 1
Network like a pro! Reach out to current or former employees at Experian on LinkedIn. A friendly chat can give us insider info and maybe even a referral, which can really boost our chances.
✨Tip Number 2
Prepare for the interview by brushing up on AWS and SRE principles. We should be ready to discuss our hands-on experience with cloud services and how we've tackled reliability challenges in the past.
✨Tip Number 3
Showcase our problem-solving skills! Be ready to share specific examples of how we've automated processes or improved system performance. Real-life stories can make us stand out.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure our application gets seen by the right people. Plus, it shows we’re serious about joining the Experian team.
We think you need these skills to ace Senior Site Reliability Engineer (SRE) in Nottingham
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Senior Site Reliability Engineer role. Highlight your experience with AWS, DevOps practices, and any relevant SRE principles. We want to see how your skills align with what we're looking for!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about reliability engineering and how your background makes you a great fit for our team. Keep it engaging and personal – we love to see your personality!
Showcase Your Projects: If you've worked on any projects that demonstrate your expertise in cloud operations or automation, make sure to mention them. We appreciate seeing real-world applications of your skills, so don’t hold back on the details!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you're serious about joining our team at Experian!
How to prepare for a job interview at Experian Health
✨Know Your AWS Inside Out
Make sure you brush up on your AWS knowledge, especially the services mentioned in the job description like EC2, EKS, and RDS. Be ready to discuss how you've used these services in past projects and any challenges you faced.
✨Demonstrate Your SRE Skills
Prepare to talk about your experience with SRE principles, including SLIs, SLOs, and SLAs. Have examples ready that showcase how you've improved system reliability and performance in previous roles.
✨Show Off Your Automation Skills
Since automation is key in this role, be prepared to discuss your experience with scripting languages like Python or Bash. Bring examples of how you've used Infrastructure as Code tools like Terraform to streamline operations.
✨Communicate Clearly and Effectively
As you'll be collaborating with various teams, practice explaining complex technical concepts in simple terms. Think about how you can influence decisions through your insights on reliability and be ready to share your thoughts during the interview.