At a Glance
- Tasks: Manage AWS architecture and implement SRE practices for operational excellence.
- Company: Join Reward Gateway, a leader in digital services for workplace connections.
- Benefits: Hybrid work model, competitive salary, and a focus on personal growth.
- Why this job: Make a real impact by enhancing reliability and availability in a dynamic environment.
- Qualifications: 4+ years in DevOps/SRE with strong AWS and automation skills.
- Other info: Diverse culture that values creativity and individuality.
The predicted salary is between 7000 - 7500 £ per month.
This role offers a hybrid work offering to be present in our London office twice per week. Reward Gateway|Edenred is a leading digital platform for services and payments for people at work, connecting 52 million users and 2 million partner merchants in 45 countries via close to 1 million corporate clients. Our shared mission of ‘Making the World a Better Place to Work' and ‘Enriching connections, For good’, guides our every action and charts a sustainable path to a better future.
Due to expansion, an opportunity has become available for a Site Reliability Engineer to join our team to help us transform our existing operational workloads to an SRE approach.
Key Responsibilities- Day-to-day operations of our complex AWS architecture
- Integrating tightly with our DevOps team members
- Following SRE practices and maintaining high standards of compliance
- Implementing a new standard of observability utilising SLI/SLO/Error Budgets
- Continually evolving our observability platforms for greater coverage
- Using a code-first approach to build and changes to reduce TOIL
- Advocating a strong focus on availability, reliability and uptime
- Liaising with the Engineering teams for the constant evolution of metrics
- Working towards planned roadmap goals
- Actively taking part in the daily stand-ups and keeping sprints on track
- Keeping up-to-date documentation in the JIRA & Confluence tools
- Taking part in SRE Incident Management processes
- Acting as a key Incident Commander within the Incident Management process
- Ensuring a focus on cost efficiency for the platforms & services
- Working with team members to foster collaboration and ongoing communication with stakeholders
- At least 4 years of experience in DevOps or SRE, with a keen interest in growing as a Site Reliability Engineer
- Experience with AWS or other cloud providers
- Enterprise infrastructure experience in HA environments
- Automation skills through Terraform, Python, Bash or similar
- Wide-reaching SRE skills and a deep understanding of SRE practices
- A strong understanding of SQL, PHP, Kubernetes, CI/CD
- Observability product experience (eg, New Relic, Datadog)
- Managing infrastructures using SLI/SLO & Error Budgets
- Ability to work both independently and as part of a team
- Ability to work under pressure and be highly reliable
- Adaptability and flexibility to change in a fast-moving environment
- An ability to learn new tools and processes quickly and impart that knowledge
Salary on offer ranges from £7,000 to £7,500 gross per month, depending on experience. Currently, no bonuses or share options are offered.
The Interview Process- Screening video interview with the Senior Talent Partner and Head of SRE
- Final interview with the Director of Infrastructure & Head of SRE
Be comfortable. Be you. At Reward Gateway, we want all of our employees to feel comfortable bringing their passion, creativity and individuality to work. We value all cultures, backgrounds and experiences, as we truly believe that diversity drives innovation. Express yourself, join our community and help us make the World a Better Place to Work.
Third Floor, 1 Dean Street London W1D 3RB United Kingdom
Site Reliability Engineer (SRE) in City of London employer: Rewardgateway
Contact Detail:
Rewardgateway Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (SRE) in City of London
✨Tip Number 1
Get to know the company culture before your interview. Check out their social media and website to see what they're all about. This will help you tailor your answers and show that you're genuinely interested in being part of their mission.
✨Tip Number 2
Practice your technical skills! Since this role is all about SRE practices, brush up on your AWS knowledge and automation skills. You might even want to run through some common scenarios or problems you could face in the role.
✨Tip Number 3
Don’t forget to prepare questions for your interviewers. This shows that you’re engaged and serious about the position. Ask about their current projects or how they measure success in the SRE team.
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re proactive and keen to join the team at Reward Gateway.
We think you need these skills to ace Site Reliability Engineer (SRE) in City of London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that match the Site Reliability Engineer role. Highlight your AWS experience, automation skills, and any SRE practices you've implemented. We want to see how you can contribute to our mission!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about SRE and how your background aligns with our goals at Reward Gateway. Keep it engaging and personal – we love getting to know the real you!
Showcase Your Projects: If you've worked on relevant projects, don’t hold back! Include links or descriptions of your work that demonstrate your skills in observability, automation, or cloud infrastructure. This gives us a glimpse into your hands-on experience.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!
How to prepare for a job interview at Rewardgateway
✨Know Your AWS Inside Out
Since the role involves day-to-day operations of a complex AWS architecture, make sure you brush up on your AWS knowledge. Be ready to discuss specific services you've used and how they relate to SRE practices. This will show that you're not just familiar with AWS, but that you can leverage it effectively in your role.
✨Showcase Your Automation Skills
With automation being a key part of the job, prepare examples of how you've used tools like Terraform, Python, or Bash to streamline processes. Bring along any scripts or projects you've worked on that demonstrate your ability to reduce TOIL and improve efficiency.
✨Understand SLI/SLO and Error Budgets
The interviewers will likely want to know how you manage availability and reliability. Be prepared to explain your understanding of SLI/SLO and how you've implemented error budgets in past roles. Real-world examples will help illustrate your expertise and commitment to high standards.
✨Be Ready for Incident Management Scenarios
As a potential Incident Commander, you should be ready to discuss your experience with incident management processes. Think of specific incidents you've managed, what your role was, and how you ensured effective communication and resolution. This will highlight your ability to work under pressure and lead during critical situations.