At a Glance
- Tasks: Ensure system reliability, automate tasks, and respond to incidents swiftly.
- Company: Join Tombola, a leading tech company focused on delivering seamless gaming experiences.
- Benefits: Enjoy hybrid work options, a collaborative culture, and opportunities for continuous learning.
- Why this job: Be part of a fun team that values innovation and system performance while making players happy.
- Qualifications: Experience as an SRE with skills in automation, monitoring, and cloud infrastructure is essential.
- Other info: Ready to make an impact? Apply now and join the Tombola family!
The predicted salary is between 36000 - 60000 £ per year.
Fancy being our next SRE Superstar? Here at Tombola, we're not just about bingo – we're about brilliant tech, seamless experiences, and keeping millions of players happy. And to do that, we need a Site Reliability Engineer who's as excited about rock-solid systems and clever automation as we are about winning lines!
So, what's this ace role all about? You'll be the wizard behind the curtain, ensuring our critical systems are always reliable, available, and performing like a dream. We're talking about implementing smart automation, sharp monitoring, and super-speedy incident response strategies to keep everything running smoothly. You'll be working hand-in-hand with our dev, infra, and security teams, making sure we balance exciting new features with unbeatable stability.
What you'll be getting up to:
- System Reliability & Availability Hero: You'll be the guardian of our uptime, making sure our critical systems are always available and hitting those all-important SLAs. You'll also be leading the charge on incident management, getting to the bottom of any issues and making sure we learn from them.
- Monitoring & Alerting Maestro: Setting up and maintaining top-notch monitoring systems will be your jam. You'll craft alerting systems that give us a heads-up before problems even get a chance to impact our players, and you'll define key metrics to measure system health.
- Incident Response Ace: When things get a bit wobbly, you'll be on the front lines, resolving incidents fast to minimize downtime. After the dust settles, you'll lead the root cause analysis to prevent similar issues from popping up again.
- Automation Whizz: Got a repetitive task? You'll be the one automating it away! From environment setup to configuration, you'll be using tools like Terraform, Git, and TeamCity to streamline everything and build slick CI/CD pipelines.
- Capacity Planning Pro: You'll ensure our systems can effortlessly scale to meet demand, optimizing resource usage so we're always efficient and ready for anything. You'll be forecasting future needs to keep things performing perfectly.
- Performance Optimiser: You'll be constantly poking and prodding our systems, tuning databases, improving response times, and making sure everything runs at peak performance. Plus, you'll be running load and stress tests to ensure we can handle even the busiest periods.
- Infrastructure Guru: You'll be bossing our AWS cloud resources, making sure they're properly scaled, cost-effective, and resilient. And yes, you'll be crafting disaster recovery plans so we're ready for any curveballs!
- Collaboration King/Queen: You'll be working hand-in-hand with our awesome development teams, making sure new features are built with reliability in mind. You'll champion service ownership and provide valuable feedback to keep improving our operational success.
- Security & Compliance Captain: Keeping things safe is a big deal here. You'll be weaving security best practices into our infrastructure and making sure we're always playing by the rules and protecting our production environments.
- Documentation Dynamo: If you build it, you'll document it! Clear, concise docs for all our infrastructure, procedures, and runbooks are key.
- Continuous Improvement Enthusiast: You'll always be on the lookout for new tech and better ways of doing things, constantly pushing us to improve system reliability, performance, and efficiency.
Sound like a bit of you? If you're an experienced SRE with a passion for building reliable, scalable, and efficient systems, and you love working in a fun, collaborative environment, then we want to hear from you! Ready to join the Tombola family and help us build even more amazing things? Apply now!
Site Reliability Engineer (SRE) Sunderland - Hy... · Sunderland, UK · employer: Tombola
Contact Detail:
Tombola Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (SRE) Sunderland - Hy... · Sunderland, UK ·
✨Tip Number 1
Familiarise yourself with the specific tools mentioned in the job description, like Dynatrace, Terraform, and AWS. Having hands-on experience or projects showcasing your skills with these tools can set you apart from other candidates.
✨Tip Number 2
Demonstrate your incident management skills by preparing examples of past experiences where you successfully resolved incidents. Be ready to discuss your approach to root cause analysis and how you implemented changes to prevent future issues.
✨Tip Number 3
Showcase your collaboration skills by highlighting any previous work with development teams. Discuss how you’ve contributed to building reliable features and how you’ve communicated effectively to ensure operational success.
✨Tip Number 4
Stay updated on the latest trends in site reliability engineering and automation. Mention any recent technologies or methodologies you've explored that could benefit Tombola, demonstrating your commitment to continuous improvement.
We think you need these skills to ace Site Reliability Engineer (SRE) Sunderland - Hy... · Sunderland, UK ·
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience and skills that align with the Site Reliability Engineer role. Focus on your expertise in system reliability, automation, and incident management, as these are key aspects of the job.
Craft a Compelling Cover Letter: Write a cover letter that showcases your passion for technology and your understanding of the importance of system reliability. Mention specific tools and methodologies you’ve used, such as Terraform or AWS, to demonstrate your hands-on experience.
Showcase Problem-Solving Skills: In your application, provide examples of how you've successfully resolved incidents or improved system performance in previous roles. This will illustrate your ability to handle the responsibilities outlined in the job description.
Highlight Collaboration Experience: Since the role involves working closely with development and security teams, emphasise any past experiences where you collaborated effectively with cross-functional teams. This will show that you can thrive in a team-oriented environment like Tombola.
How to prepare for a job interview at Tombola
✨Show Your Passion for Reliability
Make sure to express your enthusiasm for system reliability and availability during the interview. Share examples of how you've ensured uptime in previous roles, and discuss your approach to incident management and root cause analysis.
✨Demonstrate Your Automation Skills
Be prepared to talk about your experience with automation tools like Terraform, Git, and TeamCity. Highlight specific tasks you've automated in the past and how it improved efficiency and reduced manual errors.
✨Discuss Monitoring and Alerting Strategies
Tombola values proactive monitoring, so come ready to discuss the monitoring systems you've set up. Explain how you define key metrics and craft alerting systems that help catch issues before they impact users.
✨Emphasise Collaboration and Communication
As an SRE, you'll work closely with development and security teams. Share examples of how you've collaborated in the past, focusing on how you provided feedback and championed service ownership to improve operational success.