At a Glance
- Tasks: Ensure high availability and reliability of a global Java platform while leading incident response.
- Company: Join a dynamic global engineering team in a fully remote role.
- Benefits: Competitive salary of ~£120k plus benefits, with a focus on work-life balance.
- Why this job: Make a real impact on system reliability and performance at scale.
- Qualifications: Proven experience as a Senior Site Reliability Engineer with strong problem-solving skills.
- Other info: Work in an autonomous, globally distributed team with opportunities for growth.
The predicted salary is between 120000 - 120000 £ per year.
We are hiring experienced Senior Site Reliability Engineers to join a global engineering team supporting a high‑availability, Java‑based platform used by customers worldwide. This is a permanent, fully remote role open to candidates based in the UK or Germany, offering a competitive package of ~£120k + benefits.
If you are a true SRE (not DevOps-focused) who cares deeply about reliability, stability, incident response, and performance at scale, we want to speak with you.
What You’ll Do
- Ensure high availability, scalability, reliability, and security across production environments
- Lead live incident response, drive root‑cause analysis, and deliver lasting solutions
- Build and maintain SLIs, SLOs, and SLAs
- Support a core Java product: patching, SDKs, configuration (YAML), and uptime work
- Drive automation using Python, Linux tooling, and IaC
- Work closely with security, compliance, and multiple engineering teams
- Participate in a 24/7 on‑call rotation (1 week every 4–5 weeks)
Tech Stack & Skills
- AWS: EC2, EKS, Load Balancers, VPC — with hands‑on production experience
- Linux: Deep troubleshooting & sysadmin fundamentals
- Python: Scripting for automation
- SRE mindset: Incident management, observability, reliability engineering principle
We’re Looking For
- Senior‑level SREs with proven experience running large‑scale, mission‑critical systems
- Engineers who love digging into incidents, solving problems properly, and improving systems over time
- Professionals who thrive in autonomous, globally distributed teams.
Locations
Site Reliability Engineer I in Basildon, Essex employer: Halian | Managed Services, Recruitment Agency & Contract Staffing
Contact Detail:
Halian | Managed Services, Recruitment Agency & Contract Staffing Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer I in Basildon, Essex
✨Tip Number 1
Network like a pro! Reach out to current or former employees on LinkedIn, and don’t be shy about asking for insights into the company culture and the SRE team. A friendly chat can sometimes lead to referrals, which can give you a leg up in the hiring process.
✨Tip Number 2
Prepare for technical interviews by brushing up on your incident management skills and reliability engineering principles. We recommend doing mock interviews with friends or using online platforms to simulate the real deal. The more comfortable you are, the better you'll perform!
✨Tip Number 3
Showcase your problem-solving skills! During interviews, be ready to discuss past incidents you've managed and how you improved system reliability. Use specific examples that highlight your SRE mindset and your ability to work under pressure.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take the initiative to engage directly with us.
We think you need these skills to ace Site Reliability Engineer I in Basildon, Essex
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with high availability and reliability. We want to see how you've tackled incidents and improved systems in your previous roles, so don’t hold back!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for SRE and how you align with our values at StudySmarter. Let us know why you're excited about this role and what you can bring to the team.
Showcase Your Technical Skills: Be specific about your experience with AWS, Linux, and Python. We love seeing concrete examples of how you've used these technologies to drive automation and improve system performance.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy!
How to prepare for a job interview at Halian | Managed Services, Recruitment Agency & Contract Staffing
✨Know Your SRE Fundamentals
Brush up on your Site Reliability Engineering principles. Be ready to discuss high availability, incident response, and how you’ve implemented SLIs, SLOs, and SLAs in past roles. This shows you’re not just familiar with the concepts but have practical experience.
✨Showcase Your Technical Skills
Prepare to dive deep into your technical expertise, especially with AWS, Linux, and Python. Have examples ready where you've used these technologies to solve real-world problems or improve system reliability. This will demonstrate your hands-on experience.
✨Incident Management Experience
Be prepared to talk about specific incidents you've managed. Discuss your approach to root-cause analysis and how you delivered lasting solutions. This is crucial for showing that you can handle the pressure of live incident response.
✨Cultural Fit and Team Collaboration
Since this role involves working in a globally distributed team, highlight your experience in remote collaboration. Share examples of how you’ve successfully worked with cross-functional teams, especially in a 24/7 on-call environment. This will show you’re a great fit for their culture.