At a Glance
- Tasks: Ensure high availability and reliability of a global Java platform while leading incident response.
- Company: Join a forward-thinking tech company with a global engineering team.
- Benefits: Competitive salary of £120k, fully remote work, and comprehensive benefits.
- Why this job: Make a real impact on system reliability and work with cutting-edge technologies.
- Qualifications: Senior-level SRE experience with a focus on large-scale systems and incident management.
- Other info: Enjoy autonomy in a globally distributed team with opportunities for professional growth.
The predicted salary is £120,000 per year.
We are hiring experienced Senior Site Reliability Engineers to join a global engineering team supporting a high-availability, Java-based platform used by customers worldwide. This is a permanent, fully remote role open to candidates based in the UK or Germany, offering a competitive package of ~£120k + benefits.
If you are a true SRE (not DevOps-focused) who cares deeply about reliability, stability, incident response, and performance at scale, we want to speak with you.
What You'll Do
- Ensure high availability, scalability, reliability, and security across production environments
- Lead live incident response, drive root-cause analysis, and deliver lasting solutions
- Build and maintain SLIs, SLOs, and SLAs
- Support a core Java product: patching, SDKs, configuration (YAML), and uptime work
- Drive automation using Python, Linux tooling, and IaC
- Work closely with security, compliance, and multiple engineering teams
- Participate in a 24/7 on-call rotation (1 week every 4-5 weeks)
Tech Stack & Skills
- AWS: EC2, EKS, Load Balancers, VPC, with hands-on production experience
- Linux: Deep troubleshooting & sysadmin fundamentals
- Python: Scripting for automation
- SRE mindset: Incident management, observability, reliability engineering principles
We're Looking For
- Senior-level SREs with proven experience running large-scale, mission-critical systems
- Engineers who love digging into incidents, solving problems properly, and improving systems over time
- Professionals who thrive in autonomous, globally distributed teams
Locations
Site Reliability Engineer in Dartford, Kent. Employer: Halian | Managed Services, Recruitment Agency & Contract Staffing
Contact Detail:
Halian | Managed Services, Recruitment Agency & Contract Staffing Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Site Reliability Engineer role in Dartford, Kent
✨Tip Number 1
Network like a pro! Reach out to fellow SREs on LinkedIn or join relevant online communities. Engaging with others in the field can lead to job opportunities that aren't even advertised yet.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those involving incident management and automation. This gives potential employers a taste of what you can bring to the table.
✨Tip Number 3
Prepare for interviews by brushing up on your technical knowledge and incident response strategies. Practice common SRE scenarios and be ready to discuss how you've tackled challenges in the past.
✨Tip Number 4
Don't forget to apply through our website! We're always on the lookout for talented SREs who are passionate about reliability and performance. Your next big opportunity could be just a click away!
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with high-availability systems and incident management. We want to see how you've tackled challenges in the past, so don't hold back on those details!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for reliability and stability in systems. Let us know why you're the perfect fit for our SRE team and how you can contribute to our mission.
Showcase Your Technical Skills: Be specific about your experience with AWS, Linux, and Python. We love seeing concrete examples of how you've used these technologies to drive automation and improve system performance.
Apply Through Our Website: We encourage you to apply directly through our website. It's the best way for us to receive your application and ensures you're considered for this exciting opportunity with StudySmarter!
How to prepare for a job interview at Halian | Managed Services, Recruitment Agency & Contract Staffing
✨Know Your SRE Fundamentals
Brush up on your Site Reliability Engineering principles. Be ready to discuss high availability, incident response, and performance at scale. Show that you understand the importance of SLIs, SLOs, and SLAs in maintaining system reliability.
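As a quick refresher on how an SLI relates to an SLO, here is a minimal Python sketch that computes an availability SLI from request counts and the share of the error budget still unspent. The function names, the 99.9% target, and the request counts are hypothetical examples, not details taken from this role.

```python
def availability_sli(good_requests: int, total_requests: int) -> float:
    """Fraction of requests served successfully (the SLI)."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    return good_requests / total_requests


def error_budget_remaining(sli: float, slo_target: float) -> float:
    """Share of the error budget still unspent.

    1.0 means the budget is untouched, 0.0 means it is exactly used up,
    and a negative value means the SLO has been breached.
    """
    allowed_failure = 1.0 - slo_target  # failure rate the SLO tolerates
    if allowed_failure <= 0:
        raise ValueError("slo_target must be below 1.0")
    actual_failure = 1.0 - sli
    return 1.0 - actual_failure / allowed_failure


if __name__ == "__main__":
    # Hypothetical figures: 1,000,000 requests, 500 of them failed.
    sli = availability_sli(good_requests=999_500, total_requests=1_000_000)
    remaining = error_budget_remaining(sli, slo_target=0.999)
    print(f"SLI: {sli:.4%}, error budget remaining: {remaining:.0%}")
```

In an interview, being able to explain why a negative result signals a breached SLO, and how that differs from a contractual SLA, is likely to matter more than the arithmetic itself.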
✨Demonstrate Your Technical Skills
Prepare to showcase your hands-on experience with AWS, Linux, and Python. You might be asked to solve a problem or troubleshoot a scenario, so practice articulating your thought process while working through technical challenges.
✨Showcase Your Incident Management Experience
Be ready to share specific examples of incidents you've managed. Discuss how you approached root-cause analysis and what lasting solutions you implemented. This will highlight your ability to drive improvements in system reliability.
✨Emphasise Team Collaboration
Since this role involves working closely with various teams, prepare to talk about your experience in collaborative environments. Share examples of how you've worked with security, compliance, and engineering teams to achieve common goals.