At a Glance
- Tasks: Design and scale observability platforms for cloud infrastructure supporting millions of devices.
- Company: Join a leading tech firm focused on innovative cloud solutions.
- Benefits: Competitive hourly rate, remote work flexibility, and opportunities for skill development.
- Why this job: Make a real impact on system reliability and performance in a dynamic environment.
- Qualifications: 5+ years in Site Reliability Engineering with strong Linux and programming skills.
- Other info: Immediate interviews available for passionate candidates ready to innovate.
The predicted salary is between 44000 - 49000 £ per year.
Location: London/UK (Remote)
Contract: 12 Months
Initial Day rate: 55 Per Hour - 62 Per Hour Inside IR35
Job Overview
We are looking for a Senior Site Reliability Engineer with strong experience in Observability, Monitoring and Distributed Systems to support large-scale cloud infrastructure supporting millions of devices globally. The role focuses on building and scaling monitoring, logging and alerting platforms to ensure high availability and performance of cloud services.
Responsibilities
- Design, deploy and scale observability platforms
- Manage and scale Prometheus monitoring systems
- Deploy and maintain large Elasticsearch clusters
- Build and maintain data pipelines using Kafka
- Develop alerting and monitoring frameworks
- Automate infrastructure using Terraform and Ansible
- Develop tools and scripts using Python, Go, Ruby or Bash
- Work with Linux systems (Debian/Ubuntu)
- Participate in on-call rotation
- Improve system reliability, performance and scalability
Required Skills
- 5+ years experience in Site Reliability Engineering / DevOps
- Strong Linux systems experience
- Observability and Monitoring tools experience
- Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana)
- Kafka
- Terraform / Infrastructure as Code
- Ansible / Configuration Management
- Programming experience (Python, Go, Ruby or Bash)
- Distributed systems and cloud infrastructure experience
This is an urgent vacancy where the hiring manager is shortlisting for an interview immediately. Please apply with a copy of your CV or send it to khushboo.co.uk.
Randstad Technologies is acting as an Employment Business in relation to this vacancy.
SRE - Site Reliability Engineer in City of London employer: Randstad Technologies Recruitment
Contact Detail:
Randstad Technologies Recruitment Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land SRE - Site Reliability Engineer in City of London
✨Tip Number 1
Network like a pro! Reach out to your connections in the industry, especially those who work in Site Reliability Engineering. A friendly chat can lead to insider info about job openings or even a referral.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects related to observability and monitoring. This gives potential employers a taste of what you can do beyond your CV.
✨Tip Number 3
Prepare for technical interviews by brushing up on your knowledge of Prometheus, Grafana, and Elasticsearch. Practice common SRE scenarios and be ready to discuss how you've tackled challenges in the past.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take that extra step!
We think you need these skills to ace SRE - Site Reliability Engineer in City of London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with observability, monitoring, and distributed systems. We want to see how your skills align with the role, so don’t be shy about showcasing relevant projects or achievements!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about Site Reliability Engineering and how your background makes you a perfect fit for our team. Keep it concise but impactful!
Showcase Your Technical Skills: Since we’re looking for someone with strong technical expertise, make sure to list your experience with tools like Prometheus, Grafana, and Elasticsearch clearly. We love seeing specific examples of how you've used these in past roles!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity. Don’t miss out!
How to prepare for a job interview at Randstad Technologies Recruitment
✨Know Your Tech Inside Out
Make sure you’re well-versed in the tools and technologies mentioned in the job description. Brush up on your experience with Prometheus, Grafana, and the ELK Stack. Be ready to discuss how you've used these tools in past projects and the impact they had on system reliability.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of challenges you've faced in Site Reliability Engineering. Think about times when you improved system performance or resolved critical incidents. Use the STAR method (Situation, Task, Action, Result) to structure your answers.
✨Demonstrate Your Automation Know-How
Since automation is key in this role, be prepared to talk about your experience with Terraform and Ansible. Discuss any scripts or tools you've developed using Python, Go, Ruby, or Bash, and how they contributed to streamlining processes or enhancing system reliability.
✨Ask Insightful Questions
Interviews are a two-way street! Prepare thoughtful questions about the company's cloud infrastructure, their approach to observability, and the team dynamics. This shows your genuine interest in the role and helps you assess if it’s the right fit for you.