At a Glance
- Tasks: Lead the design of a global observability stack for millions of devices.
- Company: Join a forward-thinking tech company with a focus on innovation.
- Benefits: 100% remote work, competitive pay, and opportunities for professional growth.
- Why this job: Make a real impact by building high-performance systems that scale rapidly.
- Qualifications: 5+ years in distributed systems and 2+ years in development with Ruby, Go, or Python.
- Other info: Dynamic role with a focus on automation and problem-solving in a collaborative environment.
The predicted salary is between £60,000 and £80,000 per year.
Location: UK - 100% Remote
Duration: 12-Month Initial Contract
Working Hours: 11:00 AM - 7:00 PM (Mostly)
The Mission
Our client is looking for a Senior SRE to lead the design and evolution of a global observability stack that supports millions of customer devices across 8 international data centers. This isn’t just about monitoring; it’s about building the high-performance, distributed systems that ensure our global cloud remains performant while scaling 2-3x every year.
The Scale of the Challenge
- Metrics at Scale: Design and scale a Prometheus architecture to handle 100M+ active series and beyond.
- Petabyte Logging: Operate high-performance Elasticsearch clusters holding 2,000+ TB of data.
- High-Throughput Pipelines: Grow data pipelines built on Kafka, handling hundreds of thousands of events per second.
- Engineering Autonomy: Write libraries and APIs (Ruby, Go) that provide engineers with self-service access to monitoring and logging systems.
- Infrastructure as Code: Leverage Terraform and Ansible to deploy across both public and private cloud environments.
What We’re Looking For
- Distributed Systems Expert: 5+ years of experience operating mid-to-large scale systems on Linux (Debian/Ubuntu), whether on VMs or bare metal.
- The “S” in SRE: 2+ years of development experience with Ruby, Go, Python, or Scala. You prefer building tools to manual toil.
- Observability Specialist: Direct experience with Prometheus/Thanos/Cortex, ELK (Elasticsearch, Logstash, Kibana), Kafka, and Grafana.
- Automation Mindset: Strong proficiency in Terraform, Ansible, and Consul for infrastructure orchestration.
- Problem Solver: You are comfortable diving into unfamiliar codebases and have a deep understanding of software engineering best practices.
- Availability: You are comfortable being part of a production on-call rotation and working a shift that primarily covers 11:00 AM to 7:00 PM UK time.
If you are interested, let me know and we can have a confidential chat. You can also send me your CV and I will call you directly to discuss the role further.
Randstad Technologies is acting as an Employment Business in relation to this vacancy.
Senior SRE employer: Randstad Technologies
Contact Details:
Randstad Technologies Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Senior SRE role
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, join relevant online communities, and attend meetups. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to distributed systems and observability. This gives potential employers a taste of what you can do beyond your CV.
✨Tip Number 3
Prepare for interviews by brushing up on technical questions and system design scenarios. Practice explaining your thought process clearly, as communication is key in SRE roles. We want to see how you tackle problems!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen. Plus, we love hearing from passionate candidates who are eager to join our mission in building high-performance systems.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that match the Senior SRE role. Highlight your experience with distributed systems, observability tools, and any relevant programming languages like Ruby or Go.
Craft a Compelling Cover Letter: Use your cover letter to tell us why you're the perfect fit for this position. Share specific examples of your past work that demonstrate your problem-solving skills and automation mindset.
Showcase Your Projects: If you've worked on any large-scale projects or have experience with Prometheus, Kafka, or Elasticsearch, make sure to mention these in your application. We love seeing real-world applications of your skills!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures it gets into the right hands quickly!
How to prepare for a job interview at Randstad Technologies
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, like Prometheus, ELK, and Kafka. Brush up on your knowledge of Ruby and Go, as well as Terraform and Ansible. Being able to discuss these tools confidently will show that you’re ready to hit the ground running.
✨Showcase Your Problem-Solving Skills
Prepare examples from your past experience where you tackled complex issues in distributed systems. Be ready to explain your thought process and the steps you took to resolve those challenges. This will demonstrate your problem-solving mindset, which is crucial for a Senior SRE role.
✨Understand the Scale of the Challenge
Familiarise yourself with the scale at which the company operates. Think about how you would approach designing a system that can handle millions of customer devices and petabytes of data. Showing that you understand the challenges of scaling will impress your interviewers.
✨Ask Insightful Questions
Prepare thoughtful questions about the team’s current projects, the observability stack, and the company’s future plans. This not only shows your interest in the role but also gives you a chance to assess if the company aligns with your career goals.