At a Glance
- Tasks: Build and manage scalable, self-healing systems while supporting developers.
- Company: Join a dynamic social network serving over 100 million users monthly.
- Benefits: Enjoy flexible hours, 25 days holiday, and a personal development allowance.
- Why this job: Tackle unique challenges in a collaborative environment with a focus on automation.
- Qualifications: Mastery of Linux and a modern programming language; experience with cloud platforms.
- Other info: Participate in hackathons, conferences, and enjoy a vibrant office culture.
The predicted salary is between 43200 - 72000 £ per year.
Do you want to build and manage scalable, self-healing, globally-distributed systems? Join this dynamic team of SREs who keep this organisation fast, available, and growing, connecting users to great local businesses. They work in both the dev and systems worlds, implementing key parts of the core architecture and supporting devs as they try to do the same. As part of this company, you’ll get the opportunity to tackle interesting challenges that you can only find at the kind of scale that serves over 100 million users per month. You’ll work to empower product teams and developers: spinning up infrastructure should always be a git commit and a code review away: automation and self-service are at the core of what this team does.
What will you be doing?
- Work closely with developers in supporting new features and services
- Build tools to monitor site stability and performance
- Scale AWS-based infrastructure (no racking servers or swapping hard drives here!)
- Troubleshoot site issues using industry-leading tools like Splunk and SignalFX
- Automate everything with Puppet, Git, Jenkins, and Terraform
- Develop custom tools when off-the-shelf solutions don’t work
- Design new systems, tests, and procedures
What You’ll Bring to the Role:
- A mastery of Linux
- Command of your favourite modern programming language: Python, Ruby, Go, Rust, Java, C++, etc.
- A solid understanding of fundamental technologies like TCP/IP, HTTP, and DNS
- Knowledge of best practices related to security, performance, and disaster recovery
- Experience with web server configuration (Apache/Nginx/HAproxy), monitoring, trending, and high availability
- Strong scripting and automation skills
- Expertise in Configuration Management (Puppet/Ansible/Chef/etc.)
- Experience with public cloud platforms (we use AWS, but Azure/GCP are fine) and related tooling (Terraform, etc.)
- Experience with Docker or other container technologies
- Excellent communication and documentation skills
Full responsibility for projects from day one, an awesome team, and a dynamic work environment. Competitive salary with equity in the company, a pension scheme, and an optional employee stock purchase program. 25 days paid holiday initially, rising to 29 with service. Flexible working hours and meeting-free Thursdays. Regular 3-day Hackathons and weekly learning groups, always with interesting topics. Opportunities to participate in events and conferences throughout Europe and the US. Public transportation season ticket loan and £50 per month toward any exercise of your choice. Monthly personal development allowance. Central location, a fully stocked kitchen, adjustable sitting/standing desks, quarterly offsites, locally roasted coffee, happy hours, and more!
Site Reliability Engineer employer: Third Republic
Contact Detail:
Third Republic Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer
✨Tip Number 1
Familiarise yourself with the specific tools and technologies mentioned in the job description, such as AWS, Puppet, and Terraform. Having hands-on experience or projects showcasing your skills with these tools can set you apart from other candidates.
✨Tip Number 2
Engage with the SRE community online through forums, social media, or local meetups. Networking with professionals in the field can provide insights into the role and may even lead to referrals within our company.
✨Tip Number 3
Prepare to discuss real-world scenarios where you've implemented automation or troubleshooting techniques. Being able to share specific examples of how you've improved system reliability will demonstrate your expertise and problem-solving skills.
✨Tip Number 4
Showcase your communication skills by being clear and concise when discussing technical concepts. As an SRE, you'll need to collaborate with developers, so demonstrating your ability to convey complex ideas simply can make a strong impression.
We think you need these skills to ace Site Reliability Engineer
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience and skills that align with the Site Reliability Engineer role. Focus on your mastery of Linux, programming languages, and any cloud platform experience you have.
Craft a Compelling Cover Letter: In your cover letter, express your passion for building scalable systems and your understanding of automation and self-service. Mention specific tools and technologies you've worked with that are relevant to the job description.
Showcase Problem-Solving Skills: Provide examples in your application of how you've tackled challenges in previous roles, particularly those related to site stability, performance monitoring, or automation. This will demonstrate your ability to handle the responsibilities outlined in the job description.
Highlight Communication Skills: Since excellent communication is key for this role, include instances where you've effectively collaborated with developers or other teams. This could be through documentation, project management, or direct collaboration on projects.
How to prepare for a job interview at Third Republic
✨Showcase Your Technical Skills
Be prepared to discuss your mastery of Linux and your favourite programming language. Highlight specific projects where you've implemented automation or built tools, as this aligns closely with the role's requirements.
✨Demonstrate Problem-Solving Abilities
Expect to face scenario-based questions that test your troubleshooting skills. Prepare examples of how you've resolved site issues using tools like Splunk or SignalFX, showcasing your analytical thinking.
✨Understand the Company’s Infrastructure
Familiarise yourself with AWS and other public cloud platforms. Be ready to discuss your experience with scaling infrastructure and using configuration management tools like Puppet or Terraform, as these are crucial for the role.
✨Communicate Effectively
Strong communication skills are essential in this role. Practice explaining complex technical concepts in simple terms, as you'll need to collaborate closely with developers and product teams.