Site Reliability Engineer, Compute
Site Reliability Engineer, Compute

Site Reliability Engineer, Compute

Full-Time 43200 - 72000 £ / year (est.) Home office possible
Go Premium
E

At a Glance

  • Tasks: Join our SRE team to enhance reliability and performance of our Compute infrastructure.
  • Company: Vercel powers the web for brands like Under Armour and eBay, focusing on developer experience.
  • Benefits: Enjoy flexible time off, remote work options, and a great compensation package with stock options.
  • Why this job: Be part of a mission-driven team that values innovation and personal growth in a supportive environment.
  • Qualifications: 3+ years in an SRE role or 5+ years in a related field, with strong problem-solving skills.
  • Other info: We embrace diversity and encourage all to apply, regardless of qualifications.

The predicted salary is between 43200 - 72000 £ per year.

Remote - United Kingdom, Germany, Netherlands

About Vercel: Vercel’s Frontend Cloud provides the developer experience and infrastructure to build, scale, and secure a faster, more personalized web. Customers like Under Armour, eBay, The Washington Post, Johnson & Johnson, and Zapier use Vercel to build dynamic user experiences on the web. At Vercel, our mission is to enable the world to ship the best products and that goes hand in hand with creating an environment where you can do the best work of your life.

About the Role: We are looking for experienced SREs to help grow our small team into a global footprint that can provide expert engagement across our core serving systems. As an early member of the SRE team, you will report directly to the Director of Managed Infrastructure and play a foundational role in expanding our SRE practice, integrating reliability principles more deeply into Vercel’s engineering process as we expand. Within the team, your focus will be on enhancing our Compute infrastructure in close partnership with our EU-based developer team. You will design for reliability and performance while managing for risk as we introduce major innovations to our compute stack.

What You Will Do:

  • Ensure that our products are built for reliability and scale by engaging in the end-to-end design, development, and deployment of new software.
  • Drive continuous risk mitigation and reduction through direct involvement in incident management, blameless postmortems, and follow-ups.
  • Drive measurable improvements to the reliability, performance, and efficiency of our production systems through instrumentation, analysis, and implementation of engineering improvements.
  • Devise repeatable, low-toil operational practices through the development of automated systems for software delivery, system failover, and capacity management.

About You:

  • At least 3 years experience in an SRE role, or at least 5 years experience in an adjacent role (e.g. platform engineering), operating in a scaled environment.
  • Firm grasp of the SRE philosophy and mindset, with practical experience working on or directly with SRE teams that have proactively engaged in system design and improvement.
  • Strong sense of accountability and commitment to problem solving, backed by a curiosity to dig deep and identify root causes.
  • Willingness to proactively engage with development teams to influence the course of software design and operational practices.
  • Capability to manage risk, make decisions, and exhibit sound judgment.
  • Demonstrated ability to plan and deliver long-term projects.
  • Experience with distributed system design.
  • Experience with Containers, Virtual Machines, and Linux.
  • Bonus: Experience working with Terraform and/or Golang.

Additional Information:

  • Great compensation package and stock options.
  • Learn and Grow - we provide mentorship and send you to events that help you build your network and skills.
  • Flexible Time Off - Flexible vacation policy with a recommended 4-weeks per year, and paid holidays.
  • Remote Friendly - Work with teammates from different time zones across the globe. We will provide you the gear you need to do your role, and a WFH budget for you to outfit your space as needed.

Vercel is committed to fostering and empowering an inclusive community within our organization. We do not discriminate on the basis of race, religion, color, gender expression or identity, sexual orientation, national origin, citizenship, age, marital status, veteran status, disability status, or any other characteristic protected by law. Vercel encourages everyone to apply for our available positions, even if they don’t necessarily check every box on the job description.

E

Contact Detail:

ECL Kontor Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer, Compute

✨Tip Number 1

Familiarise yourself with Vercel's products and services. Understanding their Frontend Cloud and how it enhances developer experience will help you articulate how your skills can contribute to their mission during discussions.

✨Tip Number 2

Engage with the SRE community online, particularly in forums or groups that discuss reliability engineering. This will not only keep you updated on best practices but also help you network with professionals who might have insights into Vercel's hiring process.

✨Tip Number 3

Prepare to discuss specific examples of how you've improved system reliability and performance in your previous roles. Be ready to share metrics or outcomes that demonstrate your impact, as this aligns with Vercel's focus on measurable improvements.

✨Tip Number 4

Showcase your experience with distributed systems and tools like Terraform or Golang. If you have projects or contributions that highlight these skills, be prepared to discuss them in detail, as they are highly relevant to the role.

We think you need these skills to ace Site Reliability Engineer, Compute

Site Reliability Engineering (SRE)
Distributed System Design
Incident Management
Blameless Postmortems
Risk Mitigation
Software Development Lifecycle
Automation of Software Delivery
Capacity Management
Performance Analysis
Linux Administration
Containerisation (e.g. Docker)
Virtual Machines
Terraform
Golang
Strong Problem-Solving Skills
Collaboration with Development Teams
Project Planning and Delivery

Some tips for your application 🫡

Understand the Role: Before applying, make sure you fully understand the responsibilities and requirements of the Site Reliability Engineer position at Vercel. Tailor your application to highlight relevant experience and skills that align with their needs.

Craft a Tailored CV: Your CV should reflect your experience in SRE or related roles. Emphasise your familiarity with distributed systems, incident management, and any relevant tools like Terraform or Golang. Use specific examples to demonstrate your achievements and impact in previous positions.

Write a Compelling Cover Letter: In your cover letter, express your enthusiasm for Vercel's mission and how your background makes you a great fit for their team. Discuss your approach to reliability and performance, and mention any relevant projects that showcase your problem-solving skills.

Proofread and Edit: Before submitting your application, carefully proofread your documents for any spelling or grammatical errors. A polished application reflects your attention to detail and professionalism, which are crucial in an SRE role.

How to prepare for a job interview at ECL Kontor

✨Understand the SRE Philosophy

Make sure you have a solid grasp of the Site Reliability Engineering philosophy. Be prepared to discuss how you've applied these principles in your previous roles, especially in terms of system design and improvement.

✨Showcase Your Problem-Solving Skills

Be ready to share specific examples of how you've tackled complex problems in the past. Highlight your curiosity and commitment to digging deep to identify root causes, as this is crucial for an SRE role.

✨Familiarise Yourself with Their Tech Stack

Research Vercel's technology stack, particularly around Containers, Virtual Machines, and Linux. If you have experience with Terraform or Golang, be sure to mention it, as it could give you an edge.

✨Prepare for Scenario-Based Questions

Expect scenario-based questions that assess your risk management and decision-making skills. Think about past incidents you've managed and how you approached them, including any blameless postmortems you conducted.

Site Reliability Engineer, Compute
ECL Kontor
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

E
  • Site Reliability Engineer, Compute

    Full-Time
    43200 - 72000 £ / year (est.)
  • E

    ECL Kontor

    50-100
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>