Observability Site Reliability Engineer
Observability Site Reliability Engineer

Observability Site Reliability Engineer

London Full-Time No home office possible
Go Premium
A

London, England, United Kingdom Software and Services

Description

Apple Services Engineering infrastructure is BIG. Operating at our scale, across multiple geographically dispersed data centers and servicing hundreds of millions of users presents unique challenges.As an SRE at Apple, you\’ll need to solve these problems using data, teamwork, and your own expertise. SREs at Apple own the full infrastructure stack; from device driver performance debugging to content delivery network traffic management — our responsibilities are both broad and deep.ASE runs the majority of its systems on Linux. We run a mix of open source, vendor licensed, and internally developed tools to perform functions such as system configuration management, provisioning, software deployment, logging, and monitoring.You\’ll learn these tools and have opportunities to improve them. Our team is collaborative; we work closely with the development teams we support to deliver the best results for Apple.We think critically and strive to balance the best solution with the need to get things done for each engineering challenge we face. Good ideas are heard and results are rewarded.

Minimum Qualifications

  • Strong understanding of the Linux operating system and TCP/IP suite of networking protocols
  • Ability to design, author, and release code in languages like Go or Python
  • Hands-on experience managing large numbers of diverse systems with configuration management or software delivery platforms (such as Puppet, Chef, Ansible)
  • Familiarity with microservices architecture and container orchestration with Kubernetes

Preferred Qualifications

  • Bare metal management experience and experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks
  • Acute drive to automate manual operations and to improve them through repeated iteration
  • Experience with scale testing, disaster recovery, and capacity planning and experienced in managing and scaling distributed systems in a public, private, or hybrid cloud environment
  • Experience with the Prometheus ecosystem and a good understanding of infrastructure observability principles

Education & Experience

BS/MS in Computer Science or Equivalent ( + in depth experience of software development or production operations experience in a large-scale environment)

#J-18808-Ljbffr

A

Contact Detail:

Apple Inc. Recruiting Team

Observability Site Reliability Engineer
Apple Inc.
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

A
  • Observability Site Reliability Engineer

    London
    Full-Time

    Application deadline: 2027-08-28

  • A

    Apple Inc.

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>