Site Reliability Engineer (SRE) - grok.com & API

London | Full-Time | £43,200 - £72,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Join our team to build scalable backend services for grok.com and our API.
  • Company: xAI creates AI systems to enhance human understanding and knowledge.
  • Benefits: Enjoy competitive pay, equity, private health and dental insurance, and work-from-home days when required.
  • Why this job: Be part of a motivated team that values initiative and engineering excellence.
  • Qualifications: Expertise in Kubernetes, continuous deployment, monitoring, and infrastructure as code is essential.
  • Other info: Work primarily in London with occasional late meetings for team coordination.

The predicted salary is between £43,200 and £72,000 per year.

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the team

You will work on the team that is responsible for the backend services that power grok.com and our API. Our team is currently based primarily in London with a small but growing number of engineers located in Palo Alto. We focus on writing highly scalable and reliable services that can efficiently process tens of thousands of queries per second. The services are hosted on a number of Kubernetes clusters (on-prem & cloud).

About the role

An ideal candidate meets at least the following requirements:

  • Expert knowledge of Kubernetes
  • Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD
  • Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty
  • Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform

Location

We hire engineers in London and in Palo Alto. We usually work from the office 5 days a week but allow for work-from-home days when required. Candidates joining the London team must be willing to attend late meetings at least once a week to coordinate with the rest of our team.

Interview process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute phone interview, during which a member of our team will ask some basic technical questions. If you clear the phone interview, you will enter the main process, which consists of two technical interviews. All interviews will be conducted via Google Meet.

Benefits

Competitive cash-based compensation, xAI equity, and private health and dental insurance.

xAI is an equal opportunity employer and does not unlawfully discriminate based on race, color, religion, ethnicity, ancestry, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, age, disability, medical conditions, genetic information, marital status, military or veteran status, or any other applicable legally protected characteristics.

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all applicable federal, state, and local laws, including the San Francisco Fair Chance Ordinance, Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.

Site Reliability Engineer (SRE) - grok.com & API employer: xAI

At xAI, we pride ourselves on being an exceptional employer that fosters a culture of innovation and collaboration. Our London-based team thrives in a dynamic environment where engineers are empowered to take initiative and contribute directly to our mission of advancing AI technology. With competitive compensation, private health and dental insurance, and opportunities for professional growth, we offer a rewarding workplace for those eager to tackle complex challenges alongside like-minded individuals.

Contact Detail:

xAI Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer (SRE) - grok.com & API role

Tip Number 1

Familiarise yourself with the specific technologies mentioned in the job description, such as Kubernetes, Buildkite, and Prometheus. Having hands-on experience or projects that showcase your expertise in these areas will give you a significant edge during the interview process.

Tip Number 2

Prepare to discuss your previous work experiences in detail, especially those that demonstrate your problem-solving skills and ability to work under pressure. Be ready to share specific examples of how you've contributed to engineering excellence in past roles.

Tip Number 3

Since communication is key in this role, practice explaining complex technical concepts in simple terms. This will help you convey your knowledge effectively during interviews and show that you can collaborate well with your team.

Tip Number 4

Research xAI’s mission and values thoroughly. Understanding their focus on curiosity and initiative will allow you to align your answers with their organisational culture, making you a more appealing candidate.

We think you need these skills to succeed as Site Reliability Engineer (SRE) - grok.com & API

Kubernetes Expertise
Continuous Deployment Systems (Buildkite, ArgoCD)
Monitoring Technologies (Prometheus, Grafana, PagerDuty)
Infrastructure as Code (Pulumi, Terraform)
Scalability and Reliability Engineering
Cloud Computing
Problem-Solving Skills
Strong Communication Skills
Team Collaboration
Adaptability to Changing Environments
Technical Documentation
Prioritisation Skills
Curiosity and Initiative

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your expertise in Kubernetes, continuous deployment systems, and monitoring technologies. Use specific examples from your past work that demonstrate your skills and achievements relevant to the role.

Craft a Strong Statement of Exceptional Work: In 100 words or less, describe a piece of work you are most proud of. Focus on a project that showcases your technical skills and problem-solving abilities, particularly in areas related to site reliability engineering.

Provide Relevant Links: If you have a LinkedIn profile, X profile, or Google Scholar page, include the URLs in your application. Ensure these profiles are up-to-date and reflect your professional accomplishments and contributions.

Be Clear About Visa Requirements: If you will require sponsorship for employment visa status, be transparent about it in your application. This helps the company understand your situation and can facilitate the hiring process.

How to prepare for a job interview at xAI

Showcase Your Technical Expertise

Be prepared to discuss your expert knowledge of Kubernetes, continuous deployment systems, and monitoring technologies. Highlight specific projects where you've successfully implemented these tools, as this will demonstrate your hands-on experience.

Communicate Clearly and Concisely

Strong communication skills are essential for this role. Practice explaining complex technical concepts in simple terms, as you may need to share your knowledge with teammates who might not have the same level of expertise.

Demonstrate Problem-Solving Skills

Expect technical interviews to include problem-solving scenarios. Prepare by reviewing common SRE challenges and think through how you would approach them. Be ready to articulate your thought process during the interview.

Prepare for Cultural Fit Questions

Since xAI values initiative and a strong work ethic, be ready to discuss examples from your past that showcase your ability to take charge and deliver excellence. This will help you align with their flat organisational structure and team dynamics.

Application deadline: 2027-07-10
