Site Reliability Engineer

London | Full-Time | £43,200 - £72,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Join a dynamic team to enhance system reliability and automate operations.
  • Company: Xcede is a cutting-edge investment firm merging technology and finance.
  • Benefits: Enjoy a collaborative environment with opportunities for growth and innovation.
  • Why this job: Be part of a high-performing team shaping the future of technology in finance.
  • Qualifications: Strong Python skills, experience in SRE/DevOps, and a degree in CS or Engineering required.
  • Other info: This role offers exposure to low-latency trading environments and innovative tools.

The predicted salary is between £43,200 and £72,000 per year.

A technology-focused, multi-strat investment firm, operating at the cutting edge of their industry, is looking for a Site Reliability Engineer to join their highly skilled, innovative team.

Essential skills:

  • Strong proficiency in Python for infrastructure and automation
  • Hands-on experience in SRE, DevOps or production engineering roles
  • Deep understanding of monitoring, incident response workflows, and system architecture
  • Proactive approach to improving systems and reducing technical debt
  • Strong collaboration and communication skills – working closely with developers, quants, and platform engineers
  • Experience designing and delivering scalable, reliable production systems
  • Proficiency with Linux/Unix systems
  • Bachelor’s degree in CS, Engineering or a related field
  • Familiarity with Kubernetes, Docker, or container orchestration technologies
  • Experience with automation tools such as Terraform or Ansible
  • Background in Go, Bash or other system-level languages
  • Exposure to low-latency trading environments, market data systems, or exchange protocols

This firm, merging science, technology and trading, is offering the chance to play a key role in a high-performing team, developing the infrastructure behind one of the most dynamic and innovative environments in the industry. At the heart of the firm’s operations, you’ll design and implement automation for operations, deployments, monitoring and incident management, as well as owning the observability stack (metrics, logs, traces and alerting).

You will also:

  • Apply core SRE principles (SLIs, SLOs, error budgets) to enhance system reliability
  • Build, document, and improve high-performance system designs
  • Lead incident response and implement improvements
  • Collaborate closely with quant developers/platform teams on evolving infrastructure
  • Evaluate and implement new tools, balancing performance, maintainability, and operational complexity

This is a rare and exciting opportunity to join a collaborative, fast-paced and intellectually stimulating environment, contributing directly to the future of a global firm spearheading innovation and creativity in the industry.

For a full spec and to learn more, please get in touch.

Site Reliability Engineer employer: Xcede

Xcede is an exceptional employer, offering a dynamic and innovative work environment in the heart of London. As a Site Reliability Engineer, you will be part of a highly skilled team that values collaboration and creativity, with ample opportunities for professional growth and development. The firm prioritises employee well-being and fosters a culture of continuous improvement, making it an ideal place for those seeking meaningful and rewarding careers in technology and finance.

Contact Details:

Xcede Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer role

✨Tip Number 1

Familiarise yourself with the core principles of Site Reliability Engineering (SRE). Understanding SLIs, SLOs, and error budgets will not only help you in interviews but also demonstrate your commitment to enhancing system reliability.
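If it helps to make these terms concrete before an interview, here is a minimal Python sketch of how an error budget falls out of an SLO. It assumes a simple request-based availability SLI (good requests divided by total requests), which is only one common formulation. The function name, its parameters and the example figures are hypothetical and are not taken from the job description.

```python
def error_budget_report(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarise error-budget usage for a request-based availability SLO.

    Hypothetical helper for illustration only; real setups usually
    compute this from a metrics backend over a rolling window.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if not 0 <= failed_requests <= total_requests:
        raise ValueError("failed_requests must be between 0 and total_requests")

    # SLI: the measured fraction of requests that succeeded.
    sli = (total_requests - failed_requests) / total_requests
    # Error budget: the number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    # Fraction of the budget already spent (values above 1.0 mean it is exhausted).
    budget_consumed = failed_requests / allowed_failures

    return {
        "sli": sli,
        "allowed_failures": allowed_failures,
        "budget_consumed": budget_consumed,
        "slo_met": sli >= slo_target,
    }


# Example figures (made up): a 99.9% SLO over one million requests, 400 of which failed.
report = error_budget_report(slo_target=0.999, total_requests=1_000_000, failed_requests=400)
print(f"SLI: {report['sli']:.4%}, budget consumed: {report['budget_consumed']:.0%}")
```

With these made-up figures the SLO tolerates roughly 1,000 failed requests, so 400 failures use up about 40% of the budget. Being able to walk through that arithmetic is a simple way to show you understand how the three concepts relate.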

✨Tip Number 2

Showcase your hands-on experience with automation tools like Terraform or Ansible. Be prepared to discuss specific projects where you've implemented these tools to improve system performance and reduce technical debt.

✨Tip Number 3

Highlight your collaboration skills by preparing examples of how you've worked closely with developers and platform engineers in past roles. This will illustrate your ability to thrive in a team-oriented environment, which is crucial for this position.

✨Tip Number 4

If you have experience in low-latency trading environments or market data systems, make sure to mention it. This niche knowledge can set you apart from other candidates and show that you understand the specific challenges of the industry.

We think you need these skills to ace the Site Reliability Engineer role

Proficiency in Python
Hands-on experience in SRE, DevOps or production engineering
Understanding of monitoring and incident response workflows
Knowledge of system architecture
Collaboration and communication skills
Experience designing scalable production systems
Proficiency with Linux/Unix systems
Familiarity with Kubernetes and Docker
Experience with automation tools like Terraform or Ansible
Background in Go or Bash
Exposure to low-latency trading environments
Ability to apply SRE principles (SLIs, SLOs, error budgets)
Skills in building and documenting high-performance system designs
Incident response leadership

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your proficiency in Python, experience with SRE or DevOps roles, and familiarity with tools like Kubernetes and Docker. Use specific examples to demonstrate your skills in system architecture and incident response.

Craft a Compelling Cover Letter: In your cover letter, express your enthusiasm for the role and the company. Mention how your background aligns with their needs, particularly your experience in designing scalable systems and improving operational efficiency.

Showcase Relevant Projects: If you have worked on projects that involved automation tools like Terraform or Ansible, or have experience in low-latency trading environments, be sure to include these in your application. Highlight your contributions and the impact they had.

Prepare for Technical Questions: Anticipate technical questions related to SRE principles, system reliability, and incident management. Be ready to discuss your approach to improving systems and reducing technical debt, as well as your collaboration with cross-functional teams.

How to prepare for a job interview at Xcede

✨Showcase Your Python Skills

As a Site Reliability Engineer, strong proficiency in Python is essential. Be prepared to discuss your experience with Python in detail, especially how you've used it for infrastructure and automation tasks.

✨Demonstrate Your SRE Knowledge

Familiarise yourself with core SRE principles such as SLIs, SLOs, and error budgets. Be ready to explain how you've applied these concepts in previous roles to enhance system reliability.

✨Highlight Collaboration Experience

This role requires strong collaboration with developers and platform engineers. Share specific examples of how you've successfully worked in cross-functional teams to improve systems and reduce technical debt.

✨Prepare for Technical Questions

Expect technical questions related to monitoring, incident response workflows, and system architecture. Brush up on your knowledge of Linux/Unix systems, Kubernetes, Docker, and automation tools like Terraform or Ansible.
