At a Glance
- Tasks: Join a dynamic team to enhance system reliability and automate operations.
- Company: Xcede is recruiting on behalf of a technology-focused, multi-strat investment firm merging technology and finance.
- Benefits: Enjoy a collaborative environment with opportunities for growth and innovation.
- Why this job: Be part of a high-performing team shaping the future of technology in finance.
- Qualifications: Strong Python skills, experience in SRE/DevOps, and a degree in CS or Engineering required.
- Other info: This role offers exposure to low-latency trading environments and innovative tools.
The predicted salary is between £43,200 and £72,000 per year.
A technology-focused, multi-strat investment firm, operating at the cutting edge of their industry, is looking for a Site Reliability Engineer to join their highly skilled, innovative team.
Essential skills:
- Strong proficiency in Python for infrastructure and automation
- Hands-on experience in SRE, DevOps or production engineering roles
- Deep understanding of monitoring, incident response workflows, and system architecture
- Proactive approach to improving systems and reducing technical debt
- Strong collaboration and communication skills – working closely with developers, quants, and platform engineers
- Experience designing and delivering scalable, reliable production systems
- Proficiency with Linux/Unix systems
- Bachelor’s degree in CS, Engineering or a related field
- Familiarity with Kubernetes, Docker, or container orchestration technologies
- Experience with automation tools such as Terraform or Ansible
- Background in Go, Bash or other system-level languages
- Exposure to low-latency trading environments, market data systems, or exchange protocols
This firm, merging science, technology and trading, is offering the chance to play a key role in a high-performing team, developing the infrastructure behind one of the most dynamic and innovative environments in the industry. At the heart of the firm’s operations, you’ll design and implement automation for operations, deployments, monitoring and incident management, as well as owning the observability stack (metrics, logs, traces and alerting).
You will also:
- Apply core SRE principles (SLIs, SLOs, error budgets) to enhance system reliability
- Build, document, and improve high-performance system designs
- Lead incident response and implement improvements
- Collaborate closely with quant developers/platform teams on evolving infrastructure
- Evaluate and implement new tools, balancing performance, maintainability, and operational complexity
This is a rare and exciting opportunity to join a collaborative, fast-paced and intellectually stimulating environment, contributing directly to the future of a global firm spearheading innovation and creativity in the industry.
For a full spec and to learn more, please get in touch.
Site Reliability Engineer employer: Xcede
Contact Details:
Xcede Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Site Reliability Engineer role
✨Tip Number 1
Familiarise yourself with the core principles of Site Reliability Engineering (SRE). Understanding SLIs, SLOs, and error budgets will not only help you in interviews but also demonstrate your commitment to enhancing system reliability.
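If it helps to make these terms concrete before an interview, the arithmetic behind an availability SLO can be sketched in a few lines of Python. This is a minimal illustration only: the function name `error_budget_report`, the request counts, and the 99.9% target are assumptions chosen for the example, not figures from the role or the firm.

```python
def error_budget_report(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarise an availability SLO over a single measurement window.

    slo_target is the fraction of requests that must succeed (e.g. 0.999).
    All names and numbers here are hypothetical, for illustration only.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0 or not 0 <= failed_requests <= total_requests:
        raise ValueError("request counts are inconsistent")

    # SLI: the measured fraction of successful requests.
    sli = (total_requests - failed_requests) / total_requests

    # Error budget: how many failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests

    # Fraction of the budget still unspent (negative once the budget is blown).
    budget_remaining = 1.0 - failed_requests / allowed_failures

    return {
        "sli": sli,
        "slo_met": sli >= slo_target,
        "allowed_failures": allowed_failures,
        "budget_remaining": budget_remaining,
    }


if __name__ == "__main__":
    # Hypothetical window: one million requests, 250 failures, 99.9% target.
    report = error_budget_report(0.999, 1_000_000, 250)
    for key, value in report.items():
        print(f"{key}: {value}")
```

In this made-up window the target tolerates roughly 1,000 failed requests, so 250 failures would leave about three quarters of the budget unspent. Being able to walk through that reasoning aloud is usually more useful in an interview than reciting the definitions.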
✨Tip Number 2
Showcase your hands-on experience with automation tools like Terraform or Ansible. Be prepared to discuss specific projects where you've implemented these tools to improve system performance and reduce technical debt.
✨Tip Number 3
Highlight your collaboration skills by preparing examples of how you've worked closely with developers and platform engineers in past roles. This will illustrate your ability to thrive in a team-oriented environment, which is crucial for this position.
✨Tip Number 4
If you have experience in low-latency trading environments or market data systems, make sure to mention it. This niche knowledge can set you apart from other candidates and show that you understand the specific challenges of the industry.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your proficiency in Python, experience with SRE or DevOps roles, and familiarity with tools like Kubernetes and Docker. Use specific examples to demonstrate your skills in system architecture and incident response.
Craft a Compelling Cover Letter: In your cover letter, express your enthusiasm for the role and the company. Mention how your background aligns with their needs, particularly your experience in designing scalable systems and improving operational efficiency.
Showcase Relevant Projects: If you have worked on projects that involved automation tools like Terraform or Ansible, or have experience in low-latency trading environments, be sure to include these in your application. Highlight your contributions and the impact they had.
Prepare for Technical Questions: Anticipate technical questions related to SRE principles, system reliability, and incident management. Be ready to discuss your approach to improving systems and reducing technical debt, as well as your collaboration with cross-functional teams.
How to prepare for a job interview at Xcede
✨Showcase Your Python Skills
As a Site Reliability Engineer, strong proficiency in Python is essential. Be prepared to discuss your experience with Python in detail, especially how you've used it for infrastructure and automation tasks.
✨Demonstrate Your SRE Knowledge
Familiarise yourself with core SRE principles such as SLIs, SLOs, and error budgets. Be ready to explain how you've applied these concepts in previous roles to enhance system reliability.
✨Highlight Collaboration Experience
This role requires strong collaboration with developers and platform engineers. Share specific examples of how you've successfully worked in cross-functional teams to improve systems and reduce technical debt.
✨Prepare for Technical Questions
Expect technical questions related to monitoring, incident response workflows, and system architecture. Brush up on your knowledge of Linux/Unix systems, Kubernetes, Docker, and automation tools like Terraform or Ansible.