BXTI, Site Reliability Engineer - Data, Cloud & Developer Experience in London

BXTI, Site Reliability Engineer - Data, Cloud & Developer Experience in London

London Full-Time 60000 - 80000 £ / year (est.) No working from home possible
Dormont Manufacturing Co

At a Glance

  • Tasks: Lead the charge in adopting SRE methodologies and enhance service reliability.
  • Company: Join a leading firm committed to innovation and collaboration.
  • Benefits: Enjoy competitive pay, flexible work options, and growth opportunities.
  • Other info: Be part of a dynamic team that values curiosity and shared ownership.
  • Why this job: Make a real impact on system reliability while working with cutting-edge technology.
  • Qualifications: Experience in coding, cloud services, and automation tools is essential.

The predicted salary is between 60000 - 80000 £ per year.

Key Responsibilities

  • Provide technical leadership in the understanding and adoption of SRE methodologies across the firm.
  • Incorporate observability standards into code and deployment pipelines.
  • Evolve the SRE standards that are adopted across all teams.
  • Partner with colleagues in various roles and reporting lines to improve service reliability and operational efficiency.
  • Assist developers and engineers directly and through AI assistants.
  • Implement instrumentation and provide comprehensive performance insights to service owners.
  • Ensure monitoring and alerting that reflects the reliability of services for users and enables effective on-call operations.
  • Implement strategic observability tools and work to control overhead in maintenance and cost.
  • Participate in on-call rotations and respond to system incidents to ensure service availability and minimise operational impact.
  • Use automation to manage, maintain, and scale SRE systems with minimal human intervention.
  • Foster a blameless culture while assisting in postmortem discussions and reporting.

Qualifications

  • Ability to write automation scripts, as well as read and troubleshoot code (Python, C#, Typescript, etc.).
  • Make effective use of coding assistants and chat models (Anthropic, Open AI).
  • Proficiency with public cloud providers (strong AWS experience required, preferred Azure experience).
  • Configuration as code, infrastructure management, and CI/CD tooling (Terraform, Puppet, Gitlab CI).
  • Hands-on experience with Docker and container schedulers including AWS ECS & EKS.
  • Excellent troubleshooting skills for Linux, Windows, and Networking.
  • Experience with observability tools (Grafana, Prometheus, Splunk, etc.).
  • Comfortable under pressure with incident management and collaborating during postmortems.
  • Excellent communication and organisational skills.
  • Curiosity and drive to improve systems and processes through a sense of shared ownership.

BXTI, Site Reliability Engineer - Data, Cloud & Developer Experience in London employer: Dormont Manufacturing Co

At Blackstone, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration. Our commitment to employee growth is evident through continuous learning opportunities and the chance to work with cutting-edge technologies in a supportive environment. Located in a vibrant area, we provide our Site Reliability Engineers with unique advantages, including access to industry-leading resources and a strong emphasis on work-life balance.

Dormont Manufacturing Co

Contact Details:

Dormont Manufacturing Co Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land BXTI, Site Reliability Engineer - Data, Cloud & Developer Experience in London

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those already working at Blackstone. A friendly chat can open doors and give you insider info on what they're really looking for.

Tip Number 2

Show off your skills! If you’ve got a portfolio or GitHub with projects that highlight your SRE expertise, make sure to share it. It’s a great way to demonstrate your coding chops and problem-solving abilities.

Tip Number 3

Prepare for the interview by brushing up on your incident management scenarios. Be ready to discuss how you've handled pressure and improved systems in the past. We want to see that blameless culture in action!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re serious about joining the team!

We think you need these skills to ace BXTI, Site Reliability Engineer - Data, Cloud & Developer Experience in London

SRE Methodologies
Observability Standards
Service Reliability
Operational Efficiency
Instrumentation
Performance Insights
Monitoring and Alerting

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter to highlight your experience with SRE methodologies and the specific tools mentioned in the job description. We want to see how your skills align with our needs!

Show Off Your Technical Skills:Don’t hold back on showcasing your coding abilities! Include examples of automation scripts you've written or projects where you've implemented observability tools. This is your chance to shine, so let us know what you can do!

Be Clear and Concise:When writing your application, keep it straightforward and to the point. We appreciate clarity, so make sure your experience and achievements are easy to read and understand. Avoid jargon unless it’s relevant!

Apply Through Our Website:We encourage you to submit your application through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it’s super easy!

How to prepare for a job interview at Dormont Manufacturing Co

Know Your SRE Methodologies

Before the interview, brush up on SRE methodologies and be ready to discuss how you've applied them in your previous roles. Think about specific examples where you improved service reliability or operational efficiency, as this will show your technical leadership skills.

Showcase Your Coding Skills

Be prepared to demonstrate your coding abilities, especially in Python, C#, or Typescript. You might be asked to troubleshoot code or write automation scripts during the interview, so practice common coding challenges and be familiar with using coding assistants.

Familiarise Yourself with Observability Tools

Since observability is a key part of the role, make sure you know your way around tools like Grafana, Prometheus, and Splunk. Be ready to discuss how you've implemented monitoring and alerting in past projects, and how it contributed to service reliability.

Prepare for Incident Management Scenarios

Expect questions about incident management and postmortem discussions. Think of examples where you handled system incidents under pressure, and how you fostered a blameless culture. This will highlight your ability to collaborate effectively and improve processes.