Bury Freelance No home office possible

At a Glance

Tasks: Join a cutting-edge FinTech team as a Site Reliability Engineer, focusing on chaos engineering and infrastructure.
Company: Work with a globally renowned FinTech company committed to innovation and excellence.
Benefits: Enjoy a fully remote role with competitive pay (£600 - £700pd) and flexible contract terms.
Why this job: Be part of a dynamic culture that values collaboration, mentorship, and the latest tech trends.
Qualifications: Advanced cloud expertise, chaos engineering experience, and strong problem-solving skills are essential.
Other info: This is a 6-month contract role, ideal for those passionate about resilience in distributed systems.

Site Reliability Engineer (SRE) Contract Role, Platform, Chaos Engineering | FinTech, Enterprise | Fully Remote, UK | £600 – 700pd (Outside IR35), 6 months+

The Client:

Owen Thomas has partnered with a globally renowned FinTech company that is looking for exceptional engineers who have a genuine interest in working with cutting-edge technology.

Technical Requirements

Infrastructure Expertise:

-Advanced experience with cloud platforms (AWS, GCP, Azure), including designing, deploying, and maintaining scalable infrastructure.

-Strong knowledge of container orchestration tools like Kubernetes and Docker.

-Expertise in Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi.

Chaos Engineering Proficiency:

-Hands-on experience with chaos engineering tools like Gremlin, Chaos Monkey, or LitmusChaos to design and execute fault injection experiments.

-Proven track record of implementing resilience testing strategies across distributed systems.

Monitoring and Observability:

-Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, New Relic, Elastic Stack).

-Strong understanding of metrics, logging, and tracing in distributed systems.

Automation and Scripting:

-Proficiency in scripting and automation languages (e.g., Python, Go, Shell, Ruby, or Java).

-Demonstrated ability to automate infrastructure and operational processes.

Incident Management and Root Cause Analysis:

-Expertise in incident response processes, including triage, mitigation, and communication.

-Familiarity with incident management tools like PagerDuty or Opsgenie.

Resilience and Scalability Design:

-Advanced understanding of system design principles, scalability, and high-availability architectures.

-Practical experience with load testing and performance benchmarking tools (e.g., JMeter, Locust, k6).

Soft Skills and Additional Qualities

Strong Problem-Solving Skills:

-Ability to debug and resolve complex issues in production environments.

Cross-Team Collaboration:

-Experience working closely with development, DevOps, and QA teams to implement best practices in reliability and availability.

Proactive Communication:

-Clear and concise communication skills to collaborate with diverse stakeholders and write detailed documentation.

Mentorship and Knowledge Sharing:

-Willingness to mentor other team members in chaos engineering principles and SRE best practices.

Desirable Extras

Certifications:

-Relevant certifications (e.g., AWS Certified DevOps Engineer, CKA, CKAD, or Google Professional Cloud DevOps Engineer).

Experience in Highly Regulated Industries:

-Familiarity with compliance frameworks (e.g., PCI DSS, GDPR, ISO 27001) is advantageous.

Exposure to Emerging Tools and Practices:

-Knowledge of modern chaos engineering trends, such as adaptive resilience testing or AI-driven fault detection.

Performance Monitoring in Legacy Systems:

-Ability to apply SRE and chaos engineering principles in legacy system environments.

If you are interested, please apply here and we will get back to you if it’s a good match for the client! We appreciate your patience 🙂

Employer: Owen Thomas | Pending B Corp™

Owen Thomas is an exceptional employer that offers a fully remote work environment, allowing you to thrive in your role as a Site Reliability Engineer while enjoying the flexibility of working from anywhere in the UK. With a strong focus on cutting-edge technology and chaos engineering, the company fosters a collaborative culture that encourages continuous learning and mentorship, providing ample opportunities for professional growth. Join a globally renowned FinTech company where your expertise will be valued, and you can make a meaningful impact on resilient and scalable infrastructure.

Contact Details:

Owen Thomas | Pending B Corp™ Recruiting Team

StudySmarter Expert Advice 🤫

✨Tip Number 1

Make sure to showcase your hands-on experience with chaos engineering tools like Gremlin or Chaos Monkey. Highlight specific projects where you implemented fault injection experiments, as this will demonstrate your practical knowledge in the field.

✨Tip Number 2

Since this role emphasizes collaboration, be prepared to discuss your experience working with cross-functional teams. Share examples of how you've worked closely with development and DevOps teams to enhance system reliability.

✨Tip Number 3

Familiarize yourself with the latest trends in chaos engineering and resilience testing. Being able to discuss adaptive resilience testing or AI-driven fault detection can set you apart from other candidates.

✨Tip Number 4

If you have relevant certifications, make sure to mention them during your discussions. Certifications like AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer can significantly boost your credibility for this role.

We think you need these skills to ace the Site Reliability Engineer (SRE) contract role

Cloud Platform Expertise (AWS, GCP, Azure)

Container Orchestration (Kubernetes, Docker)

Infrastructure as Code (IaC) Tools (Terraform, CloudFormation, Pulumi)

Chaos Engineering Tools (Gremlin, Chaos Monkey, LitmusChaos)

Resilience Testing Strategies

Monitoring and Observability Tools (Prometheus, Grafana, Datadog, New Relic, Elastic Stack)

Scripting and Automation Languages (Python, Go, Shell, Ruby, Java)

Incident Response Processes

Incident Management Tools (PagerDuty, Opsgenie)

System Design Principles

Load Testing and Performance Benchmarking Tools (JMeter, Locust, k6)

Problem-Solving Skills

Cross-Team Collaboration

Proactive Communication

Mentorship in Chaos Engineering and SRE Best Practices

Relevant Certifications (AWS Certified DevOps Engineer, CKA, CKAD, Google Professional Cloud DevOps Engineer)

Familiarity with Compliance Frameworks (PCI DSS, GDPR, ISO 27001)

Knowledge of Modern Chaos Engineering Trends

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience with cloud platforms, container orchestration tools, and chaos engineering. Use specific examples that demonstrate your expertise in these areas.

Craft a Strong Cover Letter: In your cover letter, express your genuine interest in the role and the company. Mention your hands-on experience with chaos engineering tools and how you have implemented resilience testing strategies in previous roles.

Showcase Soft Skills: Don't forget to include your strong problem-solving skills and ability to collaborate across teams. Provide examples of how you've communicated effectively with diverse stakeholders or mentored team members.

Highlight Relevant Certifications: If you have any relevant certifications, such as AWS Certified DevOps Engineer or CKA, make sure to mention them prominently in your application. This can set you apart from other candidates.

How to prepare for a job interview at Owen Thomas | Pending B Corp™

✨Showcase Your Technical Expertise

Be prepared to discuss your experience with cloud platforms like AWS, GCP, or Azure. Highlight specific projects where you designed and maintained scalable infrastructure, and be ready to dive into the details of your work with container orchestration tools like Kubernetes and Docker.

✨Demonstrate Chaos Engineering Knowledge

Familiarize yourself with chaos engineering principles and tools such as Gremlin or Chaos Monkey. Be ready to share examples of how you've implemented resilience testing strategies in distributed systems and the outcomes of those experiments.

✨Emphasize Monitoring and Observability Skills

Discuss your experience with monitoring tools like Prometheus or Grafana. Explain how you have utilized metrics, logging, and tracing to enhance system observability and reliability, and provide specific instances where this has made a difference.

✨Highlight Problem-Solving and Collaboration

Prepare to talk about complex issues you've resolved in production environments and how you collaborated with cross-functional teams. Share examples that showcase your proactive communication skills and your ability to mentor others in SRE best practices.

Application deadline: 2027-01-21