At a Glance
- Tasks: Join a cutting-edge FinTech team as a Site Reliability Engineer, focusing on chaos engineering and infrastructure.
- Company: Work with a globally renowned FinTech company committed to innovation and excellence.
- Benefits: Enjoy a fully remote role with competitive pay (£600 - £700pd) and flexible contract terms.
- Why this job: Be part of a dynamic culture that values collaboration, mentorship, and the latest tech trends.
- Qualifications: Advanced cloud expertise, chaos engineering experience, and strong problem-solving skills are essential.
- Other info: This is a 6-month contract role, ideal for those passionate about resilience in distributed systems.
Site Reliability Engineer (SRE) Contract Role, Platform, Chaos Engineering | FinTech, Enterprise | Fully Remote, UK | £600 – £700pd (Outside IR35), 6 months+
The Client:
Owen Thomas has partnered with a globally renowned FinTech company that is looking for exceptional engineers with a genuine interest in working with cutting-edge technology.
Technical Requirements
Infrastructure Expertise:
-Advanced experience with cloud platforms (AWS, GCP, Azure), including designing, deploying, and maintaining scalable infrastructure.
-Strong knowledge of containerization and orchestration tools like Docker and Kubernetes.
-Expertise in Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Pulumi.
Chaos Engineering Proficiency:
-Hands-on experience with chaos engineering tools like Gremlin, Chaos Monkey, or LitmusChaos to design and execute fault injection experiments.
-Proven track record of implementing resilience testing strategies across distributed systems.
Monitoring and Observability:
-Experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, New Relic, Elastic Stack).
-Strong understanding of metrics, logging, and tracing in distributed systems.
Automation and Scripting:
-Proficiency in scripting and automation languages (e.g., Python, Go, Shell, Ruby, or Java).
-Demonstrated ability to automate infrastructure and operational processes.
Incident Management and Root Cause Analysis:
-Expertise in incident response processes, including triage, mitigation, and communication.
-Familiarity with incident management tools like PagerDuty or Opsgenie.
Resilience and Scalability Design:
-Advanced understanding of system design principles, scalability, and high-availability architectures.
-Practical experience with load testing and performance benchmarking tools (e.g., JMeter, Locust, k6).
Soft Skills and Additional Qualities
Strong Problem-Solving Skills:
-Ability to debug and resolve complex issues in production environments.
Cross-Team Collaboration:
-Experience working closely with development, DevOps, and QA teams to implement best practices in reliability and availability.
Proactive Communication:
-Clear and concise communication skills to collaborate with diverse stakeholders and write detailed documentation.
Mentorship and Knowledge Sharing:
-Willingness to mentor other team members in chaos engineering principles and SRE best practices.
Desirable Extras
Certifications:
-Relevant certifications (e.g., AWS Certified DevOps Engineer, CKA, CKAD, or Google Professional Cloud DevOps Engineer).
Experience in Highly Regulated Industries:
-Familiarity with compliance frameworks (e.g., PCI DSS, GDPR, ISO 27001) is advantageous.
Exposure to Emerging Tools and Practices:
-Knowledge of modern chaos engineering trends, such as adaptive resilience testing or AI-driven fault detection.
Performance Monitoring in Legacy Systems:
-Ability to apply SRE and chaos engineering principles in legacy system environments.
If you are interested in applying, please apply here and we will get back to you if it's a good match for the client! We appreciate your patience 🙂
Employer: Owen Thomas | Pending B Corp™
Contact Detail:
Owen Thomas | Pending B Corp™ Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land this Site Reliability Engineer (SRE) contract role
✨Tip Number 1
Make sure to showcase your hands-on experience with chaos engineering tools like Gremlin or Chaos Monkey. Highlight specific projects where you implemented fault injection experiments, as this will demonstrate your practical knowledge in the field.
✨Tip Number 2
Since this role emphasizes collaboration, be prepared to discuss your experience working with cross-functional teams. Share examples of how you've worked closely with development and DevOps teams to enhance system reliability.
✨Tip Number 3
Familiarize yourself with the latest trends in chaos engineering and resilience testing. Being able to discuss adaptive resilience testing or AI-driven fault detection can set you apart from other candidates.
✨Tip Number 4
If you have relevant certifications, make sure to mention them during your discussions. Certifications like AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer can significantly boost your credibility for this role.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with cloud platforms, container orchestration tools, and chaos engineering. Use specific examples that demonstrate your expertise in these areas.
Craft a Strong Cover Letter: In your cover letter, express your genuine interest in the role and the company. Mention your hands-on experience with chaos engineering tools and how you have implemented resilience testing strategies in previous roles.
Showcase Soft Skills: Don't forget to include your strong problem-solving skills and ability to collaborate across teams. Provide examples of how you've communicated effectively with diverse stakeholders or mentored team members.
Highlight Relevant Certifications: If you have any relevant certifications, such as AWS Certified DevOps Engineer or CKA, make sure to mention them prominently in your application. This can set you apart from other candidates.
How to prepare for a job interview at Owen Thomas | Pending B Corp™
✨Showcase Your Technical Expertise
Be prepared to discuss your experience with cloud platforms like AWS, GCP, or Azure. Highlight specific projects where you designed and maintained scalable infrastructure, and be ready to dive into the details of your work with containerization and orchestration tools like Docker and Kubernetes.
✨Demonstrate Chaos Engineering Knowledge
Familiarize yourself with chaos engineering principles and tools such as Gremlin or Chaos Monkey. Be ready to share examples of how you've implemented resilience testing strategies in distributed systems and the outcomes of those experiments.
✨Emphasize Monitoring and Observability Skills
Discuss your experience with monitoring tools like Prometheus or Grafana. Explain how you have utilized metrics, logging, and tracing to enhance system observability and reliability, and provide specific instances where this has made a difference.
✨Highlight Problem-Solving and Collaboration
Prepare to talk about complex issues you've resolved in production environments and how you collaborated with cross-functional teams. Share examples that showcase your proactive communication skills and your ability to mentor others in SRE best practices.