At a Glance
- Tasks: As a Site Reliability Engineer, you'll ensure our core services run smoothly and reliably.
- Company: Join Kroo, a fintech company on a mission to revolutionize banking for the better.
- Benefits: Enjoy 25 days of leave, mental health support, and a hybrid work environment.
- Why this job: Be part of a diverse team making a positive impact while growing your tech skills.
- Qualifications: Experience with cloud providers, IaC tools, and programming languages is essential.
- Other info: Flexible working hours and a modern office in Central London await you!
The predicted salary is between 48000 - 84000 £ per year.
We’re on a mission to build the world’s greatest social bank. We believe that banking needs to change for the better. When money is used correctly, it can transform our daily lives and positively impact the planet.
We’re a varied team of experienced tech, customer experience, marketing, legal and banking professionals and we’re looking for highly motivated and inquisitive people who want to understand customer behaviour and how to deliver the best customer experience.
How you will contribute and key responsibilities:
As a Site Reliability Engineer, you’ll take ownership of the implementation, monitoring, maintenance and improvement of our core services. Other responsibilities include:
- Advocate for reliability across the engineering team and within the wider business.
- Create, set and monitor SLOs for Kroo’s core services and manage SLO breaches with product managers and engineers.
- Develop and maintain system wide application monitoring including synthetic transactions to detect and identify potential issues.
- Monitor third party provider performance against agreed SLAs.
- Conduct reliability and resilience tests following chaos engineering practices.
- Work with Kroo’s codebase and infrastructure to improve reliability and resolve resilience issues.
- Build and maintain documentation detailing the configurations of deployed solutions and playbooks for incidents and outages.
- Identify, recommend, and implement changes that will result in positive improvement to performance, resilience and reliability of the platform.
- Assist with the development and testing of the technical Disaster Recovery (DR) Plan and conduct regular DR tests of core systems.
- Assist the product engineering team with major releases and go-lives, whilst managing risk and eliminating downtime.
- Be a part of the on-call schedule for major incident management.
We’re looking for someone who:
To be successful in this role you must have proven experience within the Technology sector with knowledge of the following skills:
- Experience working with common Public Cloud providers such as AWS, Azure or GCP.
- Knowledge and familiarity with IaC tools such as Cloudformation and Terraform.
- Knowledge of at least one programming language, preferably Typescript or Clojure.
- Strong understanding of the Software Development Life Cycle (SDLC) with close collaboration with development teams.
- In-depth knowledge of implementing and utilising monitoring and APM tools (Datadog preferred).
- Knowledge of common SRE practices and understanding of setting and measuring SLOs and error budgets.
- Experience in operating high-performance, high-security and high-availability applications and infrastructure, while minimising downtime and risks.
- Familiarity with microservice architecture and related design patterns.
- Experience managing and resolving technical incidents and service outages.
- Knowledge and awareness of common IT security practices.
- Knowledge of and experience implementing common disaster recovery strategies.
- Excellent oral, written communication and presentation skills.
Experience or knowledge in the following areas is highly desirable but not essential:
- Experience building, deploying, monitoring and operating containerised workloads (Docker, ECS, EC2 / Fargate) in AWS. Relevant certifications (AWS Solutions Architect etc) preferred.
- Working knowledge of agile DevSecOps environments, and CI/CD (Git, Concourse, Terraform for Infrastructure-as-Code).
- Experience working with suppliers to manage performance against agreed SLAs.
- Experience working in a highly regulated industry such as finance involving compliance reporting to regulatory authorities.
What we offer:
At our cutting-edge fintech company, we know that attracting and retaining the best talent means offering top-notch benefits that help our employees thrive both in and outside of work. Check out what we currently offer:
- Generous holiday time: 25 days annual leave, 8 bank holidays, 1 Kroo bank holiday (June 24th), and 1 day off during the week of your birthday.
- Personal days: We know that life can be unpredictable, so we offer 3 personal days to use as needed.
- Employer-sponsored volunteer program: We’re passionate about giving back to our community, and we support our employees in doing the same with up to 4 hours per month of employer-sponsored volunteer time.
- Mental health support: We care about the mental health of our team members and offer access to Spill, our mental health support partner.
- Workplace pension: We want you to feel secure about your future, so we offer a workplace pension with a 5% employee contribution and a 3% employer top-up.
- Top-notch equipment: We provide top-of-the-line equipment necessary for smooth hybrid work, including a MacBook laptop. Additionally, we also offer support in establishing your home office by contributing towards your setup if required.
- Modern office: When you’re in the office, you’ll enjoy access to our modern, bustling workspace in Central London.
- Cycle to Work scheme: We encourage sustainable transportation with our Cycle to Work scheme.
- Electric Car scheme: We’re committed to reducing our carbon footprint, and our Electric Car scheme makes it easy for our employees to do the same.
- Enhanced parental leave: We know that family comes first, and we offer an enhanced parental leave policy to support our employees in starting and growing their families.
- Room for growth: As a fast-paced, high-growth start-up, we’re dedicated to providing our employees with room to grow and excel.
- You get full healthcare for you and your nuclear family via Vitality.
Hybrid Working:
At Kroo Bank, we have a hybrid policy that gives both individuals and teams a lot of freedom when it comes to using the office space to boost productivity. Our London office is a great resource when used effectively. So, employees who can occasionally come to the office are a good fit for how we work right now. Keep in mind that this job involves working from Monday to Friday, with a mix of remote and office work, so you won’t need to be on-site all the time.
Diversity and Inclusion:
We wholeheartedly uphold our commitment to fostering a diverse and inclusive workplace. Every employee is highly regarded, respected, and supported without any form of judgement or prejudice. We consider Diversity, Equality, and Inclusion as fundamental pillars guiding our path in all aspects of our bank. We also ensure that reasonable adjustments are made available to all candidates throughout the recruitment process.
#J-18808-Ljbffr
Site Reliability Engineer employer: Kroo Bank Ltd
Contact Detail:
Kroo Bank Ltd Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer
✨Tip Number 1
Familiarize yourself with the specific tools and technologies mentioned in the job description, such as AWS, Terraform, and Datadog. Having hands-on experience or projects that showcase your skills with these tools can set you apart from other candidates.
✨Tip Number 2
Understand the principles of Site Reliability Engineering (SRE) and be prepared to discuss how you've implemented SLOs and error budgets in past roles. This knowledge will demonstrate your alignment with our mission to enhance reliability across the engineering team.
✨Tip Number 3
Showcase your problem-solving skills by preparing examples of how you've managed technical incidents or service outages in previous positions. Highlighting your ability to remain calm under pressure and effectively resolve issues will resonate well with us.
✨Tip Number 4
Engage with our company culture by researching our values and mission. Be ready to discuss how your personal values align with ours, especially regarding sustainability and community involvement, as these are important aspects of our work environment.
We think you need these skills to ace Site Reliability Engineer
Some tips for your application 🫡
Tailor Your CV: Make sure to customize your CV to highlight your experience with cloud providers, IaC tools, and monitoring systems. Emphasize your familiarity with SRE practices and any relevant programming languages.
Craft a Compelling Cover Letter: In your cover letter, express your passion for improving customer experiences through technology. Mention specific examples of how you've contributed to reliability and performance in previous roles.
Showcase Relevant Projects: If you have worked on projects involving chaos engineering, disaster recovery strategies, or high-availability applications, be sure to include these in your application. Detail your role and the impact of your contributions.
Highlight Communication Skills: Since excellent communication is crucial for this role, provide examples of how you've effectively collaborated with cross-functional teams. Mention any experience in presenting technical information to non-technical stakeholders.
How to prepare for a job interview at Kroo Bank Ltd
✨Understand the Role of a Site Reliability Engineer
Make sure you have a solid grasp of what a Site Reliability Engineer does, especially in terms of monitoring, maintenance, and improvement of core services. Be prepared to discuss your experience with SLOs, incident management, and reliability practices.
✨Showcase Your Technical Skills
Highlight your experience with public cloud providers like AWS, Azure, or GCP, and be ready to talk about your familiarity with IaC tools such as Terraform. If you have experience with programming languages like Typescript or Clojure, make sure to mention that too!
✨Demonstrate Problem-Solving Abilities
Prepare examples of how you've managed technical incidents or service outages in the past. Discuss your approach to resilience testing and how you’ve implemented disaster recovery strategies to ensure high availability.
✨Communicate Effectively
Since excellent communication skills are essential for this role, practice articulating your thoughts clearly. Be ready to explain complex technical concepts in a way that is understandable to non-technical stakeholders, showcasing your ability to collaborate across teams.