At a Glance
- Tasks: Design and manage cloud infrastructure, ensuring reliability and performance of services.
- Company: Join Delta Capita, a global leader in financial services and technology innovation.
- Benefits: Enjoy hybrid working, competitive salary, and opportunities for professional growth.
- Why this job: Make a real impact by optimising cloud solutions and mentoring junior engineers.
- Qualifications: Experience in cloud services, IaC, and container orchestration is essential.
- Other info: Be part of a diverse team committed to innovation and continuous improvement.
The predicted salary is between £36,000 and £60,000 per year.
We are seeking a highly skilled and motivated Senior Site Reliability Engineer (SRE) to join our engineering team and support critical application deployments in a follow-the-sun environment. In this role, you will leverage your expertise in cloud provisioning, infrastructure as code, and container orchestration to ensure the reliability, scalability, and performance of our services. You will collaborate closely with development teams to design and implement robust infrastructure solutions using Azure, GCP, AWS, and containerized technologies.
The Role and Responsibilities
- Cloud Infrastructure Management: Design, implement, and manage cloud infrastructure in Azure and AWS, ensuring alignment with best practices and organizational standards.
- Infrastructure as Code (IaC): Utilize Terraform (HCL), AWS CDK, and AWS CloudFormation for scalable and maintainable IaC, enabling safe and efficient infrastructure builds, changes, and versioning.
- Containerization and Orchestration: Deploy, manage, and provide ongoing support for containerized applications using Kubernetes, including Amazon EKS and Azure Kubernetes Service (AKS), ensuring their reliability, availability, and performance.
- Monitoring and Alerting: Monitor application performance and system health through observability tools (e.g., Prometheus, Grafana, ELK stack), proactively identifying and resolving issues to ensure high availability and rapid incident response.
- Security and IAM: Implement security best practices, managing Identity and Access Management (IAM) policies across cloud environments. Utilize technologies such as OpenID Connect (OIDC), OAuth2, and SAML SSO to ensure secure authentication and authorization across services.
- Database Technologies: Manage and optimize database systems, including SQL databases and MongoDB, ensuring high availability, performance tuning, and data security.
- CI/CD Practices: Automate manual processes to enhance operational efficiency, employing CI/CD best practices for efficient code deployment.
- Scripting Languages: Demonstrate proficient scripting skills in languages such as Java, TypeScript, and Python to automate tasks and manage configurations.
- Load Balancing: Implement and maintain load balancing solutions to ensure optimal distribution of application traffic and high availability.
- Collaboration with Development Teams: Collaborate with software engineering teams to design, develop, and maintain robust systems and solutions, including RESTful APIs, ensuring seamless integration across platforms.
- Post-Mortem Analysis: Conduct comprehensive post-mortem analyses following incidents, identifying root causes and recommending improvements to enhance system reliability and performance.
- Mentorship: Mentor and guide junior engineers, fostering a culture of knowledge sharing and continuous improvement within the engineering team.
Skills and Experience
- Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.
- Proven work experience as a Site Reliability Engineer, DevOps Engineer, or in a similar role within a high-availability environment.
- Strong experience with Azure, GCP, and AWS cloud services, including a deep understanding of cloud architecture and services.
- Expertise in Infrastructure as Code (IaC) using Terraform (HCL) and AWS CloudFormation.
- Experience with AWS CDK for programmatic management of cloud resources, primarily using TypeScript.
- Hands-on experience with container orchestration technologies, particularly Kubernetes.
- Familiarity with version control systems (e.g., Git) and CI/CD pipelines for efficient code deployment.
- Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack) to ensure system observability.
- Strong experience with SQL databases and AWS DynamoDB, focusing on performance tuning and optimization.
- Proven ability to design and manage RESTful APIs, ensuring their reliability and scalability.
- Excellent troubleshooting skills, with a proactive approach to resolving complex technical issues.
- Strong communication and teamwork skills, enabling effective collaboration across cross-functional teams.
- A curious and open-minded attitude, committed to challenging the status quo and exploring innovative solutions.
Nice-to-have
- Experience with networking concepts and troubleshooting in cloud environments.
- Knowledge of security best practices in cloud computing.
- Contributions to open-source projects or the creation of technical articles/blog posts to share knowledge with the community.
- Familiarity with service mesh technologies.
- Exposure to Agile methodologies and project management tools.
- Financial services domain knowledge.
How We Work
Delta Capita is an equal opportunity employer. We positively encourage applications from suitably qualified and eligible candidates regardless of age, colour, disability, national origin, ancestry, race, religion, gender, sexual orientation, gender identity and/or expression, veteran status, genetic information, or any other status protected by applicable law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. If you require any reasonable adjustments through your interview process, please use the designated space within the application questionnaire.
This is a permanent, full-time role based in London with hybrid working. As the selection and interview process is ongoing, please submit your application in English as soon as possible. If your profile is selected, a member of our team will contact you within 4 weeks. For this role, a valid work permit for Poland is mandatory.
Who We Are
Delta Capita Group (a member of the Prytek Group) is a global managed services, consulting, and solutions provider that combines Financial Services experience with technology innovation capability. Our mission is to reinvent the financial services value chain by providing technology-based mutualized services that cover financial institutions' non-differentiating activities.
Our Offerings
- Managed Services
- Consulting & Solutions
- Technology
To learn more about Delta Capita and our culture, see the section about Working at DC - Delta Capita.
Senior Site Reliability Engineer - Azure in London. Employer: Delta Capita Group
Contact Detail:
Delta Capita Group Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Senior Site Reliability Engineer - Azure role in London
✨Tip Number 1
Network like a pro! Reach out to your connections in the industry, attend meetups, and engage in online forums. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to Azure, Kubernetes, and IaC. This gives potential employers a tangible look at what you can do.
✨Tip Number 3
Prepare for interviews by practising common SRE scenarios. Brush up on your troubleshooting skills and be ready to discuss how you've handled incidents in the past. We want to see your thought process!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in joining our team. Don’t miss out on this opportunity!
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Senior Site Reliability Engineer role. Highlight your experience with Azure, GCP, and AWS, and don’t forget to showcase your skills in Infrastructure as Code and container orchestration.
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about SRE and how your background makes you a perfect fit for our team. Be sure to mention any relevant projects or achievements.
Showcase Your Technical Skills: In your application, be specific about your technical skills. Mention your experience with Terraform, Kubernetes, and monitoring tools like Prometheus. We love seeing concrete examples of how you've used these technologies!
Apply Through Our Website: We encourage you to apply through our website for the best chance of getting noticed. It’s super easy, and you’ll be able to keep track of your application status. Plus, we can’t wait to see what you bring to the table!
How to prepare for a job interview at Delta Capita Group
✨Know Your Cloud Inside Out
Make sure you brush up on your knowledge of Azure, GCP, and AWS. Be ready to discuss specific projects where you've implemented cloud solutions, focusing on how you ensured reliability and scalability. This will show that you not only understand the theory but have practical experience too.
✨Show Off Your IaC Skills
Be prepared to talk about your experience with Infrastructure as Code, especially using Terraform and AWS CloudFormation. Bring examples of how you've used these tools to automate infrastructure management and ensure best practices. This will demonstrate your technical prowess and attention to detail.
✨Containerisation Know-How
Since container orchestration is key for this role, be ready to discuss your experience with Kubernetes and any challenges you've faced. Share how you've deployed and managed containerised applications, and highlight any performance tuning you've done to ensure high availability.
✨Collaboration is Key
This role involves working closely with development teams, so be prepared to share examples of how you've collaborated in the past. Discuss how you’ve contributed to designing robust systems and how you handle post-mortem analyses after incidents. This will showcase your teamwork skills and proactive approach.