As a Senior Platform Engineer, you'll play a key role in improving the infrastructure that powers our partner's FinTech platform. You'll work alongside cross-functional teams to ensure the reliability, performance, and scalability of their systems.
About the Role:
Your responsibilities will include administering multiple Kubernetes clusters, managing complex system upgrades, and driving improvements in how we deploy and operate our services.
Responsibilities:
- Kubernetes Management: Administer and optimise multiple Kubernetes clusters, supporting a variety of systems across multiple private and public cloud environments.
- System Upgrades: Plan and execute upgrades for large, complex systems in collaboration with key staff across multiple sites, ensuring minimal disruption and maximum efficiency.
- Release Management: Oversee technical releases, including managing software deployments, de-risking processes, and reviewing change requests.
- Design Evaluation: Identify and evaluate alternative design options, weighing the trade-offs to ensure optimal solutions.
- Cross-Functional Collaboration: Work closely with engineering teams to ensure seamless integration of systems and services.
- Automation: Drive automation improvements and help refine deployment pipelines to increase operational efficiency.
- Incident Management: Participate in incident response and contribute to root cause analysis.
Experience Required:
- Experience managing Kubernetes clusters at scale, including troubleshooting, scaling, and optimising for high availability and performance.
- GCP experience and knowledge of containerisation technologies such as Docker.
- Strong experience with IaC tools such as Terraform, Ansible, or similar.
- Continuous Integration & Deployment (CI/CD) concepts and tools: Experience with GitLab is highly desirable, with knowledge of build tools (Maven, Gradle), binary repositories (Nexus, Artifactory) and code quality tools (SonarQube).
- Release Management: Experience managing technical releases, coordinating deployments, and mitigating risks in a fast-paced environment.
- Observability & Monitoring: Expertise in observability tools such as Prometheus, Grafana, Fluentd, and Kibana to ensure system health and performance.
- Programming & Scripting: Proficiency in scripting languages (e.g., Bash, Python) and understanding of coding concepts.
If you are interested in the above, please apply with an up-to-date copy of your CV and a member of the team will be in touch.
Contact Detail:
Scalers Recruiting Team