At a Glance
- Tasks: Design and maintain cloud infrastructure using Kubernetes and Terraform for a secure platform.
- Company: Join Menlo Security, a leader in secure connectivity for top enterprises.
- Benefits: Collaborative culture, growth opportunities, and a chance to make a real impact.
- Other info: Dynamic work environment with a focus on automation and reliability.
- Why this job: Be part of a mission-driven team that values innovation and security.
- Qualifications: Experience with cloud platforms, Kubernetes, and scripting languages like Python.
The predicted salary is between 60000 - 80000 £ per year.
Menlo Security's mission is enabling the world to connect, communicate and collaborate securely without compromise. COVID-19 has made our mission all the more real. We support customers across various enterprises including Fortune 500 companies, 9/10 of the largest global banks and the Department of Defense. The world has fundamentally changed. We are growing from 400 employees into the next phase of our journey, and we need passionate talent filled with empathy and agility. The right candidate for the job is ethical, hyper-organized, fanatical about seeing things through to completion, service-oriented, and humble enough to take feedback and coaching yet confident enough to provide feedback and coaching.
About the Role: Platform Infrastructure Engineering is responsible for building and operating Menlo Security's Infrastructure Platform. Together with the rest of our engineering teams, we enable our customers to connect to the Internet without compromise. Our environment provides services globally. We expect failure, build security in by design, create evolvable systems, and enable multi-tenancy across the infrastructure. Automation is an absolute for us. We are committed to getting it done properly, the first time.
As a Platform Infrastructure Engineer, you'll join a group of experienced engineers who are part of a globally distributed team responsible for building and managing the company's core infrastructure services and maintaining our constantly growing platform. The team operates a sophisticated cloud-native infrastructure built on Google Kubernetes Engine and VMs spanning multiple environments globally from development to production. We manage infrastructure as code with Terraform and Spacelift orchestration, and deploy services using Helm charts. Our platform emphasizes security-first design, comprehensive observability, and multi-region resilience. Success in this role requires working with a vast VM fleet in AWS and GCP as well as Kubernetes, writing Infrastructure as Code, and a passion for automation and reliability engineering.
Responsibilities:
- Design, deploy, and maintain VM and Kubernetes infrastructure on GCP and AWS across dozens of clusters spanning development, staging, and production environments in multiple regions.
- Coordinate with your peers in your direct team as well as across teams to ensure that the tasks you’re working on are going to solve the problems that we need them to solve.
- Build and maintain Infrastructure as Code (IaC) using Terraform modules, managing resources through Spacelift or equivalent Terraform Automation and Collaboration Software (TACOS).
- Provision cloud infrastructure including networking, compute, storage, and security components primarily on GCP, with secondary AWS support.
- Implement and manage workflows with sophisticated multi-layer configuration management.
- Build and maintain comprehensive observability solutions using Grafana Cloud, Prometheus/Mimir, and OTel collectors.
- Design Grafana dashboards, configure alerting rules, and ensure visibility across all platform components.
- Manage certificate lifecycle, DNS automation, ingress controllers, and service mesh networking with Cilium.
- Partner with Engineering, Product, Compliance, and Security teams to design resilient, scalable systems.
- Consult on capacity planning, disaster recovery, and architectural decisions for cloud-native applications.
- Identify and eliminate toil through automation. Write scripts, develop tools, and build CI/CD pipelines to improve operational efficiency and reduce manual work.
- Participate in a 24x7 on-call rotation as part of a globally distributed team, responding to incidents and driving post-incident reviews.
Requirements:
- Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience.
- Proficiency in common programming & scripting languages. We use a lot of python, bash and go.
- Understanding of network topologies, communication protocols (ie. TCP/IP, HTTP/S, UDP, TLS) and enterprise grade connectivity solutions.
- Kubernetes expertise including cluster administration, RBAC, networking, workload management, and troubleshooting across production environments.
- Proven experience with Terraform for infrastructure provisioning and management.
- Knowledge of Google Cloud Platform services including GKE, VPC networking, Cloud DNS, Artifact Registry, Secret Manager, IAM, Gemini Code Assist, and Workload Identity.
- Experience with GitOps methodologies and tools.
- Clear understanding of how to use LLM code assist tools to effectively build software.
Our culture is collaborative, inclusive, and fun! We have five core values: Stay Aligned, Get It Done, Customer Empathy, Think Creatively and Help Each Other Out. We believe in open communication, supporting new ideas, and sharing a mutual mindset of what we’re aiming to achieve together. There are tremendous opportunities to take initiative, implement new ideas, and have a hand in building a legacy.
Platform Infrastructure Engineer (SRE Core) employer: Menlo Security
Menlo Security is an exceptional employer that fosters a collaborative and inclusive work culture, where employees are encouraged to take initiative and contribute to meaningful projects. With a strong focus on employee growth and development, team members have the opportunity to work with cutting-edge technologies in a supportive environment that values creativity and open communication. Located in a dynamic industry, Menlo offers competitive benefits and the chance to be part of a mission-driven company that prioritises security and innovation.
StudySmarter Expert Advice🤫
We think this is how you could land Platform Infrastructure Engineer (SRE Core)
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, especially those already at Menlo Security. A friendly chat can give you insights into the company culture and maybe even a referral!
✨Tip Number 2
Show off your skills! Prepare a mini-project or a demo that highlights your expertise in Kubernetes and Terraform. This hands-on approach can really impress during interviews.
✨Tip Number 3
Be ready to discuss real-world scenarios. Think about how you've tackled challenges in past roles, especially around automation and reliability engineering. We love hearing about your problem-solving skills!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining our team at Menlo Security.
We think you need these skills to ace Platform Infrastructure Engineer (SRE Core)
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter for the Platform Infrastructure Engineer role. Highlight your experience with Kubernetes, Terraform, and cloud services like GCP and AWS. We want to see how your skills align with our mission!
Showcase Your Passion for Automation:Since automation is key for us, share specific examples of how you've implemented automation in your previous roles. Whether it's through CI/CD pipelines or scripting, let us know how you’ve made processes more efficient.
Be Clear and Concise:When writing your application, keep it straightforward. Use bullet points where possible and avoid jargon that might confuse us. We appreciate clarity and want to understand your experience without wading through fluff!
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!
How to prepare for a job interview at Menlo Security
✨Know Your Tech Inside Out
Make sure you brush up on your knowledge of Kubernetes, Terraform, and cloud services like GCP and AWS. Be ready to discuss your experience with these technologies in detail, as well as any challenges you've faced and how you overcame them.
✨Showcase Your Problem-Solving Skills
Prepare examples of how you've tackled complex infrastructure issues in the past. Menlo Security values candidates who can think critically and provide solutions, so be ready to demonstrate your analytical skills and how you approach problem-solving.
✨Emphasise Collaboration
Since the role involves working closely with various teams, highlight your experience in cross-team collaboration. Share specific instances where you’ve partnered with engineering, product, or security teams to achieve a common goal.
✨Be Ready for Automation Talk
Automation is key for this role, so come prepared to discuss your experience with CI/CD pipelines and scripting. Be ready to share how you've eliminated manual processes through automation and the impact it had on operational efficiency.