At a Glance
- Tasks: Own and optimise core infrastructure for cutting-edge AI products.
- Company: Join a YC-backed startup revolutionising software engineering with AI.
- Benefits: Competitive salary, equity options, 30 days holiday, and a dog-friendly office.
- Why this job: Shape the future of AI while enjoying a balanced work-life culture.
- Qualifications: 5+ years in cloud infrastructure and strong Kubernetes experience required.
- Other info: Collaborative environment with excellent career growth and team socials.
The predicted salary is between £60,000 and £90,000 per year.
Location: London; full in-office working as default
Start date: ASAP
Reports to: CTO
Compensation: £60k - £90k + Equity
At Cosine, we’re building autonomous AI engineers that plan, write, and ship code inside real development workflows. Cosine is designed for on-premise and virtual private cloud (VPC) deployments, including fully air-gapped environments. We build our agent tooling entirely in-house and post-train open-source models to deliver reliable, enterprise-grade coding performance in security-critical settings. In 2024, Cosine achieved a 72% score on OpenAI’s SWE-Lancer benchmark, placing us among the strongest real-world software-engineering AI systems evaluated. YC-backed and well-funded, Cosine was founded by experienced operators focused on building dependable, production-grade AI.
This role is based in our Hoxton office, five days a week, because close collaboration, fast feedback, and shared context matter for the problems we’re solving.
The role
We’re looking for a DevOps / Senior Platform / Infra Engineer to own the core infrastructure that powers Cosine’s products, from Kubernetes and deployment pipelines to networking and platform services. You’ll design and run the “paved road” that our engineers, researchers, and customers build on: reliable Kubernetes clusters, fast and safe CI/CD, solid observability, and hardened environments for demanding enterprise and on-prem deployments. You’ll also wear a classic “DevOps/SRE” hat: thinking in SLOs, running incident response, and keeping us up even as we move quickly. This is a high-ownership role at a fast-paced, venture-backed Silicon Valley startup. You’ll work directly with founding engineers and leadership, and your decisions will materially shape how we build and ship products.
What You’ll Do
- Own core infrastructure
- Design, operate, and evolve our Kubernetes-based platform (EKS or similar), including cluster topology, node groups, autoscaling, and multi-environment isolation. Manage supporting cloud resources: container registries, load balancers, queues, caches, and data infra needed to run our APIs and agents.
- Design and maintain CI/CD pipelines for image builds and infra rollouts (e.g. Pulumi/Terraform + Helm/Docker). Implement safe rollout strategies (blue/green, canary, staged rollouts) and fast rollback paths. Build internal tools and abstractions that make it easy for product teams to self-serve infra safely.
- Define and track SLOs/SLIs for key services (latency, error rates, availability). Improve our observability stack (metrics, logs, traces, alerts) so issues are obvious, actionable, and debuggable. Participate in the on-call rotation, lead incident response when needed, and drive blameless post-mortems and fixes.
- Design and maintain networking: VPCs, subnets, ingress/egress, service meshes / L7 routing, DNS, and TLS. Implement least-privilege access via IAM, secure secret management, and hardened configurations for multi-tenant and isolated customer environments. Help design patterns for secure enterprise and on-prem / regulated deployments.
- Work closely with application, ML, and research teams to understand their needs and translate them into reusable infra building blocks. Provide guidance on “how to run this in production” — capacity planning, failure modes, and operational readiness reviews.
What We’re Looking For
- Have strong experience
- 5+ years building and operating production infrastructure on a major cloud (AWS, GCP, or Azure). Significant hands-on experience running Kubernetes in production (EKS/GKE/AKS or self-managed): cluster upgrades, autoscaling, node group design, and multi-env setups. Experience packaging services with Helm or a similar tool.
- Deep experience with IaC tools (Pulumi, Terraform, CDK, or similar). Comfortable managing infra changes via code review, CI, and automated rollouts.
- Have owned the uptime and performance of user-facing systems. Comfortable participating in (and improving) on-call rotations and incident management. Experience setting up / tuning observability (Prometheus, Grafana, CloudWatch, OpenTelemetry, etc.).
- You’ve built internal tools, libraries, or platforms on top of cloud providers so product teams can move faster with fewer foot-guns. You think about developer experience and “golden paths,” not just raw infra.
- Strong scripting and programming skills in at least one modern language (e.g. TypeScript, Go, Python). Happy to dive into app code when needed to debug a production issue or improve an integration.
- Enjoy working in a fast-moving environment with evolving priorities and incomplete specs. Bias toward pragmatic solutions: ship something small, measure, iterate. Communicate clearly, give/receive direct feedback, and collaborate across functions.
Nice To Have (Not Required)
- Experience with AWS primitives like EKS, ECS/Fargate, ECR, SQS, ElastiCache/Redis.
- Experience with Argo CD or other GitOps tools for Kubernetes.
- Experience with on-prem, air-gapped, or regulated industry deployments (e.g. finance, healthcare).
- Experience with AI/ML infrastructure (GPU workloads, model hosting, feature stores).
- Prior experience as an early infra / platform hire at a startup.
Cosine is an equal opportunity employer. We value diverse backgrounds, perspectives, and ways of thinking, and we’re committed to creating an inclusive and respectful workplace. We encourage applications from anyone who meets the role requirements, even if you don’t meet every single qualification. If you need reasonable adjustments at any stage of the hiring process, we’re happy to discuss them.
Compensation, Benefits & Ways Of Working
We’re an in-office team, five days a week, by design. We believe the work we’re doing benefits from being together, collaborating closely, and building shared context.
What You Can Expect
- Competitive salary, benchmarked to the market
- Equity / share options, so you share in the upside you help create
- 30 days’ holiday + bank holidays
- Genuine 9–5 working hours — we don’t expect late nights or weekend work
- Work hard in the office, collaborate closely, and switch off properly
- Dog-friendly office — bring your dog to work
- Daily lunch provided
- Monthly team breakfasts
- Monthly socials
- Pension
- High-quality equipment to do your best work
We care about focus, sustainability, and doing great work — not performative overwork. We value people who show up, contribute thoughtfully, collaborate well with their colleagues, and then go home. This role won’t suit everyone. But if you want structure, clarity, strong collaboration, and a team that takes both the work and work-life balance seriously, it’s a great place to be.
Agency & Data Protection Notice
To comply with UK GDPR and our internal data-protection and equal-opportunity obligations, we only accept candidate applications and agency submissions via our Applicant Tracking System (ATS). This ensures appropriate privacy notices, lawful processing, auditability, and consistent retention controls. Any CVs or candidate details received outside the ATS (including via email, Slack, or direct message) will be treated as unsolicited, will not be considered as part of the recruitment process, and will not give rise to any fee or payment obligation.
Contact Detail:
Cosine Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the DevOps Engineer role at Cosine
✨Tip Number 1
Network like a pro! Get out there and connect with folks in the industry. Attend meetups, tech conferences, or even local events. You never know who might be looking for a DevOps Engineer just like you!
✨Tip Number 2
Show off your skills! Create a personal project or contribute to open-source. This not only sharpens your skills but also gives you something tangible to discuss during interviews. Plus, it shows you're passionate about what you do!
✨Tip Number 3
Prepare for those interviews! Research common DevOps interview questions and practice your answers. Be ready to discuss your experience with Kubernetes, CI/CD pipelines, and reliability. Confidence is key!
✨Tip Number 4
Apply through our website! We love seeing applications come directly from candidates who are excited about joining us at Cosine. It shows initiative and helps us get to know you better right from the start.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the DevOps role at Cosine. Highlight your experience with Kubernetes, cloud infrastructure, and any relevant tools you've used. We want to see how your skills align with what we're looking for!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for DevOps and how you can contribute to our mission at Cosine. Be sure to mention specific projects or experiences that showcase your expertise.
Showcase Your Problem-Solving Skills: In your application, don’t just list your skills—show us how you've used them to solve real problems. Whether it’s improving uptime or streamlining CI/CD processes, we love seeing concrete examples of your impact.
Apply Through Our Website: Remember to apply through our website! It’s the best way to ensure your application gets the attention it deserves. Plus, it helps us keep everything organised and efficient as we review applications.
How to prepare for a job interview at Cosine
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, especially Kubernetes and cloud services like AWS, GCP, or Azure. Brush up on your experience with infrastructure-as-code tools like Terraform or Pulumi, as these will likely come up during technical discussions.
✨Demonstrate Problem-Solving Skills
Prepare to discuss specific challenges you've faced in previous roles, particularly around reliability and incident management. Think of examples where you improved uptime or resolved critical issues, as this shows you can handle the fast-paced environment at Cosine.
✨Showcase Your Collaboration Skills
Since this role involves working closely with product and research teams, be ready to talk about how you’ve successfully collaborated in the past. Highlight any experiences where you translated technical needs into actionable solutions, demonstrating your ability to communicate effectively across functions.
✨Ask Insightful Questions
Prepare thoughtful questions about the company’s culture, the team dynamics, and the specific challenges they face. This not only shows your genuine interest in the role but also helps you assess if Cosine is the right fit for you. Remember, interviews are a two-way street!