At a Glance
- Tasks: Lead a DevOps team to streamline infrastructure and automate deployments.
- Company: Join a remote-first tech collective focused on AI innovation.
- Benefits: Flexible hours, remote work, and opportunities for personal growth.
- Other info: Collaborative culture with regular team meet-ups and a focus on work-life balance.
- Why this job: Shape the future of AI infrastructure while working with cutting-edge technology.
- Qualifications: Experience in DevOps, automation, and system reliability.
Runware’s infrastructure is the foundation that enables our teams to deliver AI to the world. As DevOps Team Lead, you’ll turn complex, hardware-driven systems into streamlined, developer-friendly platforms. You’ll define how we automate deployments, orchestrate GPUs at scale, and observe workloads in real time. You’ll build systems that detect and recover from issues before users notice, and work closely with engineering and product teams to make shipping faster, safer, and more predictable. You’ll shape the foundation that lets teams move fast with confidence, building infrastructure that is dependable, observable, and designed to scale.
Is this role a fit for you? You thrive at the intersection of infrastructure and innovation. You enjoy unravelling complex systems, tuning performance, and engineering reliability into everything you build. You lead through clarity and example, not process, and elevate the teams around you by simplifying the hard things. You take pride in building systems that are resilient by design and empowering the engineers who depend on them. You understand that reliability is never accidental; it is built through intent, consistency, and a culture that values doing things right.
What this role will entail:
- Providing technical and people leadership to a small DevOps team
- Lead the design and operation of Runware’s infrastructure and orchestration systems
- Build automation and tooling to streamline model deployments, scaling, and hardware utilisation across distributed nodes
- Drive observability, alerting, and reliability practices to detect and resolve issues quickly and proactively
- Collaborate with engineers to optimise throughput, latency, and platform performance at every layer of the stack
- Develop and maintain infrastructure as code and deployment automation to ensure consistency and reproducibility across environments
- Establish and continuously evolve incident management, post-mortems, and reliability reviews as core engineering practices
- Mentor and coach engineers to think operationally, designing systems that fail gracefully and scale predictably
- Champion forward-looking improvements to our orchestration layer, hardware management, and overall infrastructure efficiency
- Have experience operating production systems on bare metal or hybrid environments such as HPC or GPU clusters, optimised for performance and low latency
- Are comfortable writing automation and systems tooling in Python, Go, or similar languages
- Understand container runtimes like Docker and containerd, and have built or worked with orchestration systems beyond Kubernetes
- Are fluent in observability and debugging practices across distributed systems, using logs, metrics, traces, and profiling to drive insight and reliability
- Care deeply about reliability, efficiency, and engineering quality, and know how to embed those values into team culture and everyday practice
- Thrive in fast-moving, evolving environments where impact is measured by how much better systems and teams perform over time
We’re a remote-first collective, meeting in person twice a year to plan, brainstorm, celebrate wins, and enjoy some face-to-face time. We have core hours for cooperative working and calls, but outside of that your calendar is yours. Work the hours that let you perform at your peak while also building a healthy life. Our release cycles are fast and intense, but they’re followed by real downtime. After big pushes we expect the team to unplug, recharge, and come back ready.
Remote DevOps Team Lead in Surrey employer: Runware
Runware is an exceptional employer that champions a remote-first culture, allowing you to work flexibly while leading a talented DevOps team in shaping innovative infrastructure solutions. With a strong emphasis on employee growth, we provide opportunities for mentorship and collaboration, ensuring that your contributions directly impact our mission of delivering AI to the world. Our commitment to work-life balance, combined with regular in-person gatherings to foster team spirit, makes Runware a rewarding place to thrive both personally and professionally.
StudySmarter Expert Advice🤫
We think this is how you could land Remote DevOps Team Lead in Surrey
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, join relevant online communities, and attend meetups. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions. This gives potential employers a taste of what you can do and how you tackle complex systems.
✨Tip Number 3
Prepare for interviews by practising common DevOps scenarios and technical questions. Think about how you would approach real-world problems, especially around automation and reliability, as these are key in our field.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Runware.
We think you need these skills to ace Remote DevOps Team Lead in Surrey
Some tips for your application 🫡
Show Your Passion for DevOps:When writing your application, let your enthusiasm for DevOps shine through! Share specific examples of how you've tackled complex systems and made them more efficient. We love seeing candidates who are genuinely excited about building reliable infrastructure.
Tailor Your Application:Make sure to customise your application to highlight the skills and experiences that align with our job description. Focus on your experience with automation, orchestration, and reliability practices. We want to see how you can contribute to our mission!
Be Clear and Concise:Keep your application straightforward and to the point. Use clear language to describe your achievements and avoid jargon that might confuse us. We appreciate candidates who can communicate effectively, especially in a remote setting!
Apply Through Our Website:We encourage you to submit your application directly through our website. This helps us keep everything organised and ensures your application gets the attention it deserves. Plus, it’s super easy to do!
How to prepare for a job interview at Runware
✨Know Your Tech Inside Out
Make sure you’re well-versed in the technologies mentioned in the job description, like Python, Go, and container runtimes. Brush up on your experience with orchestration systems and be ready to discuss how you've optimised performance in past roles.
✨Showcase Your Leadership Style
As a DevOps Team Lead, your leadership approach is crucial. Prepare examples of how you've led teams through complex challenges, simplified processes, and fostered a culture of reliability and efficiency. Highlight your mentoring experiences and how you've empowered others.
✨Demonstrate Problem-Solving Skills
Be ready to tackle hypothetical scenarios during the interview. Think about how you would handle incidents or optimise systems under pressure. Use the STAR method (Situation, Task, Action, Result) to structure your responses and showcase your analytical thinking.
✨Align with Company Culture
Research Runware’s values and work culture. Be prepared to discuss how you thrive in fast-moving environments and your approach to maintaining a healthy work-life balance. Show that you understand the importance of downtime and team collaboration in a remote-first setting.