At a Glance
- Tasks: Lead platform engineering projects and optimise client infrastructure with innovative solutions.
- Company: Join Focused, a dynamic tech company that values collaboration and client success.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Work in a collaborative environment with a focus on emerging technologies.
- Why this job: Make a real impact by solving complex problems in diverse industries.
- Qualifications: 7+ years in Platform Engineering or DevOps with strong technical skills.
The predicted salary is between 95000 - 130000 £ per year.
Who we are: At Focused, we move quickly to deliver quality software that achieves client outcomes and meets their customer's needs. We strategically partner with our clients to leverage our expertise in design and software, while our clients bring their own domain expertise. We work with a variety of clients from different industries, collaborating as we get new products to market, modernizing legacy systems, or helping teams learn the skills they need to be successful.
Our values: Listen first • We are experts in product practices but life long learners in the domain of our customers. We research, collaborate, and understand. Learn why • We ask questions and talk to users to understand problem spaces, objectives, and goals, which allows us to deeply invest and drive towards the outcomes of our clients. Love your craft • We love diving into a variety of domains and solving problems. We take pride in delivering value, in communicating progress, and guiding our clients to success.
We are seeking an experienced Staff Platform Engineer with deep expertise leading clients and teams, and strong Platform Engineering capabilities to help organizations implement, optimize, and scale their infrastructure.
Key Responsibilities:- Platform Engineering & Infrastructure
- Augment existing infrastructure with integrated observability solutions
- Implement Infrastructure as Code (IaC) solutions using Terraform, Pulumi, CloudFormation, etc.
- Architect and manage Kubernetes clusters with comprehensive monitoring and logging
- Build CI/CD pipelines with embedded observability and automated testing
- Site Reliability Engineering (SRE)
- Establish and maintain Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs)
- Implement error budgets, toil reduction strategies, and capacity planning
- Support incident response procedures and post-mortem processes
- Cloud & DevOps Engineering
- Deploy and manage observability infrastructure across AWS, GCP, and Azure
- Establish security, compliance, and governance frameworks for telemetry data
- Experience automating Agent Evaluations in CI/CD pipelines and observability backends.
- OpenTelemetry & Observability
- Design and implement end-to-end OpenTelemetry solutions across diverse technology stacks
- Configure and deploy OpenTelemetry Collectors for efficient data collection, processing, sampling, and routing
- Establish telemetry pipelines for metrics, traces, and logs across microservices architectures
- Optimize collector configurations for performance, reliability, and cost-effectiveness
- Platform Engineering & DevOps
- 7+ years of Platform Engineering or DevOps experience with focus on site reliability, observability, and incident response
- Proficiency with Infrastructure as Code tools (Terraform, Pulumi, CloudFormation, CDK)
- Strong experience with CI/CD platforms (GitHub Actions, GitLab CI, Jenkins, ArgoCD)
- Cloud & Infrastructure
- Hands-on experience with major cloud providers (AWS, GCP, Azure) and their observability services
- Experience with container technologies (Docker, Podman) and container registries
- Knowledge of networking, security, load balancing, and distributed systems concepts
- Site Reliability Engineering
- Experience implementing SRE practices including error budgets and toil metrics
- Proficiency in incident management, on-call procedures, and post-mortem culture
- Experience with capacity planning, performance optimization, and scalability design
- Programming & Automation
- Proficiency in multiple programming languages preferred (Go, Python, Java, Rust)
- Strong scripting and automation skills (Bash, Python, PowerShell)
- Understanding of software engineering best practices and testing methodologies
- Core Observability & OpenTelemetry
- 3-7 years of experience in observability, monitoring, and distributed systems
- Deep hands-on experience with OpenTelemetry ecosystem, including SDKs, APIs, and specifications
- Proficiency with OpenTelemetry Collector configuration, processors, exporters, and receivers
- Strong understanding of telemetry data models, semantic conventions, and instrumentation best practices
- AI & Agentic Frameworks
- Understanding of Large Language Models (LLMs) and their application in DevOps
- Knowledge of vector databases, embeddings, and retrieval-augmented generation (RAG)
- Experience with AI/ML model deployment and monitoring in production environments
- Leadership & Communication
- Experience leading teams, managing client relationships and expectations
- Strong technical writing and documentation skills
- Ability to present complex technical concepts to diverse stakeholders
- A passion for knowledge sharing
- Systems thinking and ability to design holistic observability solutions
- Strong analytical and troubleshooting skills for complex distributed systems
- Curiosity about emerging technologies, particularly AI applications in operations
- Adaptability to rapidly evolving cloud-native and observability technologies
- Collaborative mindset with focus on enabling developer productivity and system reliability
- Experience with Honeycomb
- Contributions to open-source observability or AI framework projects
- Track record of implementing platform engineering solutions that significantly improved developer experience
- Experience scaling observability infrastructure to handle high event volume
What to know before you apply: You will be expected to work for up to four days a week in person, be it from our office in London or from client sites. The London base salary range for this role is £95,000 - £130,000 GBP.
Staff Platform Engineer in London employer: Focused Labs
At Focused, we pride ourselves on fostering a dynamic work culture that encourages collaboration, continuous learning, and innovation. As a Staff Platform Engineer, you will have the opportunity to work with diverse clients across various industries, enhancing your skills while contributing to impactful projects. Our London office offers a vibrant environment with flexible in-person work arrangements, competitive salaries, and a commitment to employee growth, making it an excellent place for those seeking meaningful and rewarding employment.
StudySmarter Expert Advice🤫
We think this is how you could land Staff Platform Engineer in London
✨Tip Number 1
Network like a pro! Reach out to folks in your industry on LinkedIn or at meetups. A friendly chat can open doors that a CV just can't.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects and contributions. It’s a great way to demonstrate your expertise in Platform Engineering.
✨Tip Number 3
Prepare for interviews by practising common questions and scenarios related to SRE and observability. We want you to feel confident and ready to impress!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets the attention it deserves. Plus, we love seeing candidates who are proactive!
We think you need these skills to ace Staff Platform Engineer in London
Some tips for your application 🫡
Tailor Your CV:Make sure your CV reflects the skills and experiences that align with the Staff Platform Engineer role. Highlight your expertise in Platform Engineering, DevOps, and any relevant projects you've worked on that showcase your problem-solving abilities.
Craft a Compelling Cover Letter:Use your cover letter to tell us why you're passionate about this role and how your background makes you a great fit. Share specific examples of how you've successfully implemented observability solutions or optimised infrastructure in past roles.
Showcase Your Technical Skills:Don’t forget to mention your proficiency with tools like Terraform, Kubernetes, and CI/CD platforms. We love seeing candidates who can demonstrate their hands-on experience with cloud providers and observability technologies.
Apply Through Our Website:We encourage you to apply directly through our website for the best chance of getting noticed. It’s the quickest way for us to see your application and get the ball rolling on your journey with us!
How to prepare for a job interview at Focused Labs
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, especially around Infrastructure as Code tools like Terraform and CloudFormation. Brush up on your Kubernetes knowledge and be ready to discuss how you've implemented CI/CD pipelines in past roles.
✨Showcase Your Problem-Solving Skills
Prepare examples of how you've tackled complex problems in previous positions. Focus on your experience with observability solutions and incident management. Be ready to explain your thought process and the impact of your solutions on client outcomes.
✨Demonstrate Your Collaborative Spirit
Since the role involves working closely with clients and teams, highlight your experience in collaboration. Share stories that showcase your ability to listen, learn, and adapt to different domains while driving towards successful outcomes.
✨Ask Insightful Questions
Prepare thoughtful questions about the company’s approach to platform engineering and their current challenges. This shows your genuine interest in the role and helps you understand how you can contribute to their success.