At a Glance
- Tasks: Architect and lead video delivery systems for live and VOD at massive scale.
- Company: Join a leading tech firm revolutionising video infrastructure.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Why this job: Make a significant impact on global video delivery and enhance user experience.
- Qualifications: Experience in large-scale video platforms and strong skills in monitoring systems.
- Other info: Dynamic role with potential for innovation and leadership in a fast-paced environment.
The predicted salary is between £72,000 and £108,000 per year.
We’re hiring a Senior Video Delivery & Monitoring Architect to own the end-to-end architecture for live and VOD delivery at massive scale—from origin and packaging through multi-CDN distribution, playback telemetry, and real-time observability. You’ll set the technical direction, design resilient systems, and lead implementation across infrastructure, APIs, CI/CD, and Kafka-driven event pipelines that power operational excellence and QoE.
What you’ll do
- Architect global video delivery systems at scale.
- Design origin, packaging, and distribution patterns that meet strict SLAs/SLOs for latency, startup time, rebuffering, and availability.
- Define strategies for multi-region, multi-CDN routing, origin shielding, cache efficiency, and failover.
- Drive delivery protocol decisions (e.g., LL-HLS, DASH/CMAF, WebRTC) based on product latency and device requirements.
- Own monitoring & QoE observability architecture.
- Define a unified telemetry model spanning player events, CDN logs, encoder signals, origin metrics, and control-plane events.
- Build and evolve SLO/SLI frameworks (availability, join time, stall ratio, bitrate stability, and error budgets).
- Establish dashboards, alerting standards, and incident workflows; lead postmortems and systemic improvements.
- Design Kafka topic taxonomy, partitioning, retention, compaction, and consumer-group patterns for high-throughput telemetry and operational signals.
- Implement reliable ingestion and processing patterns (idempotency, ordering guarantees where needed, backpressure handling, replayability).
- Define schema strategy and evolution (e.g., Avro/Protobuf + schema registry patterns).
- Architect and implement high-performance control-plane and data-plane APIs (REST/gRPC) for stream provisioning, policy/config management, routing, and monitoring access.
- Ensure versioning, compatibility, authn/authz, rate limiting, and operational safety (feature flags, safe defaults).
- Lead IaC design for multi-environment consistency, modularity, security, and reproducibility (Terraform/Pulumi/etc.).
- Build delivery pipelines that support rapid iteration with safety: automated tests, canaries, progressive delivery, and rollback.
- Plan capacity for peak events; model throughput, latency, and cost tradeoffs across CDN, origin, and telemetry pipelines.
- Establish load testing and chaos/failure injection practices for delivery and monitoring systems.
- Produce architecture docs, run design reviews, mentor engineers, and align stakeholders on the roadmap, risks, and operational priorities.
Required qualifications
- Significant experience designing and operating large-scale video delivery platforms (live and VOD) with global distribution.
- Deep expertise in monitoring and QoE telemetry systems (player instrumentation, log pipelines, metrics, tracing, alerting, SLOs).
- Strong hands‑on experience with Kafka in production (topic/partition design, retention strategy, throughput tuning, consumer scaling, replay, and schema evolution).
- Strong track record with API implementation at scale (REST and gRPC), including versioning, compatibility, security, performance, and reliability patterns.
- Strong experience with Infrastructure as Code (IaC) and operating multi-region environments in a major cloud.
- Strong experience building/owning CI/CD for distributed systems with safe rollout practices.
Nice‑to‑haves
- Multi-CDN steering experience (route optimisation, health‑based failover, traffic shaping, A/B experiments).
- Familiarity with video formats/protocols and operational implications: HLS/LL-HLS, DASH, CMAF, WebRTC, fMP4/TS, DRM, SSAI/CSAI.
- Experience with Kubernetes, service mesh, and platform engineering practices.
- Experience with OpenTelemetry, Prometheus/Grafana, ELK/Splunk, and large‑scale time‑series/log analytics.
- Experience with real‑time alerting for live events (sports/concerts) and war‑room operations.
What success looks like (first 90–180 days)
- Establish a clear reference architecture for delivery and monitoring, including SLOs and telemetry contracts.
- Improve detection and triage time for major incidents via better signals, dashboards, and alert quality.
- Deliver a scalable Kafka‑based telemetry pipeline design with schema/versioning standards and a replay strategy.
- Implement at least one high‑impact resilience improvement (multi‑region or multi‑CDN failover, origin hardening, or pipeline backpressure strategy).
Core competencies we value
- Systems design depth, pragmatic tradeoffs, and strong written architecture communication.
- Operational maturity: incident leadership, postmortems, and durable corrective actions.
- Ability to bridge product goals (latency/QoE) with engineering realities (cost/reliability/velocity).
Senior Video Delivery & Monitoring Architect (Video Infrastructure) employer: Team Creation
Contact details:
Team Creation Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Senior Video Delivery & Monitoring Architect (Video Infrastructure) role
✨Tip Number 1
Network, network, network! Get out there and connect with folks in the video delivery and monitoring space. Attend industry meetups, webinars, or even just grab a coffee with someone who’s already in the field. You never know who might have a lead on your dream job!
✨Tip Number 2
Show off your skills! Create a portfolio that highlights your experience with large-scale video delivery platforms and monitoring systems. Include case studies or projects that demonstrate your expertise in Kafka, API implementation, and IaC. This will make you stand out when we’re looking for someone to own the architecture.
✨Tip Number 3
Prepare for technical interviews by brushing up on your knowledge of video protocols and telemetry systems. Be ready to discuss your experience with SLOs, alerting standards, and incident workflows. We want to see how you think through problems and design resilient systems!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows us you’re genuinely interested in joining our team and contributing to our mission in video delivery and monitoring.
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with video delivery systems and monitoring. We want to see how your skills align with the specific requirements mentioned in the job description.
Showcase Your Technical Skills: Don’t hold back on detailing your hands-on experience with Kafka, APIs, and IaC. We’re looking for someone who can demonstrate their technical prowess, so include relevant projects or achievements that showcase your expertise.
Be Clear and Concise: When writing your application, keep it straightforward and to the point. Use clear language to explain your past experiences and how they relate to the role. We appreciate a well-structured application that’s easy to read!
Apply Through Our Website: We encourage you to submit your application through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it’s super easy to do!
How to prepare for a job interview at Team Creation
✨Know Your Video Delivery Systems
Make sure you brush up on your knowledge of large-scale video delivery platforms, especially live and VOD. Be ready to discuss specific architectures you've designed or worked with, and how they meet strict SLAs/SLOs for latency and availability.
✨Showcase Your Monitoring Expertise
Prepare to talk about your experience with monitoring and QoE telemetry systems. Bring examples of how you've defined telemetry models and built dashboards that improved incident response times. This will show your understanding of operational excellence.
✨Demonstrate Your Kafka Knowledge
Since Kafka is a key part of the role, be ready to dive deep into your hands-on experience. Discuss your approach to topic design, retention strategies, and how you've handled high-throughput telemetry. Real-world examples will make your expertise shine.
✨Highlight Your Technical Leadership Skills
This role requires strong technical leadership, so prepare to share instances where you've produced architecture docs, run design reviews, or mentored engineers. Emphasise your ability to align stakeholders on roadmaps and operational priorities.