At a Glance
- Tasks: Own and optimise ML infrastructure for cutting-edge healthcare AI solutions.
- Company: Join Circadia Health, a pioneering healthcare AI company transforming senior care.
- Benefits: Enjoy hybrid or remote work, competitive salary, and opportunities for professional growth.
- Other info: Be part of a dynamic team driving healthcare innovation and improving lives.
- Why this job: Make a real impact on patient care with innovative AI technology.
- Qualifications: 4+ years in MLOps or related fields, strong Python skills, and AWS experience.
The predicted salary is between 60000 - 80000 £ per year.
Circadia Health is a growth-stage healthcare AI company on a mission to prevent avoidable hospitalizations and transform senior-care operations. Our Circadia Intelligence Platform combines:
- Contactless sensing that monitors respiration and motion with medical-grade accuracy
- Predictive analytics & agentic AI workflows that detect 85% of preventable rehospitalizations ~11 days in advance
- Enterprise integrations that embed insights directly into EHR, care-coordination, billing, and compliance systems
Today our technology touches 40,000+ post-acute patients daily across skilled-nursing, home-health, and home-care networks. We are backed by leading healthcare and AI investors like Khosla Ventures, Village Global, Headline, Eric Yuan (CEO of Zoom), and others.
As an ML Ops Engineer at Circadia Health, you will own the infrastructure and operational lifecycle of the machine learning systems that power our clinical monitoring platform. You will build and maintain the production ML pipelines, deployment infrastructure, and monitoring systems that enable Circadia's predictive models to identify early signs of clinical deterioration. Reporting to the Principal ML Engineer, you will work across ML, backend, data, and clinical teams to ensure models are reliably trained, versioned, deployed, and monitored in both cloud and edge environments. You will be a key driver in elevating Circadia's ML practice – from reproducibility and experiment tracking to CI/CD for models and operational observability. This is a high-ownership role at a lean company where production reliability, rapid iteration, and pragmatic engineering are essential.
ML Pipeline Orchestration & Automation
- Own and extend Circadia’s ML pipeline orchestration using Apache Airflow, including training, evaluation, and deployment workflows.
- Build and maintain automated pipelines for model retraining, validation, and promotion across development, staging, and production environments.
- Implement pipeline monitoring, alerting, and failure recovery to eliminate silent failures and ensure operational reliability.
Model Deployment & Serving
- Deploy and manage ML models on AWS infrastructure (e.g. AWS Batch for batch inference workloads).
- Support deployment of models to edge devices, including Circadia’s clinical monitoring hardware, working with firmware and embedded engineering teams as needed.
- Manage model versioning, promotion, and rollback workflows through the MLflow model registry.
- Evaluate and implement strategies for safe model rollouts (e.g. shadow deployments, canary releases) as the platform matures.
- Enable ML engineers to move seamlessly from experimentation to production deployment with minimal friction.
Data & Model Versioning
- Implement and maintain training data versioning and dataset management practices to ensure reproducibility of model training runs.
- Collaborate with ML engineers and data engineers to formalise dataset release and validation workflows.
Monitoring, Observability & Data Quality
- Build monitoring systems for model performance in production, including data drift detection, prediction quality tracking, and alerting on degradation.
- Implement operational dashboards for pipeline health, compute utilisation, and deployment status.
- Collaborate with data engineering to ensure upstream data quality and pipeline reliability for ML feature inputs.
- Develop incident response procedures and runbooks for ML system failures.
- Manage and optimise AWS compute resources (Batch, EC2, or similar) used for model training and inference.
- Design infrastructure-as-code solutions for reproducible ML environments.
- Drive cost optimisation across ML compute, storage, and data transfer.
- Support Snowflake integrations for feature generation and training data pipelines.
Elevating ML Practice
- Introduce and champion ML engineering best practices including CI/CD for models, automated testing for ML pipelines, and reproducible training workflows.
- Build internal tooling and templates that accelerate the ML development-to-production cycle.
- Document operational processes, architecture decisions, and onboarding materials for the ML platform.
- Participate in architecture discussions and technical planning to ensure ML systems scale with Circadia’s growth.
- Ensure all ML pipelines and infrastructure meet healthcare security and privacy requirements, including HIPAA and SOC 2.
- Apply best practices for handling Protected Health Information (PHI) in training data, model artifacts, and inference outputs.
- Maintain audit trails for model decisions, data access, and deployment history.
Qualifications
- 4+ years of experience in MLOps, ML Engineering, DevOps, or a closely related infrastructure role.
- Strong proficiency in Python for ML pipeline development, tooling, and automation.
- Hands-on experience with ML pipeline orchestration tools, particularly Apache Airflow.
- Experience deploying and operating ML workloads on AWS (Batch, EC2, S3, IAM, CloudWatch).
- Solid understanding of the ML lifecycle: training, evaluation, deployment, monitoring, and retraining.
- Familiarity with SQL and data warehousing platforms (Snowflake preferred).
- Experience implementing monitoring, logging, and alerting for production systems.
- Background in healthcare, medical devices, or clinical data systems.
- Experience with CI/CD systems for ML.
- Experience with data versioning tools (e.g., Apache Spark, Dask) for large-scale data processing.
You take ownership of ML infrastructure end-to-end — from training pipelines to production monitoring. You care deeply about reliability, reproducibility, and operational excellence in ML systems. You have strong opinions (loosely held) on how to build a great ML platform, and you’re eager to put them into practice. You communicate clearly across engineering, data science, and clinical teams. You’re motivated by building technology that directly improves patient care. Circadia Health is redefining patient monitoring through contactless sensing and AI-driven clinical insights. As we scale from tens of thousands to hundreds of thousands of monitored patients, our data infrastructure is central to everything we do. Build data systems that power clinical-grade AI and ML.
Machine Learning Engineer (hybrid or remote) employer: Circadia Health
Circadia Health is an exceptional employer, offering a dynamic work environment where innovation meets purpose. As a Machine Learning Engineer, you will play a pivotal role in enhancing healthcare through cutting-edge AI technology, while enjoying a culture that prioritises collaboration, professional growth, and the opportunity to make a meaningful impact on patient care. With a hybrid or remote work model, you can enjoy flexibility while being part of a mission-driven team supported by leading investors in the healthcare sector.
StudySmarter Expert Advice🤫
We think this is how you could land Machine Learning Engineer (hybrid or remote)
✨Tip Number 1
Network like a pro! Reach out to people in the industry, attend meetups, and connect with professionals on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to ML Ops or healthcare AI. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on common ML Ops questions and scenarios. Practice explaining your past projects and how they relate to the role at Circadia Health. Confidence is key!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in being part of the StudySmarter community.
We think you need these skills to ace Machine Learning Engineer (hybrid or remote)
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter for the Machine Learning Engineer role. Highlight your experience with ML pipeline orchestration, AWS, and any relevant healthcare background. We want to see how your skills align with our mission at Circadia Health!
Showcase Your Projects:Include specific examples of projects you've worked on that demonstrate your proficiency in Python, Apache Airflow, and ML lifecycle management. We love seeing real-world applications of your skills, so don’t hold back!
Be Clear and Concise:When writing your application, keep it straightforward and to the point. Use clear language to describe your experiences and achievements. We appreciate clarity, especially when it comes to technical details!
Apply Through Our Website:We encourage you to submit your application directly through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!
How to prepare for a job interview at Circadia Health
✨Know Your ML Basics
Make sure you brush up on your machine learning fundamentals. Understand the entire ML lifecycle, from training to deployment and monitoring. Be ready to discuss how you've applied these concepts in real-world scenarios, especially in healthcare settings.
✨Familiarise with Apache Airflow
Since Circadia Health uses Apache Airflow for ML pipeline orchestration, it’s crucial to have hands-on experience with it. Prepare to talk about your previous projects involving Airflow, including any challenges you faced and how you overcame them.
✨Showcase Your AWS Skills
As the role involves deploying ML models on AWS, be prepared to discuss your experience with AWS services like Batch, EC2, and S3. Highlight any specific projects where you managed ML workloads and how you ensured operational reliability.
✨Communicate Clearly
This role requires collaboration across various teams, so practice articulating your thoughts clearly. Be ready to explain complex technical concepts in a way that non-technical team members can understand, especially when discussing patient care improvements.