At a Glance
- Tasks: Build and operate the core platform for machine learning and scientific AI workloads.
- Company: Join Chemify, a leader in innovative scientific computing.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Why this job: Make a real impact in ML infrastructure and support groundbreaking scientific research.
- Qualifications: Degree in Science or Engineering and experience with distributed systems and cloud infrastructure.
- Other info: Dynamic team environment with a focus on innovation and career advancement.
The predicted salary is between 70000 - 90000 £ per year.
We are hiring a Senior ML Infrastructure Engineer to build, enable and operate the core platform that powers Chemify's machine learning and scientific AI computing workloads. This role sits at the intersection of distributed systems engineering, machine learning infrastructure, scientific computing, and platform engineering. You will build and operate the operational backbone of the ML platform, ensuring that pipelines run reliably across Kubernetes clusters, on‐premise GPU infrastructure, and serverless compute environments. The systems you build will support ML engineers and computational chemists running workloads from large‐scale model training to molecular simulation.
Key Responsibilities
- ML Pipeline Orchestration: implement routing logic dispatching workloads to appropriate compute backends; maintain workflow reliability including retries, dependency management, and failure recovery.
- Linux Administration: Server administration and support including security and scaling.
- Kubernetes Platform Operations: Operate clusters for ML training, inference, and batch workloads; maintain container build pipelines and GitOps deployment workflows; optimise cluster scheduling, autoscaling, and GPU utilisation.
- HPC / GPU Compute Integration: Integrate orchestration systems with HPC job schedulers; maintain execution paths for workloads running on GPU clusters; ensure artifacts and results from HPC jobs are captured and versioned.
- Model & Experiment Lifecycle: Operate model registry and experiment tracking platforms; ensure training runs are reproducible and linked to code and datasets; support promotion of models from staging to production.
- Data Versioning & Pipeline Traceability: Implement dataset versioning and lineage tracking across ML pipelines; ensure predictions are traceable to model versions and datasets; maintain reproducible ML training pipelines.
- Platform Tooling & Developer Experience: Develop platform CLI tools and pipeline templates; maintain base container images used for ML workloads; improve developer workflows for ML engineers and scientists.
- Observability, Security & Governance: Implement monitoring, logging, and alerting across orchestration systems; maintain infrastructure as code for platform resources; ensure workloads are traceable to source code, container images, and execution environments.
What You'll Bring
- Degree in Science, Engineering or related field (or equivalent practical experience).
- Experience operating workflow orchestration platforms.
- Experience with containerisation and CI/CD pipelines.
- Experience with cloud infrastructure such as AWS & GCP.
- Experience operating distributed systems in production.
- Experience in Cyber Security and operating in regulated environments.
Beneficial Skills
- Argo Workflows or Kubernetes workflow engines.
- SLURM or other HPC job schedulers.
- ML experiment tracking tools such as Weights & Biases or MLflow.
- Data versioning or lakehouse technologies such as LakeFS, Iceberg, or Delta Lake.
- Scientific computing environments.
- Internal developer platform or CLI tooling experience.
Senior ML Infrastructure Engineer in Glasgow employer: Chemify Limited
Contact Detail:
Chemify Limited Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior ML Infrastructure Engineer in Glasgow
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to ML infrastructure and Kubernetes. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on relevant technologies and concepts. Practice explaining your past experiences with ML pipelines and distributed systems. We want you to feel confident and ready to impress!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are genuinely interested in joining our team.
We think you need these skills to ace Senior ML Infrastructure Engineer in Glasgow
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that match the Senior ML Infrastructure Engineer role. Highlight your experience with Kubernetes, cloud infrastructure, and any relevant projects you've worked on. We want to see how you can contribute to our team!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about machine learning infrastructure and how your background makes you a great fit for us. Be sure to mention specific technologies or projects that relate to the job description.
Showcase Your Projects: If you've worked on any relevant projects, whether in a professional setting or as personal endeavours, make sure to include them. We love seeing practical applications of your skills, especially those involving ML pipelines, containerisation, or HPC integration.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about what we do at StudySmarter!
How to prepare for a job interview at Chemify Limited
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, like Kubernetes, CI/CD pipelines, and cloud infrastructure. Brush up on your knowledge of ML pipeline orchestration and how to manage workloads across different compute environments.
✨Showcase Your Problem-Solving Skills
Prepare to discuss specific challenges you've faced in previous roles, especially related to distributed systems or ML infrastructure. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your problem-solving abilities.
✨Demonstrate Your Passion for ML and Scientific Computing
Express your enthusiasm for machine learning and scientific AI. Share any personal projects or experiences that showcase your interest and expertise in these areas. This will help you stand out as a candidate who is genuinely invested in the field.
✨Ask Insightful Questions
Prepare thoughtful questions about the company’s ML platform, team dynamics, and future projects. This shows your interest in the role and helps you gauge if the company culture aligns with your values and work style.