At a Glance
- Tasks: Build and operate the core platform for ML and scientific AI workloads.
- Company: Join Chemify, a revolutionary company transforming chemistry with AI and robotics.
- Benefits: Competitive salary, flexible working, and opportunities for professional growth.
- Other info: Dynamic team environment with exciting challenges in scientific computing.
- Why this job: Make a real impact in science by developing cutting-edge ML infrastructure.
- Qualifications: Degree in Science or Engineering, strong Python and Kubernetes skills.
The predicted salary is between 60000 - 80000 £ per year.
This job is brought to you by Jobs/Redefined, the UK's leading over-50s age inclusive jobs board.
About Chemify
Chemify is revolutionising chemistry. We are creating a future where the synthesis of previously unimaginable molecules, drugs, and materials is instantly accessible. By combining AI, robotics, and the world's largest continually expanding database of chemical programs, we are accelerating chemical discovery to improve quality of life and extend the reach of humanity.
We are hiring a Senior ML Infrastructure Engineer to build, enable and operate the core platform that powers Chemify's machine learning and scientific AI computing workloads. This role sits at the intersection of distributed systems engineering, machine learning infrastructure, scientific computing, and platform engineering.
You will build and operate the operational backbone of the ML platform, ensuring that pipelines run reliably across Kubernetes clusters, on-premise GPU infrastructure, and serverless compute environments. The systems you build will support ML engineers and computational chemists running workloads from large-scale model training to molecular simulation.
If you enjoy building complex technical systems at the intersection of ML and scientific computing, working on platform problems that combine distributed systems, cloud and on-premise GPU infrastructure, and real-world scientific workloads, you'll thrive here.
Key Responsibilities
- ML Pipeline Orchestration: implement routing logic dispatching workloads to appropriate compute backends; maintain workflow reliability including retries, dependency management, and failure recovery.
- Linux Administration: Server administration and support including security and scaling.
- Kubernetes Platform Operations: Operate clusters for ML training, inference, and batch workloads; maintain container build pipelines and GitOps deployment workflows; optimise cluster scheduling, autoscaling, and GPU utilisation.
- HPC / GPU Compute Integration: Integrate orchestration systems with HPC job schedulers; maintain execution paths for workloads running on GPU clusters; ensure artifacts and results from HPC jobs are captured and versioned.
- Model & Experiment Lifecycle: Operate model registry and experiment tracking platforms; ensure training runs are reproducible and linked to code and datasets; support promotion of models from staging to production.
- Data Versioning & Pipeline Traceability: Implement dataset versioning and lineage tracking across ML pipelines; ensure predictions are traceable to model versions and datasets; maintain reproducible ML training pipelines.
- Platform Tooling & Developer Experience: Develop platform CLI tools and pipeline templates; maintain base container images used for ML workloads; improve developer workflows for ML engineers and scientists.
- Observability, Security & Governance: Implement monitoring, logging, and alerting across orchestration systems; maintain infrastructure as code for platform resources; ensure workloads are traceable to source code, container images, and execution environments.
What You'll Bring
- Degree in Science, Engineering or related field (or equivalent practical experience).
- Strong Python engineering skills.
- Experience operating workflow orchestration platforms.
- Strong Kubernetes platform experience.
- Experience with containerisation and CI/CD pipelines.
- Experience with cloud infrastructure such as AWS & GCP.
- Experience operating distributed systems in production.
- Strong Linux systems engineering skills.
Beneficial Skills
- Argo Workflows or Kubernetes workflow engines.
- SLURM or other HPC job schedulers.
- ML experiment tracking tools such as Weights & Biases or MLflow.
- Data versioning or lakehouse technologies such as LakeFS, Iceberg, or Delta Lake.
- Scientific computing environments.
- Internal developer platform or CLI tooling experience.
- Experience in Cyber Security and operating in regulated environments.
Senior ML Infrastructure Engineer | Kubernetes & GPU Compute in Glasgow employer: Chemify Ltd
At Chemify, we pride ourselves on being an innovative employer that fosters a collaborative and inclusive work culture, particularly for those passionate about advancing the field of chemistry through technology. Our team enjoys a range of benefits including flexible working arrangements, opportunities for professional development, and the chance to work with cutting-edge AI and machine learning technologies in a vibrant location. Join us to be part of a mission-driven company where your contributions directly impact the future of scientific discovery and improve quality of life.
StudySmarter Expert Advice🤫
We think this is how you could land Senior ML Infrastructure Engineer | Kubernetes & GPU Compute in Glasgow
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to ML infrastructure and Kubernetes. This gives potential employers a taste of what you can do.
✨Tip Number 3
Prepare for technical interviews by brushing up on your Python and Kubernetes knowledge. Practice common ML infrastructure scenarios and be ready to discuss how you've tackled similar challenges in the past.
✨Tip Number 4
Apply through our website! We make it easy for you to find roles that match your skills. Plus, it shows you're genuinely interested in joining our team at Chemify.
We think you need these skills to ace Senior ML Infrastructure Engineer | Kubernetes & GPU Compute in Glasgow
Some tips for your application 🫡
Tailor Your CV:Make sure your CV reflects the skills and experiences that match the job description. Highlight your experience with Kubernetes, GPU compute, and any relevant ML infrastructure projects you've worked on. We want to see how you fit into our vision!
Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to tell us why you're passionate about ML infrastructure and how your background makes you the perfect fit for Chemify. Be genuine and let your personality come through!
Showcase Your Projects:If you've worked on any relevant projects, whether in a professional or personal capacity, make sure to mention them. We love seeing real-world applications of your skills, especially those involving distributed systems and scientific computing.
Apply Through Our Website:We encourage you to apply directly through our website for a smoother application process. It helps us keep track of your application and ensures you don’t miss out on any important updates from us!
How to prepare for a job interview at Chemify Ltd
✨Know Your Tech Inside Out
Make sure you’re well-versed in Kubernetes, Python, and the specific tools mentioned in the job description. Brush up on your experience with CI/CD pipelines and cloud infrastructure like AWS and GCP. Being able to discuss your hands-on experience with these technologies will show that you're not just familiar but truly capable.
✨Demonstrate Problem-Solving Skills
Prepare to discuss specific challenges you've faced in previous roles, especially related to ML pipeline orchestration or distributed systems. Use the STAR method (Situation, Task, Action, Result) to structure your answers, showcasing how you tackled complex problems and what the outcomes were.
✨Show Enthusiasm for Scientific Computing
Chemify is all about revolutionising chemistry, so express your passion for scientific computing and how it intersects with machine learning. Share any relevant projects or experiences that highlight your interest in this field, and be ready to discuss how you can contribute to their mission.
✨Ask Insightful Questions
Prepare thoughtful questions that demonstrate your understanding of the role and the company’s goals. Inquire about their current challenges with ML infrastructure or how they envision the future of their platform. This shows that you’re not only interested in the position but also invested in the company’s success.