At a Glance
- Tasks: Build and operate the core platform for ML and scientific AI workloads.
- Company: Join Chemify, a pioneer in revolutionising chemistry with AI and robotics.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Dynamic team environment with exciting challenges in ML and scientific computing.
- Why this job: Make a real impact on chemical discovery and improve quality of life.
- Qualifications: Degree in Science or Engineering, strong Python and Kubernetes skills.
The predicted salary is between 70000 - 90000 £ per year.
About Chemify
Chemify is revolutionising chemistry. We are creating a future where the synthesis of previously unimaginable molecules, drugs, and materials is instantly accessible. By combining AI, robotics, and the world’s largest continually expanding database of chemical programs, we are accelerating chemical discovery to improve quality of life and extend the reach of humanity.
We are hiring a Senior/Staff ML Infrastructure Engineer to build, enable and operate the core platform that powers Chemify’s machine learning and scientific AI computing workloads. This role sits at the intersection of distributed systems engineering, machine learning infrastructure, scientific computing, and platform engineering. You will build and operate the operational backbone of the ML platform, ensuring that pipelines run reliably across Kubernetes clusters, on-premise GPU infrastructure, and serverless compute environments. The systems you build will support ML engineers and computational chemists running workloads from large-scale model training to molecular simulation. If you enjoy building complex technical systems at the intersection of ML and scientific computing, working on platform problems that combine distributed systems, cloud and on-premise GPU infrastructure, and real-world scientific workloads, you’ll thrive here.
Key Responsibilities
- ML Pipeline Orchestration: implement routing logic dispatching workloads to appropriate compute backends; maintain workflow reliability including retries, dependency management, and failure recovery.
- Linux Administration: Server administration and support including security and scaling.
- Kubernetes Platform Operations: Operate clusters for ML training, inference, and batch workloads; maintain container build pipelines and GitOps deployment workflows; optimise cluster scheduling, autoscaling, and GPU utilisation.
- HPC / GPU Compute Integration: Integrate orchestration systems with HPC job schedulers; maintain execution paths for workloads running on GPU clusters; ensure artifacts and results from HPC jobs are captured and versioned.
- Model & Experiment Lifecycle: Operate model registry and experiment tracking platforms; ensure training runs are reproducible and linked to code and datasets; support promotion of models from staging to production.
- Data Versioning & Pipeline Traceability: Implement dataset versioning and lineage tracking across ML pipelines; ensure predictions are traceable to model versions and datasets; maintain reproducible ML training pipelines.
- Platform Tooling & Developer Experience: Develop platform CLI tools and pipeline templates; maintain base container images used for ML workloads; improve developer workflows for ML engineers and scientists.
- Observability, Security & Governance: Implement monitoring, logging, and alerting across orchestration systems; maintain infrastructure as code for platform resources; ensure workloads are traceable to source code, container images, and execution environments.
What You’ll Bring
- Degree in Science, Engineering or related field (or equivalent practical experience).
- Strong Python engineering skills.
- Experience operating workflow orchestration platforms.
- Strong Kubernetes platform experience.
- Experience with containerisation and CI/CD pipelines.
- Experience with cloud infrastructure such as AWS & GCP.
- Experience operating distributed systems in production.
- Strong Linux systems engineering skills.
Beneficial Skills
- Argo Workflows or Kubernetes workflow engines.
- SLURM or other HPC job schedulers.
- ML experiment tracking tools such as Weights & Biases or MLflow.
- Data versioning or lakehouse technologies such as LakeFS, Iceberg, or Delta Lake.
- Scientific computing environments.
- Internal developer platform or CLI tooling experience.
- Experience in Cyber Security and operating in regulated environments.
Senior/Staff ML Infrastructure Engineer in Glasgow employer: Chemify
At Chemify, we pride ourselves on being an innovative employer that fosters a collaborative and dynamic work culture. Our commitment to employee growth is evident through our focus on cutting-edge technology and the opportunity to work at the forefront of scientific discovery. Located in a vibrant area, we offer competitive benefits and a supportive environment where your contributions directly impact the future of chemistry and improve quality of life.
StudySmarter Expert Advice🤫
We think this is how you could land Senior/Staff ML Infrastructure Engineer in Glasgow
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with Chemify employees on LinkedIn. A friendly chat can sometimes lead to opportunities that aren’t even advertised!
✨Tip Number 2
Show off your skills! If you’ve got a GitHub or portfolio showcasing your projects, make sure to share it during interviews. It’s a great way to demonstrate your expertise in ML infrastructure and distributed systems.
✨Tip Number 3
Prepare for technical interviews by brushing up on your Python and Kubernetes knowledge. Practice common ML infrastructure problems and be ready to discuss how you’d tackle real-world challenges at Chemify.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are genuinely interested in joining our mission!
We think you need these skills to ace Senior/Staff ML Infrastructure Engineer in Glasgow
Some tips for your application 🫡
Tailor Your CV:Make sure your CV reflects the skills and experiences that match the job description. Highlight your Python engineering skills, Kubernetes experience, and any relevant projects you've worked on. We want to see how you can contribute to our mission!
Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Share your passion for ML infrastructure and scientific computing. Tell us why you're excited about Chemify and how your background aligns with our goals. Keep it engaging and personal!
Showcase Relevant Projects:If you've worked on any projects related to ML pipelines, distributed systems, or cloud infrastructure, make sure to mention them. We love seeing real-world applications of your skills, so don’t hold back on the details!
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows us you’re keen to join the Chemify team!
How to prepare for a job interview at Chemify
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, especially Kubernetes, Python, and cloud infrastructure like AWS or GCP. Brush up on your experience with workflow orchestration platforms and be ready to discuss specific projects where you've implemented these technologies.
✨Showcase Problem-Solving Skills
Prepare to discuss complex technical problems you've solved in the past, particularly those related to distributed systems and ML infrastructure. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your thought process.
✨Demonstrate Collaboration
Since this role involves working closely with ML engineers and computational chemists, be ready to talk about how you’ve collaborated with cross-functional teams. Share examples of how you’ve improved workflows or developer experiences in previous roles.
✨Ask Insightful Questions
Prepare thoughtful questions that show your interest in Chemify’s mission and the role. Inquire about their current challenges in ML pipeline orchestration or how they envision the future of their platform. This not only shows your enthusiasm but also helps you gauge if the company is the right fit for you.