At a Glance
- Tasks: Design and maintain high-performance storage systems for AI and data workloads.
- Company: Join a non-profit focused on collaborative engineering in AI technology.
- Benefits: Competitive salary, pension, professional development, and networking opportunities.
- Other info: Vibrant office near Cambridge station with a supportive and inclusive culture.
- Why this job: Make a real impact in a growing organisation while working with cutting-edge technology.
- Qualifications: Strong Linux skills and experience with distributed storage solutions required.
The predicted salary is between £50,000 and £65,000 per year.
CommonAI CIC is a non-profit membership organisation, founded on a belief in collaborative engineering for the safe and responsible development of foundational AI technologies. It is a place where AI startups, enterprises large and small, public sector bodies and academia can share resources and knowledge to co-develop and grow businesses fast.
We support technology-focused start-ups, each with unique data management challenges, and are seeking an experienced Infrastructure Engineer to help them design, deploy and maintain high-performance storage systems for their AI and data-driven workloads. The successful candidate will combine deep experience architecting and managing distributed, cloud, and tiered storage solutions with strong Linux and automation skills.
In this role you will:
- Design, implement, and maintain storage platforms that support large-scale AI and data pipelines
- Manage distributed storage systems such as Ceph, Lustre, or BeeGFS
- Oversee tiered storage architectures, optimising data movement across high-performance, object, and archival tiers
- Ensure data integrity, availability, and security across on-premises and cloud environments
- Develop automation and monitoring tools using Bash, Python, or similar scripting languages
- Manage and secure container images and related storage used for AI and ML workloads
- Integrate storage systems with public cloud services (AWS, Azure, GCP) and hybrid environments
- Troubleshoot complex storage and data flow issues, collaborating closely with AI platform and infrastructure teams
- Contribute to ongoing architecture improvements, performance tuning, and capacity planning
Requirements
To be considered, candidates should meet most of the following requirements:
- Strong Linux system administration background
- Proven experience installing, configuring, and maintaining Ceph clusters or similar technologies in a production environment
- Familiarity with distributed filesystems (e.g. Lustre, BeeGFS) and cloud-based storage services (e.g. EC2)
- Experience with tiered storage management and lifecycle data policies
- Scripting and automation proficiency (e.g. Bash, Python, Terraform/OpenTofu, Ansible)
- Understanding of data security best practices and compliance considerations
- Experience working with container technologies (e.g. Docker, Kubernetes) and image storage registries
- Strong analytical, troubleshooting, communication and documentation skills
We also value:
- Knowledge of GPU compute environments or AI training infrastructure
- Experience with monitoring and observability tools (Prometheus, Grafana, etc.)
- Contributions to open-source storage, data management, or infrastructure projects
- Familiarity with object storage systems (S3, RADOS Gateway, MinIO, etc.)
Benefits
- A collaborative and supportive work environment
- The opportunity to have a high impact in a growing organisation
- Competitive salary package and pension
- Professional development opportunities
- Networking opportunities with influential people from across the tech sector and academia
- A vibrant office environment a few minutes' walk from Cambridge train station
CommonAI CIC is an equal opportunity employer and is committed to creating an inclusive and diverse workplace.
AI Infrastructure Engineer (Storage) in Cambridge employer: CommonAI CIC
CommonAI CIC is an exceptional employer, offering a collaborative and supportive work environment where you can make a significant impact in the rapidly evolving field of AI technology. Located just minutes from Cambridge train station, our vibrant office fosters professional development and networking opportunities with influential figures in tech and academia, ensuring that you grow alongside your peers in a diverse and inclusive setting.
StudySmarter Expert Advice🤫
We think this is how you could land the AI Infrastructure Engineer (Storage) role in Cambridge
✨Tip Number 1
Network like a pro! Reach out to people in the AI and tech community, especially those who work at CommonAI CIC. Attend meetups or webinars, and don’t be shy about asking for informational interviews. You never know who might have the inside scoop on job openings!
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects related to storage systems, automation, or anything relevant to the role. This gives potential employers a tangible look at what you can do, making you stand out from the crowd.
✨Tip Number 3
Prepare for technical interviews by brushing up on your Linux and scripting skills. Practice common scenarios you might face as an AI Infrastructure Engineer, like troubleshooting storage issues or optimising data movement. The more prepared you are, the more confident you'll feel!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of our collaborative environment at CommonAI CIC.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the AI Infrastructure Engineer role. Highlight your experience with distributed storage systems and Linux skills, as these are key for us. Use specific examples that showcase your expertise in managing cloud environments and automation.
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for AI and data management, and explain why you want to join our team at CommonAI CIC. Be sure to mention any relevant projects or experiences that align with our mission of collaborative engineering.
Showcase Your Technical Skills: Don’t hold back on showcasing your technical skills! Mention your proficiency in scripting languages like Bash and Python, and your experience with tools like Ceph or Kubernetes. We love seeing candidates who can demonstrate their hands-on experience with the technologies we use.
Apply Through Our Website: We encourage you to apply through our website for a smoother application process. It helps us keep track of your application and ensures you don’t miss out on any important updates. Plus, it’s super easy to do!
How to prepare for a job interview at CommonAI CIC
✨Know Your Storage Solutions
Make sure you brush up on your knowledge of distributed storage systems like Ceph, Lustre, and BeeGFS. Be ready to discuss your hands-on experience with these technologies, as well as how you've tackled challenges in managing them in a production environment.
✨Show Off Your Scripting Skills
Since automation is key for this role, prepare to demonstrate your proficiency in scripting languages like Bash and Python. Think of specific examples where you've developed automation tools or scripts that improved efficiency or solved complex problems.
✨Understand Data Security
Data integrity and security are crucial in this position. Familiarise yourself with best practices and compliance considerations related to data management. Be prepared to discuss how you've ensured data security in previous roles, especially in cloud environments.
✨Collaborate and Communicate
This role involves working closely with AI platform and infrastructure teams. Highlight your communication skills and any experiences where collaboration led to successful project outcomes. Share examples of how you've troubleshot issues as part of a team.