AI Infrastructure Engineer (Storage) in Cambridge

Cambridge | Full-Time | £60,000 to £80,000 per year (est.) | No working from home possible
CommonAI C.I.C.

At a Glance

  • Tasks: Design and maintain high-performance storage systems for AI and data workloads.
  • Company: Join CommonAI CIC, a non-profit focused on collaborative AI development.
  • Benefits: Competitive salary, pension, professional development, and networking opportunities.
  • Other info: Enjoy a vibrant office near Cambridge train station and a supportive work culture.
  • Why this job: Make a real impact in a growing organisation while working with cutting-edge technology.
  • Qualifications: Strong Linux skills and experience with distributed storage solutions required.

The predicted salary is between £60,000 and £80,000 per year.

CommonAI CIC is a non-profit membership organisation founded on a belief in collaborative engineering for the safe and responsible development of foundational AI technologies. It is a place where AI start-ups, enterprises large and small, public sector bodies and academia can share resources and knowledge to co-develop and grow businesses quickly. We support technology-focused start-ups, each with unique data management challenges, and are seeking an experienced Infrastructure Engineer to help them design, deploy and maintain high-performance storage systems for their AI and data-driven workloads.

The successful candidate will combine deep experience architecting and managing distributed, cloud, and tiered storage solutions with strong Linux and automation skills. In this role you will:

  • Design, implement, and maintain storage platforms that support large-scale AI and data pipelines.
  • Manage distributed storage systems such as Ceph, Lustre, or BeeGFS.
  • Oversee tiered storage architectures, optimising data movement across high-performance, object, and archival tiers.
  • Ensure data integrity, availability, and security across on-premises and cloud environments.
  • Develop automation and monitoring tools using Bash, Python, or similar scripting languages.
  • Manage and secure container images and related storage used for AI and ML workloads.
  • Integrate storage systems with public cloud services (AWS, Azure, GCP) and hybrid environments.
  • Troubleshoot complex storage and data flow issues, collaborating closely with AI platform and infrastructure teams.
  • Contribute to ongoing architecture improvements, performance tuning, and capacity planning.

To be considered, candidates should meet most of the following requirements:

  • Strong Linux system administration background.
  • Proven experience installing, configuring, and maintaining Ceph clusters or similar technologies in a production environment.
  • Familiarity with distributed filesystems (e.g. Lustre, BeeGFS) and cloud-based storage services (e.g. Amazon S3 or EBS).
  • Experience with tiered storage management and data lifecycle policies.
  • Scripting and automation proficiency (e.g. Bash, Python, Terraform/OpenTofu, Ansible).
  • Understanding of data security best practices and compliance considerations.
  • Experience working with container technologies (e.g. Docker, Kubernetes) and image storage registries.
  • Strong analytical, troubleshooting, communication and documentation skills.

We also value:

  • Knowledge of GPU compute environments or AI training infrastructure.
  • Experience with monitoring and observability tools (Prometheus, Grafana, etc.).
  • Contributions to open-source storage, data management, or infrastructure projects.
  • Familiarity with object storage systems (S3, RADOS Gateway, MinIO, etc.).

We offer:

  • A collaborative and supportive work environment.
  • The opportunity to have a high impact in a growing organisation.
  • Competitive salary package and pension.
  • Professional development opportunities.
  • Networking opportunities with influential people from across the tech sector and academia.
  • A vibrant office environment located a few minutes' walk from Cambridge train station.

CommonAI CIC is an equal opportunity employer and is committed to creating an inclusive and diverse workplace.

Employer: CommonAI C.I.C.

CommonAI CIC is an exceptional employer, offering a collaborative and supportive work environment where you can make a significant impact in the rapidly evolving field of AI technology. Located just minutes from Cambridge train station, we provide competitive salaries, professional development opportunities, and networking with influential figures in tech and academia, all while fostering an inclusive and diverse workplace culture.

Contact Details:

CommonAI C.I.C. Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land the AI Infrastructure Engineer (Storage) role in Cambridge

Tip Number 1

Network like a pro! Reach out to folks in the AI and tech community, attend meetups, and join online forums. The more connections you make, the better your chances of hearing about job openings before they even hit the market.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to storage systems and automation. This gives potential employers a taste of what you can do and sets you apart from the crowd.

Tip Number 3

Prepare for interviews by brushing up on common technical questions related to distributed storage and cloud services. Practice explaining your past experiences clearly and confidently, as communication is key in collaborative environments like CommonAI.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of our mission at CommonAI.

We think you need these skills to ace the AI Infrastructure Engineer (Storage) role in Cambridge

Linux System Administration
Ceph Clusters Management
Distributed Filesystems (Lustre, BeeGFS)
Cloud-Based Storage Services (AWS, Azure, GCP)
Tiered Storage Management
Scripting and Automation (Bash, Python, Terraform/OpenTofu, Ansible)
Data Security Best Practices

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the AI Infrastructure Engineer role. Highlight your experience with distributed storage systems and Linux administration, as these are key for us. Use specific examples that showcase your skills in managing cloud environments and automation.

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for AI and data management, and explain why you want to join CommonAI CIC. Be sure to mention any relevant projects or experiences that align with our mission of collaborative engineering.

Showcase Your Technical Skills: Don’t hold back on showcasing your technical skills! Mention your proficiency in scripting languages like Bash and Python, and your experience with tools like Ceph or Kubernetes. We love seeing how you’ve used these skills to solve real-world problems.

Apply Through Our Website: We encourage you to apply through our website for a smoother application process. It helps us keep track of your application and ensures you don’t miss out on any important updates. Plus, it’s super easy!

How to prepare for a job interview at CommonAI C.I.C.

Know Your Storage Solutions

Make sure you brush up on your knowledge of distributed storage systems like Ceph, Lustre, and BeeGFS. Be ready to discuss your hands-on experience with these technologies, as well as any challenges you've faced and how you overcame them.

Show Off Your Scripting Skills

Since automation is key in this role, prepare to talk about your proficiency in scripting languages like Bash and Python. Have examples ready that demonstrate how you've used these skills to streamline processes or solve complex problems.

Understand Data Security

Data integrity and security are crucial for the role. Familiarise yourself with best practices and compliance considerations related to data management. Be prepared to discuss how you've implemented security measures in past projects.

Collaborate and Communicate

This position requires close collaboration with various teams. Think of examples where you've successfully worked with others to troubleshoot issues or improve systems. Highlight your communication skills and how they’ve helped in team settings.