Site Reliability Engineer (DataCosmos) in Harwell

Harwell Full-Time 50000 - 70000 £ / year (est.) Home office (partial)
PassFort

At a Glance

  • Tasks: Ensure our data platform is reliable, scalable, and performing at its best.
  • Company: Join Open Cosmos, a mission-driven company making space accessible.
  • Benefits: Work with cutting-edge technology and a supportive, diverse team.
  • Other info: Flexible location across Europe with excellent career growth opportunities.
  • Why this job: Make a real-world impact by transforming satellite data into insights.
  • Qualifications: Experience with Linux systems, cloud platforms, and Kubernetes.

The predicted salary is between £50,000 and £70,000 per year.

Aim high, go beyond! At Open Cosmos we are solving the world’s biggest challenges from space, giving businesses, governments and researchers access to more readily available information than ever before. Ready for the challenge? Then read on…

Working in our Data Division

At Open Cosmos, our Data division transforms satellite data into meaningful insights that drive real-world impact. The team delivers all data products generated by Open Cosmos and its partners, curates and develops DataCosmos (our geospatial data platform) and builds integrations that make satellite imagery easy to access and act on. We’re now looking for a Site Reliability Engineer to help us ensure our data platform is reliable, scalable, and performing at its best as we grow.

What will you be doing?

  • Owning the reliability, performance, and scalability of our data platform and processing pipelines
  • Monitoring systems end-to-end, ensuring full visibility across infrastructure and data flows
  • Responding to incidents, troubleshooting issues, and driving long-term fixes
  • Improving deployments and contributing to CI/CD pipelines for safe, repeatable releases
  • Working closely with engineering teams to design resilient, scalable systems
  • Automating processes and reducing operational overhead
  • Supporting customer-impacting issues alongside Customer Success teams

What you’ll bring

  • Strong demonstrable ability to work with Linux systems and cloud platforms (AWS, GCP or Azure)
  • Solid Kubernetes knowledge and ability to run production systems
  • A clear understanding of observability (monitoring, logging, tracing)
  • The ability to design or operate high-availability, distributed systems
  • A mindset focused on automation, scalability, and continuous improvement
  • Confidence working in fast-moving environments where reliability really matters

This role can be based in any of our European locations. To apply, you must have the legal right to work in your chosen location.

Why Open Cosmos?

  • Work at the cutting edge of space technology with customers around the globe.
  • A mission-driven company making space accessible to help solve real-world challenges.
  • A diverse, ambitious, and supportive team.

Site Reliability Engineer (DataCosmos) in Harwell employer: PassFort

Open Cosmos is an exceptional employer, offering a unique opportunity to work at the forefront of space technology while contributing to meaningful solutions for global challenges. With a diverse and ambitious team, we foster a supportive work culture that prioritises employee growth and innovation, ensuring that our Site Reliability Engineers thrive in a dynamic environment. Join us in making space accessible and experience the benefits of working in a mission-driven company that values collaboration and continuous improvement.

PassFort

Contact Details:

PassFort Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land the Site Reliability Engineer (DataCosmos) role in Harwell

Tip Number 1

Network like a pro! Reach out to current or former employees at Open Cosmos on LinkedIn. A friendly chat can give you insider info and maybe even a referral, which can really boost your chances.

Tip Number 2

Show off your skills! If you’ve got a GitHub or personal project that showcases your work with Linux systems or Kubernetes, make sure to highlight it during interviews. It’s a great way to demonstrate your hands-on experience.

Tip Number 3

Prepare for the technical grill! Brush up on your knowledge of observability and high-availability systems. Be ready to discuss how you’ve tackled reliability issues in the past – real examples will make you stand out.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Open Cosmos team.

We think you need these skills to ace the Site Reliability Engineer (DataCosmos) role in Harwell

Linux Systems
Cloud Platforms (AWS, GCP, Azure)
Kubernetes
Observability (Monitoring, Logging, Tracing)
High-Availability Systems Design
Distributed Systems Operation
Automation

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that match the Site Reliability Engineer role. Highlight your experience with Linux systems, cloud platforms, and Kubernetes to show us you’re the right fit!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for space technology and how your background aligns with our mission at Open Cosmos. Let us know why you want to be part of our team!

Showcase Your Problem-Solving Skills: In your application, give examples of how you’ve tackled challenges in previous roles. We love seeing candidates who can troubleshoot issues and drive long-term fixes, so don’t hold back!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates from our team!

How to prepare for a job interview at PassFort

Know Your Tech Inside Out

Make sure you brush up on your Linux systems and cloud platforms like AWS, GCP, or Azure. Be ready to discuss your experience with Kubernetes and how you've managed production systems in the past. This will show that you're not just familiar with the tech, but that you can really own it.

Demonstrate Your Problem-Solving Skills

Prepare to share specific examples of incidents you've handled in the past. Talk about how you monitored systems, troubleshot issues, and implemented long-term fixes. This will highlight your ability to respond effectively in high-pressure situations, which is crucial for a Site Reliability Engineer.

Showcase Your Automation Mindset

Discuss any automation processes you've implemented to reduce operational overhead. Highlight your experience with CI/CD pipelines and how you've contributed to safe, repeatable releases. This will demonstrate your focus on scalability and continuous improvement, which aligns perfectly with what Open Cosmos is looking for.

Be Ready for a Fast-Paced Environment

Reliability matters most in fast-moving environments, so be prepared to talk about how you’ve thrived in them. Share examples of how you’ve worked closely with engineering teams to design resilient systems and how you’ve supported customer-impacting issues. This will show that you can adapt and excel under pressure.