Site Reliability Engineer (DataCosmos)

Site Reliability Engineer (DataCosmos)

Full-Time 50000 - 70000 £ / year (est.) Home office (partial)
PassFort

At a Glance

  • Tasks: Ensure our data platform is reliable, scalable, and performing at its best.
  • Company: Join Open Cosmos, a mission-driven company making space accessible.
  • Benefits: Work with cutting-edge technology and a supportive, diverse team.
  • Other info: Flexible location across Europe with excellent career growth opportunities.
  • Why this job: Make a real-world impact by transforming satellite data into insights.
  • Qualifications: Experience with Linux systems, cloud platforms, and Kubernetes.

The predicted salary is between 50000 - 70000 £ per year.

Aim high, go beyond! At Open Cosmos we are solving the world’s biggest challenges from space, providing businesses, governments and researchers access to more readily available information than ever before - ready for the challenge? Then read on…

Working in our Data Division At Open Cosmos, our Data division transforms satellite data into meaningful insights that drive real-world impact. The team delivers all data products generated by Open Cosmos and its partners, curates and develops DataCosmos (our geospatial data platform) and builds integrations that make satellite imagery easy to access and act on. We’re now looking for a Site Reliability Engineer to help us ensure our data platform is reliable, scalable, and performing at its best as we grow.

What will you be doing?

  • Owning the reliability, performance, and scalability of our data platform and processing pipelines
  • Monitoring systems end-to-end, ensuring full visibility across infrastructure and data flows
  • Responding to incidents, troubleshooting issues, and driving long-term fixes
  • Improving deployments and contributing to CI/CD pipelines for safe, repeatable releases
  • Working closely with engineering teams to design resilient, scalable systems
  • Automating processes and reducing operational overhead
  • Supporting customer-impacting issues alongside Customer Success teams

What You’ll bring

  • Strong demonstrable ability to work with Linux systems and cloud platforms (AWS, GCP or Azure)
  • Solid Kubernetes knowledge and ability to run production systems
  • A clear understanding of observability (monitoring, logging, tracing)
  • Capable of designing or operating high-availability, distributed systems
  • A mindset focused on automation, scalability, and continuous improvement
  • Confidence working in fast-moving environments where reliability really matters

This role can be based in any of our European Locations. To apply, you must have the legal right to work in our chosen location.

Why Open Cosmos?

  • Work at the cutting edge of space technology with customers around the globe.
  • A mission-driven company making space accessible to help solve real-world challenges.
  • A diverse, ambitious, and supportive team.

Site Reliability Engineer (DataCosmos) employer: PassFort

Open Cosmos is an exceptional employer for those looking to make a meaningful impact in the field of space technology. With a mission-driven culture that prioritises innovation and collaboration, employees benefit from a supportive environment that fosters professional growth and development. Working in one of our European locations offers the unique advantage of being at the forefront of transforming satellite data into actionable insights, all while being part of a diverse and ambitious team dedicated to solving real-world challenges.

PassFort

Contact Details:

PassFort Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Site Reliability Engineer (DataCosmos)

Tip Number 1

Network like a pro! Reach out to current or former employees at Open Cosmos on LinkedIn. A friendly chat can give you insider info and might just get your foot in the door.

Tip Number 2

Show off your skills! If you’ve got a GitHub or personal project that showcases your work with Linux systems, Kubernetes, or cloud platforms, make sure to highlight it during interviews. It’s a great way to demonstrate your hands-on experience.

Tip Number 3

Prepare for technical questions! Brush up on your knowledge of observability and high-availability systems. Practising common SRE scenarios can help you feel more confident when discussing your problem-solving approach.

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Open Cosmos team.

We think you need these skills to ace Site Reliability Engineer (DataCosmos)

Linux Systems
Cloud Platforms (AWS, GCP, Azure)
Kubernetes
Observability (Monitoring, Logging, Tracing)
High-Availability Systems
Distributed Systems
Automation

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that match the Site Reliability Engineer role. Highlight your experience with Linux systems, cloud platforms, and Kubernetes to show us you’re the right fit!

Craft a Compelling Cover Letter:Use your cover letter to tell us why you're passionate about space technology and how your background aligns with our mission at Open Cosmos. Be genuine and let your personality shine through!

Showcase Your Problem-Solving Skills:In your application, give examples of how you've tackled challenges in previous roles. We want to see your troubleshooting skills and how you’ve contributed to improving system reliability.

Apply Through Our Website:We encourage you to apply directly through our website for the best chance of getting noticed. It’s the easiest way for us to keep track of your application and get back to you quickly!

How to prepare for a job interview at PassFort

Know Your Tech Inside Out

Make sure you brush up on your Linux systems and cloud platforms like AWS, GCP, or Azure. Be ready to discuss your experience with Kubernetes and how you've managed production systems in the past. This will show that you're not just familiar with the tech, but that you can really own it.

Demonstrate Your Problem-Solving Skills

Prepare to share specific examples of how you've responded to incidents or troubleshot issues in previous roles. Highlight your approach to driving long-term fixes and improving deployments. This will give them confidence in your ability to handle real-world challenges.

Showcase Your Automation Mindset

Talk about any processes you've automated in the past and how that has reduced operational overhead. They’re looking for someone who focuses on scalability and continuous improvement, so be ready to discuss your strategies and successes in this area.

Be Ready for a Fast-Paced Environment

Since reliability is key, prepare to discuss how you've thrived in fast-moving environments. Share examples of how you've maintained high availability and designed resilient systems under pressure. This will demonstrate that you can keep cool when things get hectic.