Software Engineer - Site Reliability Engineering

Software Engineer - Site Reliability Engineering

Full-Time 60000 - 80000 € / year (est.) No home office possible
Albert Invent

At a Glance

  • Tasks: Automate systems for reliability and troubleshoot large-scale cloud environments.
  • Company: Join Neo4j, a leader in database technology with a global impact.
  • Benefits: Enjoy competitive pay, flexible work options, and opportunities for growth.
  • Other info: Collaborative culture focused on learning and continuous improvement.
  • Why this job: Make a real difference in reliability engineering while working with cutting-edge tech.
  • Qualifications: Experience in backend tools, automation, and SRE practices is essential.

The predicted salary is between 60000 - 80000 € per year.

Neo4j's Site Reliability Engineering team’s mission is to improve the reliability of Neo4j’s DBaaS product: Neo4j Aura. Operating at a global scale across all three major cloud providers, Aura runs hundreds of Kubernetes clusters and hosts thousands of Neo4j instances in production at any given time.

The Role

  • Automate for insight and scale: Build systems that make troubleshooting fast, safe, and scalable across thousands of Neo4j instances. From internal tools that surface clear insights to canaries that support safe rollouts, you'll focus on automation that elevates reliability engineering.
  • Treat operations as a software problem: Replace tribal knowledge and ad-hoc scripts with tools and systems that codify best practices - making operations predictable, scalable, and repeatable.
  • Design for resilience, learn from failure: Own and evolve the tooling and processes behind incident response. From clear alerts to blameless reviews, you'll help ensure teams respond with confidence and learn with clarity.
  • Champion reliability as a product feature: Help teams define and act on SLIs and SLOs, turning reliability into a shared, data-driven priority across engineering.
  • Create signals, not noise: Shape an observability stack that tells us what matters, when it matters - so we can detect issues early and resolve them quickly.

We're interested in hearing from Engineers with deep experience in some of the following areas:

  • Writing backend tools and automation in Go - our primary language - with an emphasis on sound architecture, testing, and maintainability. Strong software skills in other languages, like Python, are also welcome.
  • Applying SRE practices in real-world environments: defining SLIs and SLOs, reducing toil through automation, and driving reliability through engineering.
  • Collaborating with other teams to promote SRE thinking - educating on principles like observability, ownership, and service level objectives.
  • Troubleshooting large-scale, cloud-based systems with confidence and curiosity.
  • Monitoring distributed systems and understanding their performance characteristics.
  • Designing systems with reliability, safety, and debugability as first-class concerns.
  • Working with observability tools like OTel Collector, Prometheus, Grafana, and Google Cloud's operations suite.
  • Deploying and managing applications on Kubernetes; cluster-level administration is a plus.
  • Managing infrastructure with Kustomize and Terraform - keeping it clear, modular, and easy to evolve.
  • Building and maintaining CI/CD workflows - ours run on GitHub Actions.
  • Participating in on-call rotations and incident response with a focus on improvement, not blame.
  • Writing and contributing to postmortems that lead to meaningful, lasting changes.

Software Engineer - Site Reliability Engineering employer: Albert Invent

At Neo4j, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration. Our commitment to employee growth is evident through continuous learning opportunities and a focus on automating for insight and scale, ensuring that our engineers can thrive in a supportive environment. Located at the forefront of technology, our team enjoys the unique advantage of working with cutting-edge tools and practices in a global setting, making a meaningful impact on the reliability of our DBaaS product, Neo4j Aura.

Albert Invent

Contact Detail:

Albert Invent Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Software Engineer - Site Reliability Engineering

Tip Number 1

Network like a pro! Reach out to current employees at Neo4j or in the SRE field on LinkedIn. A friendly chat can give you insider info and might even lead to a referral, which is always a bonus!

Tip Number 2

Show off your skills! If you’ve got a GitHub or personal project that showcases your backend tools and automation skills, make sure to highlight it during interviews. It’s a great way to demonstrate your expertise in Go and other languages.

Tip Number 3

Prepare for technical interviews by brushing up on SRE practices. Be ready to discuss how you've defined SLIs and SLOs in past roles, and think of examples where you’ve reduced toil through automation. We love hearing about real-world applications!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Neo4j team!

We think you need these skills to ace Software Engineer - Site Reliability Engineering

Go
Python
Site Reliability Engineering (SRE) practices
Defining SLIs and SLOs
Automation
Observability
Troubleshooting large-scale cloud-based systems

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter for the Software Engineer role. Highlight your experience with Go, Kubernetes, and any SRE practices you've applied. We want to see how your skills align with our mission at Neo4j!

Showcase Your Projects:Include any relevant projects or contributions that demonstrate your ability to automate processes and improve reliability. If you've worked with observability tools or CI/CD workflows, let us know! We love seeing practical examples of your work.

Be Clear and Concise:When writing your application, keep it straightforward and to the point. Use clear language to describe your experiences and achievements. We appreciate a well-structured application that makes it easy for us to see your potential.

Apply Through Our Website:Don't forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. We can’t wait to hear from you!

How to prepare for a job interview at Albert Invent

Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, especially Go, Kubernetes, and observability tools like Prometheus and Grafana. Brush up on your knowledge of SRE practices and be ready to discuss how you've applied them in real-world scenarios.

Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled reliability issues or automated processes in previous roles. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your impact.

Understand the Company’s Mission

Familiarise yourself with Neo4j’s DBaaS product, Neo4j Aura, and its importance in the market. Be ready to discuss how you can contribute to improving its reliability and what that means for users globally.

Ask Insightful Questions

Prepare thoughtful questions that show your interest in the role and the company. Inquire about their current challenges in reliability engineering or how they measure success with SLIs and SLOs. This demonstrates your proactive mindset and eagerness to contribute.