At a Glance
- Tasks: Join us in revolutionising travel tech and ensure our systems run smoothly.
- Company: Be part of Duffel, a forward-thinking company transforming the travel industry.
- Benefits: Enjoy competitive pay, personal growth opportunities, and a share in the company.
- Why this job: Make a real impact on travel experiences while working with cutting-edge technology.
- Qualifications: Experience in systems engineering and a passion for software development.
- Other info: Collaborative environment with a focus on diversity and inclusion.
The predicted salary is between 36000 - 60000 £ per year.
Overview
Create the future of travel with us. Whether it’s to visit the people closest to us, starting an exciting adventure, or a career-defining business trip, travel is an essential part of our lives. Yet we’ve all experienced the aches and pains of getting to our destination. Today, more than 4 billion airline passengers rely on technology that hasn’t kept up with the expectations of the modern connected traveller. That’s why we’ve started to rebuild the infrastructure that underpins the travel industry. We’re on a mission to unravel travel — simplifying systems and building the tools that will make the future of travel effortless.
Engineering at Duffel
We’re building tools to simplify travel distribution, search and booking. What does this actually mean? It’s one common and seamless API. This brings huge technical challenges as we need to design and build a beautiful API before integrating to hundreds of airlines. Along with that we need to navigate through the differing needs and systems of each airline whilst building a fantastic developer experience to go with it. The tools used on the team include Elixir, Phoenix, Kubernetes and Google Cloud Platform.
Site Reliability Engineering at Duffel
As an SRE at Duffel, you’ll be part of a small team within engineering that is responsible for the reliability, performance, and resilience of our infrastructure and applications. You will be working closely with engineering teams to understand their needs and help meet the demands of our product as we scale globally.
What We’re Looking For
- An infrastructure and systems engineering generalist who is comfortable diving deep into the weeds on different issues.
- An enthusiasm for both software development and systems engineering.
- A high bar for code and configuration quality and readability.
- A good understanding of current observability and reliability practices.
- Experienced and comfortable in running incident response.
- Big picture thinking - you can make trade offs on technical work streams against business impact.
- Fantastic communication skills. You’re able to articulate what you’re working on and why to the team in a clear and structured way.
- You thrive in a collaborative environment. You believe in your own methods but keep an open mind, taking suggestions and feedback onboard as well.
Technologies
We run our infrastructure on Google Cloud Platform, so you’ll be helping to run a few of their products such as GKE, CloudSQL for PostgreSQL, BigQuery, Memorystore (Redis) and more. We manage the infrastructure and security for a segregated PCI Cardholder Data Environment, entirely managed with Google Cloud Platform services and tooling. We follow an Infrastructure as Code approach to managing our infrastructure, using Terraform. We follow a GitOps approach to managing our Kubernetes configuration, using ArgoCD and Helm. We manage a high-availability metrics collection system using Grafana, Thanos & Prometheus. We’re in the process of transitioning to OpenTelemetry and Honeycomb for our application telemetry (traces and metrics). We manage a data pipeline using Pub/Sub, Airbyte, and dbt.
Our Current Focus
We’re currently driving a big shift in how we think about and monitor reliability across the engineering organisation, with a focus on early detection of customer-impacting issues. We’re extending and standardising our use of OpenTelemetry, and introducing Honeycomb as the single place for engineers to understand how our applications are operating in production. This project involves both technical work, on the application libraries and infrastructure that make up the OpenTelemetry pipeline, and an education piece, working to change perceptions and behaviours across engineering.
The Future
We currently run all our services from a single European region in Google Cloud. In the medium term, for performance, reliability, and data residency reasons, we’ll be starting to think about how to (re)architect our applications and infrastructure to span multiple regions, operating globally. We deploy our application multiple times a day, but deploys are all or nothing, and when we encounter issues, roll backs are slow. One way to address this would be to invest in CI/CD performance improvements, but we’d also like to explore alternative deployment strategies like Canaries, Blue/Green, and traffic mirroring, and get more comfortable testing changes in production with real customer traffic.
What you can expect from us
We’re dedicated to your personal growth. Our environment is comfortable both physically, but also in that our ears are always open to any ideas, concerns and questions. We believe that everyone should have pride in their work, taking full ownership of it and its impact. That’s why everyone who joins Duffel owns a share of the company. We are an equal opportunities employer. We believe that the key to our success is employing a diverse team, that’s why recruitment decisions are only based on your experience and skills. We value your ability to problem solve and build amazing things so we welcome applications for everyone – regardless of age, sex, disability, sexual orientation, race, religion or belief.
Site Reliability Engineer (SRE) in City of London employer: Duffel
Contact Detail:
Duffel Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (SRE) in City of London
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to SRE. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by practising common SRE scenarios. Brush up on your incident response strategies and be ready to discuss how you've tackled challenges in the past. Confidence is key!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are genuinely interested in joining our mission to revolutionise travel.
We think you need these skills to ace Site Reliability Engineer (SRE) in City of London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the Site Reliability Engineer role. Highlight your experience with technologies like Google Cloud Platform, Kubernetes, and any relevant incident response work.
Craft a Compelling Cover Letter: Use your cover letter to tell us why you're passionate about travel technology and how your background makes you a great fit for our team. Be sure to mention specific projects or experiences that showcase your problem-solving skills.
Showcase Your Communication Skills: Since fantastic communication is key for this role, make sure your application materials are clear and well-structured. We want to see how you articulate your thoughts and ideas, so don’t hold back!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!
How to prepare for a job interview at Duffel
✨Know Your Tech Stack
Familiarise yourself with the technologies mentioned in the job description, like Elixir, Kubernetes, and Google Cloud Platform. Be ready to discuss how you've used these tools in past projects or how you would approach challenges using them.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of how you've tackled complex issues in infrastructure or systems engineering. Highlight your experience with incident response and how you ensure reliability and performance in your work.
✨Communicate Clearly
Practice articulating your thoughts clearly and concisely. You’ll need to explain technical concepts to non-technical team members, so being able to break down complex ideas is crucial. Think about how you can convey your past experiences in a structured way.
✨Emphasise Collaboration
Duffel values teamwork, so be prepared to discuss how you thrive in collaborative environments. Share examples of how you've worked with cross-functional teams and how you handle feedback and suggestions from others.