At a Glance
- Tasks: Join GitLab to ensure our PostgreSQL database runs smoothly and efficiently.
- Company: Be part of an innovative tech company transforming software development with AI.
- Benefits: Enjoy flexible paid time off, equity compensation, and a growth fund.
- Why this job: Make a real impact on a platform used by millions of developers worldwide.
- Qualifications: Experience with PostgreSQL and automation tools like Ansible or Terraform is essential.
- Other info: Work remotely in a dynamic team that values collaboration and continuous learning.
The predicted salary is between 36000 - 60000 ÂŁ per year.
Join to apply for the Intermediate Site Reliability Engineer, Database Operations role at GitLab. GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world.
Overview
You will join our Database Operations team as an Intermediate Site Reliability Engineer, keeping GitLab.com—one of the largest single-tenancy open source SaaS platforms on the internet—running smoothly and reliably. In this role, you will take ownership of the PostgreSQL database infrastructure that powers millions of developers worldwide, automating operational tasks, improving system performance and reliability, and designing solutions that scale to support hundreds of thousands of concurrent users.
Projects
- Design and implement mature automation for database provisioning, replication, and backup testing using tools like Terraform and Ansible.
- Develop self-service tools and dashboards that empower other teams to manage their own database resources.
- Lead capacity planning and scalability initiatives to ensure GitLab.com continues growing reliably.
- Participate in production incident response and help implement systemic improvements to prevent recurrence.
Responsibilities
- Automate operational tasks across all environments—from package updates and configuration changes to provisioning of user-facing services—so manual effort becomes the exception, not the rule.
- Design and maintain PostgreSQL database infrastructure components that allow GitLab.com to scale reliably while supporting hundreds of thousands of concurrent users.
- Respond to production incidents and platform emergencies, working with peer SREs to diagnose and resolve database-related issues quickly and thoroughly.
- Build observability systems that monitor database health, predict capacity needs based on usage patterns, and alert on symptoms rather than outages.
- Develop and ship database performance solutions in collaboration with product and engineering teams, including query optimization, migration reviews, and infrastructure recommendations.
- Create self-service tools and automation—using Terraform, Ansible, Chef, and GitLab ChatOps—that empower engineering teams to manage their own database interactions safely.
- Document decisions, learnings, and operational procedures so that knowledge becomes repeatable actions and eventually becomes automation.
- Participate in regularly scheduled on-call rotations to ensure GitLab.com remains operational during off-hours and weekends when necessary.
Qualifications
- Hands-on experience running PostgreSQL in high-growth, large production environments, including both self-managed infrastructure and database-as-a-service platforms.
- Expertise with infrastructure automation and configuration management tools such as Ansible, Terraform, Chef, or Puppet to automate operational tasks and drive system reliability.
- Solid understanding of SQL, PL/pgSQL, data modeling, and data structure design; ability to analyze PostgreSQL internals to troubleshoot and optimize systems.
- Experience working in large-scale, distributed SaaS production environments where you have managed reliability, performance, and scalability challenges at significant scale.
- Strong written communication skills and commitment to documentation; you thrive in remote, asynchronous environments and share knowledge effectively across your team.
- Proactive, hands-on approach where you identify issues, take ownership of solutions, and contribute improvements to infrastructure and code.
- Capability to mentor junior team members and develop deep expertise in your domain areas, then share that knowledge to help others grow.
- Backend engineering experience with languages such as Ruby or Go, and/or familiarity with OLAP databases like Clickhouse.
About the Team
We are responsible for building, running, and evolving the entire lifecycle of the PostgreSQL database engine that powers GitLab.com. You will be part of our team focused on owning the reliability, scalability, performance, and security of our database infrastructure and supporting services.
Benefits
- Flexible Paid Time Off
- Team Member Resource Groups
- Equity Compensation & Employee Stock Purchase Plan
- Growth and Development Fund
- Parental leave
- Home office support
Please note that we welcome interest from candidates with varying levels of experience; many successful candidates do not meet every single requirement.
Intermediate Site Reliability Engineer, Database Operations in London employer: GitLab
Contact Detail:
GitLab Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Intermediate Site Reliability Engineer, Database Operations in London
✨Tip Number 1
Network like a pro! Reach out to current or former GitLab employees on LinkedIn. Ask them about their experiences and any tips they might have for your application process. Personal connections can give you insights that make a real difference.
✨Tip Number 2
Prepare for the interview by diving deep into GitLab's products and culture. Familiarise yourself with their AI-powered DevSecOps platform and think about how your skills in PostgreSQL and automation can contribute to their mission. Show them you’re not just another candidate!
✨Tip Number 3
Practice your problem-solving skills! Since this role involves hands-on infrastructure work, be ready to tackle technical challenges during interviews. Use platforms like StudySmarter to brush up on relevant concepts and scenarios that might come up.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of the GitLab team. Let’s get you that interview!
We think you need these skills to ace Intermediate Site Reliability Engineer, Database Operations in London
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter for the Intermediate Site Reliability Engineer role. Highlight your experience with PostgreSQL and automation tools like Terraform and Ansible, as these are key to what we do at GitLab.
Showcase Your Communication Skills: Since strong written communication is crucial in our remote environment, ensure your application reflects your ability to document processes and share knowledge effectively. This will help us see how you can contribute to our team culture.
Demonstrate Problem-Solving Abilities: In your application, share examples of how you've tackled challenges in high-growth environments. We love seeing candidates who take ownership of solutions and drive improvements, so don’t hold back on showcasing your proactive approach!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our awesome team!
How to prepare for a job interview at GitLab
✨Know Your PostgreSQL Inside Out
Make sure you brush up on your PostgreSQL knowledge before the interview. Be ready to discuss your hands-on experience with it, especially in high-growth environments. Prepare to explain how you've tackled performance and reliability challenges in the past.
✨Showcase Your Automation Skills
Since automation is key for this role, come prepared with examples of how you've used tools like Terraform or Ansible to streamline operations. Discuss specific projects where your automation efforts led to measurable improvements in efficiency.
✨Demonstrate Your Problem-Solving Approach
Be ready to share instances where you've identified issues proactively and taken ownership of solutions. Highlight your ability to work under pressure, especially during production incidents, and how you’ve contributed to systemic improvements.
✨Communicate Clearly and Collaboratively
Strong communication skills are essential, especially in a remote environment. Practice articulating your thoughts clearly and be prepared to discuss how you share knowledge with team members. Mention any mentoring experiences you've had, as this will show your commitment to team growth.