Site Reliability Engineer (SRE)

United States Digital Space LLC

Full-Time | £60,000 - £80,000 / year (est.) | No working from home possible

At a Glance

Tasks: Own the reliability of a secure platform on Google Cloud and automate processes.
Company: Join a dynamic tech company focused on innovation and collaboration.
Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
Other info: Be part of a diverse team building foundational practices in a fast-paced environment.
Why this job: Make a real impact by ensuring platform reliability and enhancing client experiences.
Qualifications: Experience with GCP, incident management, and strong coding skills required.

The predicted salary is between £60,000 and £80,000 per year.

About the Role

The company is building a secure, multi-tenant platform on Google Cloud, and we’re hiring a Site Reliability Engineer to own the reliability and observability of that platform end-to-end. This is a hands-on role for someone who wants to do real SRE work - not a rebrand of L1 support. You’ll write the dashboards, define the SLOs, build the automation that kills toil, and take your turn on the on-call rotation that proves it all works. When something breaks at 2 AM, you’re the person who keeps it running; when nothing’s breaking, you’re the person making sure the next break is smaller, shorter, or doesn’t happen at all.

What You’ll Do

Observability and reliability engineering
- Define and maintain SLOs and SLIs for our tier-1 services: API gateway, application services, identity, and edge availability.
- Build canonical dashboards and alerts in Google Cloud Monitoring, backed by structured logs and BigQuery log analytics.
- Tune alert routing so every page is actionable — kill the rest.
- Instrument services for distributed tracing and structured logging; push back on services that ship without it.
- Own error budgets and use them to prioritise reliability work over feature work when the budget is burned.
- Reduce toil: automate the top recurring page from the previous quarter.
- Maintain runbooks so that every page maps to a runbook within one cycle of its first occurrence.
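
As a rough illustration of the error-budget item above, here is a minimal Python sketch of how a remaining budget could be computed from request counts. The `SloWindow` shape, the function names, and the 99.9% target are assumptions made for this example; they do not come from the posting and do not describe the team's actual tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SloWindow:
    """Request counts for one SLO window (hypothetical shape, for illustration only)."""
    slo_target: float      # e.g. 0.999 for an assumed 99.9% availability SLO
    total_requests: int
    failed_requests: int


def error_budget_remaining(window: SloWindow) -> float:
    """Return the unspent fraction of the error budget (negative if overspent)."""
    allowed_failures = (1.0 - window.slo_target) * window.total_requests
    if allowed_failures <= 0:
        # No traffic, or a 100% target: there is no budget to divide by.
        return 1.0 if window.failed_requests == 0 else float("-inf")
    return 1.0 - window.failed_requests / allowed_failures


def should_prioritise_reliability(window: SloWindow, threshold: float = 0.0) -> bool:
    """True when the remaining budget is at or below the (assumed) threshold."""
    return error_budget_remaining(window) <= threshold


if __name__ == "__main__":
    window = SloWindow(slo_target=0.999, total_requests=1_000_000, failed_requests=250)
    print(f"budget remaining: {error_budget_remaining(window):.2%}")
    print("prioritise reliability work:", should_prioritise_reliability(window))
```

In practice the counts would come from a monitoring backend rather than hard-coded values, and the threshold for switching from feature work to reliability work is a team policy decision.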

On-call rotation and incident response
- First responder for production alerts across monitoring, API gateway, edge defence, and CI.
- Triage severity, run the incident bridge, drive mitigation (revision rollback, traffic shift, scaling, edge block, credential rotation).
- Own internal and external incident comms during your shift.
- Drive postmortems to closure with action items tracked as audit evidence.
- Write clean handoffs at the end of each shift.

Our stack

Google Cloud Platform across multiple environments.
Apigee X for API management.
Cloud Run, GKE Autopilot, Cloud SQL.
Identity Platform for customer identity.
Cloud Armor, Cloud IDS, Security Command Center for edge and posture.
BigQuery-backed log analytics from an org-level log sink.
OpenTofu / Terraform for everything; GitHub Actions for CI/CD.
Linear for work tracking.

What You Bring

Required
- Solid production experience on GCP (or comparable AWS/Azure depth with willingness to ramp on GCP fast).
- Comfortable on-call: you’ve run incidents, written postmortems, and shipped the action items.
- Strong observability fundamentals: SLOs, log-based metrics, alert hygiene, dashboard discipline.
- Working knowledge of Kubernetes, API gateways, identity systems, and at least one IaC tool.
- Scripting / coding fluency (Python, Go, Bash) for automation and tooling.
- Good written communication — handoffs, postmortems, and runbooks are part of the job.
- Bias toward fixing the system, not the symptom.

Nice to Have
- Apigee or another enterprise API gateway in production.
- BigQuery for log analytics or audit.
- Experience standing up observability from scratch, not just maintaining inherited dashboards.
- Experience in SOC 2 or similar compliance environments.

Why Join Us

You’ll be at the centre of how we bring the company to life for our institutional clients. Your work directly shapes the success of every implementation—getting requirements right means we deliver faster, smoother, and with fewer surprises. You’ll be joining at a foundational moment, helping to build the delivery practice from the ground up alongside a Delivery Manager who will rely on you as a critical partner from day one. If you enjoy the puzzle of understanding complex environments, the satisfaction of a well-organised document, and the energy of working directly with clients, this is your role.

We are an equal opportunity employer and value diversity. We do not discriminate on the basis of race, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Site Reliability Engineer (SRE) employer: United States Digital Space LLC

Join a dynamic team where your expertise as a Site Reliability Engineer will be pivotal in shaping a secure, multi-tenant platform on Google Cloud. We foster a collaborative work culture that prioritises employee growth and innovation, offering hands-on experience with cutting-edge technologies while ensuring a supportive environment for professional development. With a commitment to diversity and inclusion, we provide meaningful opportunities for you to make a real impact in a foundational role that directly influences our clients' success.

Contact Details:

United States Digital Space LLC Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Site Reliability Engineer (SRE)

✨Join the IT Consultancy Buzz

Get involved in local or virtual IT consultancy meetups and forums. This is where we can rub shoulders with industry professionals, get insights into what United States Digital Space LLC values, and even spot unadvertised opportunities. Don't miss out on these chances to make a name for ourselves in the IT world!

✨Show Off Your Skills

Create a personal project or case study relevant to the challenges United States Digital Space LLC might face. Use platforms like GitHub or Medium to share your findings. This not only demonstrates our consulting skills but shows a proactive attitude, making us stand out from the crowd when applying for that full-time gig.

✨Leverage LinkedIn for Connections

Follow and engage with the relevant thought leaders and influencers in IT consultancy on LinkedIn. Share insightful content and join discussions to gain visibility. A well-placed comment or shared article could catch the attention of someone at United States Digital Space LLC!

✨Direct Apply to United States Digital Space LLC

Let's not forget to apply directly through the United States Digital Space LLC website! Tailor your application to showcase our understanding of their consulting style and how we can contribute to their projects. A personalised approach can make a huge difference in landing that full-time position!

We think you need these skills to ace Site Reliability Engineer (SRE)

Google Cloud Platform (GCP)

API Management (Apigee X)

Kubernetes

Infrastructure as Code (IaC) tools

Scripting (Python, Go, Bash)

Observability Fundamentals (SLOs, log-based metrics)

Incident Response

Postmortem Documentation

Automation

BigQuery for log analytics

Monitoring and Alerting

Communication Skills

Problem-Solving Skills

On-call Experience

Some tips for your application 🫡

Showcase Your Problem-Solving Skills: In IT consulting, it's all about problem-solving, so make sure your CV highlights your analytical skills and any relevant projects you've tackled. Mention specific technologies or methodologies you've used to resolve issues or improve processes; this shows you can think critically and deliver results, which is vital for us at United States Digital Space LLC.

Highlight Relevant Certifications: Certifications like ITIL, PMP, or even specific tech stack qualifications can really make you stand out. Make sure to include these in your CV, as they not only demonstrate your expertise but also your commitment to staying current in the field. We love seeing candidates who are proactive about their professional development!

Tailor Your Cover Letter: Your cover letter is your chance to connect personally with us at United States Digital Space LLC. Share stories about your experiences in IT consulting, and how they shaped your desire to join our team. Mention why you’re excited about this particular role, and how you see yourself contributing to our projects.

Keep It Clear and Concise: We're all busy, so make sure your application is easy to read. Use bullet points for key achievements, and don’t overload us with jargon. A clean, professional layout goes a long way. Remember, the clearer your application, the more likely we are to invite you in for an interview!

How to prepare for a job interview at United States Digital Space LLC

✨Brush Up on Your Technical Skills

For an IT consulting role, be ready to demonstrate your technical prowess. You might face questions on systems integration, cloud technologies, or even troubleshooting specific software. If you have experience with tools like AWS, Azure, or even specific programming languages, make sure you can talk about them fluently.

✨Showcase Your Problem-Solving Approach

IT consulting is all about solving problems for clients. Think about how you can illustrate your approach to a past challenge using the STAR method (Situation, Task, Action, Result). It's a great way to show how you tackle complex issues and come up with effective solutions.

✨Know the Business Impact of IT Solutions

When discussing your experiences, focus not just on the tech solutions you implemented, but also on their business impact. Employers want to see that you can connect IT with organisational goals. Prep examples that highlight how your tech contributions improved efficiency or reduced costs for past clients or projects.

✨Prepare for Behavioural Questions

Since IT consulting often involves teamwork and client interactions, expect behavioural questions that assess your interpersonal skills. Be prepared with examples that demonstrate your adaptability, communication skills, and how you handle client feedback. Before the interview, think of situations where you worked closely with clients to create effective IT strategies or changes.
