At a Glance
- Tasks: Architect our Observability Centre of Excellence and enhance global platform reliability.
- Company: Join a leading tech firm focused on innovation and reliability.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Dynamic role with a focus on collaboration and career advancement.
- Why this job: Make a real impact on global platforms while working with cutting-edge technologies.
- Qualifications: Experience with OpenTelemetry, observability tools, and Infrastructure as Code.
The predicted salary is between 60000 - 80000 £ per year.
About the Role
In this role, you will be the primary architect of our Observability Centre of Excellence, directly influencing the reliability and uptime of global platforms that keep world industries moving.
Key Responsibilities
- Lead a global "OTel First" strategy, implementing OpenTelemetry at scale across a diverse technological landscape.
- Spearhead the development of automation scripts and Infrastructure as Code using Terraform to ensure seamless, reproducible platform delivery.
- Optimize platform performance and cost‑efficiency, ensuring our observability tools scale economically as our data grows.
- Collaborate with engineering teams to embed reliability and security standards into new features from the ground up.
- Drive root cause analysis and problem management to proactively prevent incidents and improve the customer experience.
Essential Skills & Experience
- Hands‑on experience with the OpenTelemetry Collector, APIs, and SDKs.
- Extensive experience with observability tools like NewRelic, Datadog, or Splunk.
- Strong proficiency in Infrastructure as Code (Terraform, Ansible) and cloud platforms (AWS, GCP, or Azure).
- Deep understanding of containerization and orchestration using Docker and Kubernetes.
- Advanced coding skills in Python, Go, or Java for building robust automation and monitoring tools.
Bonus Points For
- Experience leveraging AI coding assistants like GitHub Co‑Pilot to accelerate development.
Staff Site Reliability Engineer - Cloud employer: Trimble Inc.
As a Staff Site Reliability Engineer - Cloud, you will thrive in a dynamic and innovative environment that prioritises employee growth and collaboration. Our company fosters a culture of continuous learning, offering extensive training opportunities and the chance to work with cutting-edge technologies in a global setting. With a strong commitment to work-life balance and a focus on employee well-being, we provide a rewarding workplace where your contributions directly impact the reliability of platforms that support industries worldwide.
StudySmarter Expert Advice🤫
We think this is how you could land Staff Site Reliability Engineer - Cloud
✨Tip Number 1
Network like a pro! Reach out to folks in the industry on LinkedIn or at meetups. A friendly chat can open doors that a CV just can't.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those involving OpenTelemetry and Terraform. It’s a great way to demonstrate your hands-on experience.
✨Tip Number 3
Prepare for interviews by practising common technical questions related to SRE and observability tools. We recommend doing mock interviews with friends or using online platforms to get comfortable.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive!
We think you need these skills to ace Staff Site Reliability Engineer - Cloud
Some tips for your application 🫡
Tailor Your CV:Make sure your CV reflects the skills and experiences that match the job description. Highlight your hands-on experience with OpenTelemetry, observability tools, and Infrastructure as Code. We want to see how you can contribute to our 'OTel First' strategy!
Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about site reliability engineering and how your background aligns with our goals. Don’t forget to mention any relevant projects or achievements that showcase your expertise.
Showcase Your Technical Skills:When filling out your application, be sure to include specific examples of your coding skills in Python, Go, or Java. We love seeing how you've used these languages to build automation and monitoring tools, so don’t hold back!
Apply Through Our Website:We encourage you to apply directly through our website for the best chance of getting noticed. It’s super easy, and you’ll be able to keep track of your application status. Plus, we love seeing candidates who take the initiative!
How to prepare for a job interview at Trimble Inc.
✨Know Your Tech Inside Out
Make sure you’re well-versed in OpenTelemetry, Terraform, and the observability tools mentioned in the job description. Brush up on your coding skills in Python, Go, or Java, as you might be asked to solve a problem on the spot.
✨Showcase Your Problem-Solving Skills
Prepare to discuss past experiences where you’ve driven root cause analysis or improved platform reliability. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your impact.
✨Demonstrate Collaboration
Since collaboration with engineering teams is key, think of examples where you’ve worked cross-functionally. Be ready to explain how you’ve embedded reliability and security standards into new features in previous roles.
✨Ask Insightful Questions
Prepare thoughtful questions about their current observability practices and future goals. This shows your genuine interest in the role and helps you gauge if the company aligns with your career aspirations.