At a Glance
- Tasks: Ensure reliability and performance of large-scale software systems through automation and collaboration.
- Company: Join Altium, a leader in cloud platforms with a focus on innovation.
- Benefits: Enjoy private health insurance, pension scheme, remote work options, and professional development support.
- Why this job: Make a real impact on system reliability while working with cutting-edge technologies.
- Qualifications: 5+ years in SRE or DevOps, strong software development skills, and experience with cloud technologies.
- Other info: Flexible working arrangements and excellent career growth opportunities await you.
The predicted salary is between £28,800 and £48,000 per year.
The Site Reliability Engineer ensures the reliability, availability, and performance of large-scale software systems through a blend of software engineering and systems administration. Key responsibilities involve automating operational tasks, improving observability, and contributing to incident management, while also collaborating with development and technology teams to build more reliable and scalable applications.
Join Altium as a Site Reliability Engineer to ensure the reliability and performance of the Altium Cloud Platforms.
Key Responsibilities
- Understand how an Altium Cloud Platform works.
- Pioneer improvements in observability, including logging, monitoring, and application performance management (APM), ensuring system reliability and proactive issue detection.
- Develop and implement reliability frameworks and patterns that standardize and elevate the resilience of our SaaS products across multiple regions and environments.
- Cultivate a shared responsibility model where the SRE team collaborates with and educates engineering teams on reliability best practices.
- Contribute to incident response and management, ensuring rapid resolution, clear stakeholder communication, and post-incident analysis for continuous improvement.
- Participate in system design consulting, platform management, infrastructure upgrades and capacity planning.
- Partner closely with engineering and development teams to enhance product stability, observability, and manageability through best practices in reliability engineering.
- Partner closely with DevOps/Operations to drive automation initiatives, promote Infrastructure as Code (IaC), and streamline deployment processes to improve operational efficiency and scalability.
- Champion Service-Oriented Organization (SOO) principles to ensure accountability and clarity in service ownership.
Qualifications
- 5+ years in an SRE, DevOps, or related role in a large-scale environment.
- Software development experience (ideally working with, or as, a .NET developer).
- Strong understanding of the SDLC, microservices, and high-availability (HA) architecture.
- Experience with observability tooling: New Relic, ELK, Grafana, PagerDuty, OpenTelemetry (OTel), or similar.
- Experience with Kubernetes clusters in a production setting, AWS, IOC.
- Experience with operational tasks.
- Knowledge of CI/CD tooling: Jenkins, GitLab, GitHub, Argo CD, or similar.
- Knowledge of IaC tooling: Terraform, Ansible.
- Basic knowledge of networking fundamentals.
- Experience with relational databases (MySQL, PostgreSQL) is a plus.
Benefits
- Private health insurance including dental coverage.
- Pension scheme with company match up to 9%.
- nilo.health, mental health and wellbeing support.
- Remote working abroad program.
- Professional development support and resources.
- Employee referral program.
- 28 days' holiday + public holidays and special leave.
- Flexible working arrangements available based on role and location.
- Enhanced family and special leave.
- Corporate membership rates with national gyms.
- Free lunch, snacks, and drinks in the office.
- Electric car charging stations, free office parking, and bicycle and scooter storage.
We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal opportunity workplace.
SRE Engineer in Cambridge employer: Altium
Contact Details:
Altium Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the SRE Engineer role in Cambridge
✨Tip Number 1
Get to know the company inside out! Research Altium's Cloud Platforms and their approach to reliability. This will help you tailor your conversations and show that you're genuinely interested in what they do.
✨Tip Number 2
Network like a pro! Connect with current employees on LinkedIn or attend industry meetups. Building relationships can give you insider info and might even lead to a referral, which is always a bonus!
✨Tip Number 3
Prepare for technical interviews by brushing up on your SRE skills. Practice common scenarios related to incident management and observability tools. Being ready to discuss your experience with Kubernetes and CI/CD tooling will set you apart.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re serious about joining the Altium team and ready to contribute to their mission.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the SRE role. Highlight your software development experience, especially if you've worked with .NET or in large-scale environments.
Craft a Compelling Cover Letter: Use your cover letter to tell us why you're passionate about Site Reliability Engineering. Share specific examples of how you've improved system reliability or contributed to incident management in your previous roles.
Showcase Your Technical Skills: Don’t forget to mention your experience with tools like New Relic, Kubernetes, and CI/CD tooling. We want to see how you’ve used these technologies to enhance observability and operational efficiency.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role without any hiccups!
How to prepare for a job interview at Altium
✨Know Your Stuff
Make sure you understand how Altium Cloud Platforms work. Brush up on your knowledge of observability tools like New Relic and Grafana, as well as your experience with Kubernetes and AWS. Being able to discuss these topics confidently will show that you're ready to hit the ground running.
✨Showcase Your Experience
Prepare to talk about your past roles in SRE or DevOps, especially any large-scale environments you've worked in. Highlight specific projects where you automated operational tasks or improved system reliability. Real-world examples will make your experience more relatable and impressive.
✨Collaboration is Key
Since the role involves working closely with engineering teams, be ready to discuss how you've collaborated in the past. Share examples of how you've educated others on reliability best practices or contributed to incident management. This will demonstrate your ability to foster a shared responsibility model.
✨Ask Smart Questions
Prepare thoughtful questions about the company's approach to reliability engineering and their use of Infrastructure as Code. This not only shows your interest in the role but also gives you insight into their processes and culture. Plus, it’s a great way to engage with your interviewers!