At a Glance
- Tasks: Maintain and enhance cloud infrastructure while ensuring reliability and security of AI platforms.
- Company: Join an award-winning AI software company leading in machine learning innovation.
- Benefits: Competitive salary, benefits package, and a collaborative work environment.
- Why this job: Work on real-world problems and make a significant impact in a cutting-edge tech space.
- Qualifications: 2+ years in DevOps or similar roles with experience in cloud infrastructure and monitoring tools.
- Other info: Hybrid working model with opportunities for career growth in a dynamic team.
The predicted salary is between 50000 - 60000 £ per year.
An exciting opportunity for a Site Reliability Engineer to join an award-winning, Cambridge-based AI software company at the forefront of machine learning innovation.
As a Site Reliability Engineer, you will play a key role in maintaining and enhancing cloud infrastructure, monitoring systems, and deployment processes, ensuring the reliability, scalability, and security of a sophisticated machine learning platform deployed across cloud environments.
Cambridge, hybrid working model – 3 days in office – not easily reachable via public transport from London.
Requirements for Site Reliability Engineer:- Minimum 2:1 degree in Computer Science or a related field
- 2+ years’ experience in a DevOps, SRE, Platform Engineering or similar role
- Experience configuring and using monitoring tools such as Grafana and Prometheus
- Hands-on experience with cloud infrastructure, ideally GCP (Azure or AWS also considered)
- Scripting experience using Python and/or Bash
- Experience using Git within a professional software development environment
- Familiarity with technologies such as NGINX, Flask (Python), React (TypeScript), PostgreSQL
- Experience administering Linux-based systems
- Exposure to information security compliance standards
- Experience working within Agile development environments
- Develop and enhance monitoring systems to proactively identify performance, reliability, security, and cost issues
- Monitor platform performance and communicate insights to engineering teams
- Identify, plan, and implement improvements to cloud infrastructure and deployment processes
- Work closely with engineering teams to support product development and platform scalability
- Ensure infrastructure and deployments are secure, robust, and aligned with best practices
- Advocate for effective monitoring and reliability considerations throughout the development lifecycle
- Support ongoing compliance with information security standards including ISO 27001
Working for an award-winning AI software company at the forefront of machine learning innovation offers the opportunity to work on complex, real-world problems within industrial R&D environments. You will be part of a collaborative, high-calibre engineering team within a growing Cambridge-based business, with a competitive salary and benefits package.
If you are a Site Reliability Engineer looking to develop your career within a cutting-edge AI company, we would love to hear from you.
We are an equal opportunity employer and value diversity at RedTech. We do not discriminate on the basis of race, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Site Reliability Engineer (Home-based) in Cambridge employer: RedTech Recruitment
Contact Detail:
RedTech Recruitment Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (Home-based) in Cambridge
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with current employees at the company. A friendly chat can sometimes lead to opportunities that aren’t even advertised!
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to cloud infrastructure and monitoring tools. This gives us a tangible way to see what you can do.
✨Tip Number 3
Prepare for the technical interview! Brush up on your scripting skills and be ready to discuss your experience with tools like Grafana and Prometheus. We love seeing candidates who can talk through their problem-solving process.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we’ve got loads of other opportunities if this one isn’t quite right for you.
We think you need these skills to ace Site Reliability Engineer (Home-based) in Cambridge
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with cloud infrastructure, monitoring tools, and scripting languages like Python or Bash. We want to see how your skills match what we're looking for!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about AI and how your background makes you a great fit for our team. Keep it concise but engaging – we love a good story!
Showcase Relevant Experience: When filling out your application, be sure to showcase any relevant experience in DevOps, SRE, or Platform Engineering. Mention specific projects where you've used tools like Grafana or Prometheus, as this will catch our eye!
Apply Through Our Website: We encourage you to apply through our website for the best chance of getting noticed. It’s super easy, and you’ll find all the details you need there. Plus, we love seeing applications come directly from our site!
How to prepare for a job interview at RedTech Recruitment
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, like GCP, Grafana, and Prometheus. Brush up on your scripting skills in Python and Bash, as you might be asked to demonstrate your knowledge during the interview.
✨Showcase Your Problem-Solving Skills
Prepare to discuss specific challenges you've faced in previous roles, especially related to cloud infrastructure and monitoring systems. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight how you’ve made a positive impact.
✨Understand the Company’s Mission
Research the company’s focus on AI and machine learning innovation. Be ready to discuss how your experience aligns with their goals and how you can contribute to their projects, particularly in enhancing reliability and security.
✨Ask Insightful Questions
Prepare thoughtful questions about the team dynamics, the tools they use, and their approach to compliance with information security standards. This shows your genuine interest in the role and helps you assess if it’s the right fit for you.