Site Reliability Engineer
Site Reliability Engineer

Site Reliability Engineer

Full-Time 36000 - 60000 £ / year (est.) No home office possible
Go Premium
V

At a Glance

  • Tasks: Ensure system reliability and performance while collaborating with product engineering squads.
  • Company: Join Virtuoso QA, a leader in quality-first software testing revolution.
  • Benefits: Enjoy competitive pay, remote work, health insurance, and personal development budget.
  • Why this job: Be part of a game-changing team that empowers everyone to test software effortlessly.
  • Qualifications: 5 years experience with open-source tech and AWS; strong problem-solving skills.
  • Other info: Flexible working, career growth opportunities, and a fun culture await you!

The predicted salary is between 36000 - 60000 £ per year.

Join to apply for the Site Reliability Engineer role at Virtuoso QA.

A Bit About Us

Virtuoso's mission is to enable and lead the world's quality-first revolution. The field of QA has not kept pace with the software industry's transition to CI/CD. We are fixing that. Virtuoso has reimagined how software is tested by developing a game-changing platform that is already being used by the biggest names in software. We passionately believe that anyone should be able to create and maintain tests regardless of their technical skill, and that quality is a key driver for change and growth. The latest advances in AI and Machine Learning have been leveraged to produce test automation software that thinks like a human, empowers everyone to test, and for the first time delivers on the promise of codeless test automation.

Achieving remarkable success has become a business-as-usual activity for us and we need to rapidly expand our team for that to continue to increase. Want to join the quality-first revolution? Then read on.

A company without borders with employees that make an impact worldwide, with offices and a remote team spread across the globe. The nature of our product is reflected in our thorough and agile culture. We do the right things fast and our application process is no different. We want exceptional people and we will act to get them.

About The Role

As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, performance, and availability of our systems. You will be a member of the SRE team and work within a product engineering squad to help them ship and maintain reliable features.

You Will Have An Impact By:

  • Helping to design and implement cloud-native solutions around user needs.
  • Proposing and delivering architecture/system changes to improve the reliability, stability, and throughput of our systems.
  • Working closely with the SRE team to refine and plan enhancements to the cloud estate as a whole.
  • Responding to incidents and designing remediation plans to ensure they do not recur.

You will primarily work in Terraform with the AWS stack, but will also be trusted to understand and make changes to our primarily Java backend codebase.

Key Tasks:

  • Work with squads to design and deliver infrastructure required for product features.
  • Ensure that deliveries are sufficiently robust and monitored by designing for observability and reliability in collaboration with engineering team members.
  • Lead incident response activities, including root cause analysis, problem resolution, and post-incident reviews.
  • Identify and deliver initiatives to improve reliability, observability and throughput of key systems.
  • Assist cross-functional teams with upskilling and support in monitoring, CI pipelines, release engineering, and IaC/AWS.

How will success be measured in this role:

  • Coordinate the response to several incidents (if any), aiming for a TTFR of one hour (office hours only), including production of post mortem and preventative measures in future.
  • KPIs: number of incident responses handled, TTFR.
  • Ensure that at least 60% of features delivered by squad produce corresponding Grafana metrics.
  • KPI: number of epics with metric completed tasks.
  • Ensure that SRE knowledge is shared and documented through written documentation, workshops, and individual training.
  • KPI: number of notion articles, workshops, pairing sessions.
  • Identify at least one enhancement to the platform that leads to improved reliability, throughput, or cost.
  • KPI: number of completed SRE team tickets closed with measured impact.

Skills Required (Learned And Applied Abilities):

  • Experience with the AWS or an equivalent computing stack.
  • Knowledge of container based and/or serverless compute environments (e.g. ECS).
  • Basic awareness of RDBMS workloads.
  • Ability to acquire new skills as necessary.
  • Basic understanding of at least one major programming language.
  • Understanding of asynchronous message processing pipelines and their failure modes.
  • Experience with conducting software releases/migrations.

Competencies Required And To Be Demonstrated (Traits, Attitudes, Behaviours):

  • Strong problem-solving and troubleshooting abilities.
  • Excellent communication and collaboration skills.
  • Ability to work effectively in a cross-functional team environment.
  • Adaptability and flexibility to work in a fast-paced, dynamic organization.
  • Attention to detail and a focus on delivering high-quality results.
  • Willingness to learn and stay updated on industry best practices and emerging technologies.

Qualifications And Experience Required:

  • 5 years of Experience with open-source technologies.
  • Working within a business managing stakeholders etc.
  • Education level required if any: Masters (or equivalent work experience) in an engineering/computer/software discipline.

Workplace Experience If Required (industry, Role Etc, Technology Space):

  • 5 years of Experience with open-source technologies.

What's In It For You:

  • Competitive Package, including generous and achievable uncapped commission.
  • Employee Share Options - Share in the success of Virtuoso.
  • A defined, transparent, career path to more senior roles.
  • Remote/flexible working.
  • Private health insurance.
  • Training/personal development budget of a minimum of £500 per year.
  • Take your birthday as a holiday every year!
  • Holiday allowance increases by one day per year of service up to 5 years.
  • Employee Referral Scheme - we put money in your pocket for referring awesome people!

Virtuoso was developed by a team passionate about improving the quality of low-code/no-code test automation software without slowing down the development process. As work shifts more to the cloud and teams work remotely, on-premise software has become unwieldy and a bottleneck. We've reimagined test automation software by pioneering the next generation of low-code/no-code testing - all on the cloud. We believe anyone can test, and we're delivering on the promise of low-code/no-code test automation.

Seniority level: Mid-Senior level

Employment type: Full-time

Job function: Engineering and Information Technology, Software Development

Site Reliability Engineer employer: Virtuoso QA

At Virtuoso QA, we pride ourselves on being an exceptional employer that champions a quality-first culture and fosters innovation in the software testing landscape. Our commitment to employee growth is evident through our transparent career paths, generous training budgets, and flexible remote working options, allowing you to thrive in a dynamic environment while contributing to groundbreaking advancements in test automation. Join us and enjoy unique benefits like private health insurance, a birthday holiday, and an employee referral scheme that rewards your network connections.
V

Contact Detail:

Virtuoso QA Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer

✨Tip Number 1

Network like a pro! Reach out to current employees at Virtuoso QA on LinkedIn or other platforms. Ask them about their experiences and any tips they might have for the interview process. This can give you insider knowledge and make you stand out.

✨Tip Number 2

Prepare for technical interviews by brushing up on your AWS and Terraform skills. Practice common SRE scenarios and incident response strategies. The more confident you are in your technical abilities, the better you'll perform during the interview.

✨Tip Number 3

Showcase your problem-solving skills! During interviews, be ready to discuss past incidents you've handled and how you resolved them. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your impact.

✨Tip Number 4

Don’t forget to apply through our website! It’s the quickest way to get your application noticed. Plus, it shows you're genuinely interested in joining the quality-first revolution at Virtuoso QA.

We think you need these skills to ace Site Reliability Engineer

AWS
Terraform
Java
Container-based environments
Serverless compute environments
RDBMS workloads
Asynchronous message processing
Incident response
Root cause analysis
Problem resolution
Collaboration skills
Adaptability
Attention to detail
Communication skills
Software release management

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter for the Site Reliability Engineer role. Highlight your experience with AWS, Terraform, and any relevant programming languages. We want to see how your skills align with our mission at Virtuoso!

Showcase Your Problem-Solving Skills: In your application, share examples of how you've tackled complex issues in past roles. We love candidates who can demonstrate strong troubleshooting abilities and a knack for improving system reliability. Let us know how you’ve made an impact!

Be Clear and Concise: When writing your application, keep it straightforward and to the point. Use clear language and avoid jargon unless necessary. We appreciate well-structured applications that are easy to read and understand.

Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it shows you’re keen on joining our quality-first revolution!

How to prepare for a job interview at Virtuoso QA

✨Know Your Tech Stack

Make sure you’re well-versed in AWS and Terraform, as these are crucial for the Site Reliability Engineer role. Brush up on your knowledge of container-based environments and be ready to discuss how you've used these technologies in past projects.

✨Showcase Problem-Solving Skills

Prepare examples of how you've tackled incidents in the past. Be ready to explain your thought process during root cause analysis and how you implemented solutions to prevent future issues. This will demonstrate your strong troubleshooting abilities.

✨Communicate Effectively

Since collaboration is key in this role, practice articulating your ideas clearly. Think about how you can convey complex technical concepts in a way that’s easy for non-technical team members to understand. Good communication can set you apart!

✨Emphasise Continuous Learning

Virtuoso values adaptability and a willingness to learn. Be prepared to discuss how you stay updated with industry trends and emerging technologies. Mention any recent courses or certifications you've completed that relate to the role.

Site Reliability Engineer
Virtuoso QA
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>