Operations Site Reliability Engineer

Bristol | Full-Time | £48,000 - £72,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Join a team ensuring the performance and availability of production services.
  • Company: Broadcom is a leading technology company focused on innovation and excellence.
  • Benefits: Enjoy a competitive salary, bonus scheme, equity package, and private medical insurance.
  • Why this job: Be part of a dynamic team, tackle real-world challenges, and enhance your tech skills.
  • Qualifications: Degree in Systems Engineering or Computer Science with 5+ years of Linux experience required.
  • Other info: Weekend and holiday on-call support may be needed; equal opportunity employer.

The predicted salary is between £48,000 and £72,000 per year.

Location: United Kingdom-Bristol-Almondsbury-Hempton Court
Time type: Full time
Posted: 30+ days ago
Job requisition ID: R022662

Please Note:

1. If you are a first-time user, please create your candidate login account before you apply for a job. (Click Sign In > Create Account)

2. If you already have a Candidate Account, please Sign-In before you apply.

Job Description:

The primary responsibilities include:

· Form part of a critical operations function that is responsible for the monitoring, availability and performance of production services.

· Respond to stakeholder requests within agreed timescales or SLOs.

· Drive automation to reduce failures and manual tasks, thereby improving overall application performance and availability.

· Perform systems administration activities to ensure the smooth operation of applications across multiple platforms

· Coordinate and communicate with impacted stakeholders as per incident management process.

· Demonstrate ownership of events and incidents through to restoration

· Perform daily shift handovers to peers and management across multiple geographies.

· Support maintenance activities which impact production applications.

· Support critical systems that handle sensitive and proprietary data

· Create, maintain and update work instructions for troubleshooting and supporting applications.

· Contribute to the planning of application/infrastructure releases and configuration changes

· Provide input to administering and maintaining all production environments

· Patch and upgrade existing applications

· Provide feedback and coaching to upstream teams (both internal and vendors) to reduce escalations and to continually improve overall experience for customers.

Professional Experience Required

  • A degree in Systems Engineering, Computer Science or related fields with related experience preferred
  • 5+ years of experience administering Linux systems
  • Strong hands-on experience with multiple Linux distributions
  • 2+ years of operational experience working with Amazon Web Services or Google Cloud Platform
  • Experience of working with an automation platform to automate repetitive actions that reduce manual effort
  • Familiarity with deployment tools such as Ansible Tower and Jenkins
  • Experience in carrying out large deployments to global infrastructure
  • Proficient with orchestration/configuration tools such as Ansible and Terraform
  • Strong working knowledge of networking and packet tracing, with an understanding of latency and throughput sufficient to pinpoint and resolve application issues.
  • Thorough knowledge of HTTP(S), SMTP, TLS/SSL, DNS, LDAP, Kubernetes and Docker containers
  • Experience of system/application administration in distributed, customer-facing, high-availability and large-scale environments
  • Experienced and confident in at least one scripting language such as Perl, shell, Ruby or Python.
  • Experience of tuning and optimising monitoring systems.

Personal Experience Required

  • A strong team player with the ability to grasp new technologies, adapt to change in methodologies, with a focus on delivery
  • Extensive troubleshooting and problem-solving skills with respect to application technologies
  • Ability to remain calm and work well under pressure
  • A keen interest and desire to work within the security arena
  • Ability to communicate effectively at all levels up to senior management.

Benefits

  • Highly competitive salary
  • Generous bonus scheme
  • Equity package
  • Competitive company pension
  • Employee stock purchase plan (ESPP)
  • Private Medical Insurance (Individual or family)
  • Life Assurance scheme (up to 4x salary)
  • Ample on-site parking.

This role requires participation in weekend and holiday on-call support as and when required.

Broadcom is proud to be an equal opportunity employer. We will consider qualified applicants without regard to race, color, creed, religion, sex, sexual orientation, gender identity, national origin, citizenship, disability status, medical condition, pregnancy, protected veteran status or any other characteristic protected by federal, state, or local law. We will also consider qualified applicants with arrest and conviction records consistent with local law.

If you are located outside the USA, please be sure to fill out a home address, as this will be used for future correspondence.

Operations Site Reliability Engineer employer: Broadcom Inc.

At Broadcom, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration. Located in the vibrant area of Almondsbury, Bristol, our Operations Site Reliability Engineer role provides ample opportunities for professional growth, competitive benefits including a generous bonus scheme and equity package, and a commitment to employee well-being through private medical insurance and life assurance. Join us to be part of a team that values your contributions and supports your career aspirations in a high-availability environment.

Contact Detail:

Broadcom Inc. Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Operations Site Reliability Engineer

✨Tip Number 1

Familiarise yourself with the specific technologies mentioned in the job description, such as AWS, Google Cloud Platform, and automation tools like Ansible and Jenkins. Having hands-on experience or projects that showcase your skills with these technologies can set you apart.

✨Tip Number 2

Network with current or former employees of Broadcom or similar companies. Engaging with them on platforms like LinkedIn can provide you with insights into the company culture and expectations, which can be invaluable during interviews.

✨Tip Number 3

Prepare to discuss real-world scenarios where you've successfully resolved incidents or improved system performance. Being able to articulate your problem-solving process and the impact of your actions will demonstrate your capability for the role.

✨Tip Number 4

Showcase your teamwork and communication skills by preparing examples of how you've collaborated with cross-functional teams. This is crucial for the Operations Site Reliability Engineer role, as effective communication with stakeholders is key to success.

We think you need these skills to ace Operations Site Reliability Engineer

Linux System Administration
Amazon Web Services (AWS)
Google Cloud Platform (GCP)
Automation Tools (e.g., Ansible, Jenkins)
Orchestration/Configuration Management (e.g., Ansible, Terraform)
Networking Knowledge
Application Performance Monitoring
Scripting Languages (e.g., Perl, Shell, Ruby, Python)
Incident Management
Troubleshooting Skills
Communication Skills
Team Collaboration
Adaptability to New Technologies
High-Availability Systems Management

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in systems administration, cloud platforms, and automation tools. Use keywords from the job description to demonstrate your fit for the Operations Site Reliability Engineer role.

Craft a Strong Cover Letter: Write a cover letter that showcases your problem-solving skills and ability to work under pressure. Mention specific examples of how you've contributed to application performance and availability in previous roles.

Highlight Technical Skills: Clearly list your technical skills related to Linux systems, AWS or Google Cloud, and scripting languages. Provide examples of how you've used these skills in real-world scenarios to improve operations.

Showcase Team Collaboration: Emphasise your experience as a team player. Include instances where you effectively communicated with stakeholders or contributed to team projects, especially in high-pressure situations.

How to prepare for a job interview at Broadcom Inc.

✨Showcase Your Technical Skills

Be prepared to discuss your hands-on experience with Linux systems and cloud platforms like AWS or Google Cloud. Highlight specific projects where you automated tasks or improved application performance, as this aligns closely with the role's responsibilities.

✨Demonstrate Problem-Solving Abilities

Expect scenario-based questions that assess your troubleshooting skills. Share examples of how you've resolved incidents under pressure, particularly in high-availability environments, to showcase your ability to remain calm and effective.

✨Communicate Clearly and Effectively

Since the role involves coordinating with stakeholders, practice articulating complex technical concepts in simple terms. This will demonstrate your ability to communicate effectively at all levels, which is crucial for the position.

✨Prepare for Cultural Fit Questions

Research the company's values and culture. Be ready to discuss how you work within a team and adapt to new technologies, as well as your interest in security, which is a key aspect of the role.
