Lead Engineer Support Linux Engineer

Lead Engineer Support Linux Engineer

Full-Time 70000 - 90000 € / year (est.) No home office possible
graphcore

At a Glance

  • Tasks: Lead a team to support Linux systems in a fast-paced AI environment.
  • Company: Join Graphcore, a leader in AI compute technology backed by SoftBank.
  • Benefits: Flexible work culture, competitive salary, and opportunities for professional growth.
  • Other info: Collaborative team culture with exciting projects ahead.
  • Why this job: Make a real impact on AI technology while solving complex challenges.
  • Qualifications: Extensive Linux experience, strong troubleshooting skills, and leadership capabilities.

The predicted salary is between 70000 - 90000 € per year.

At Graphcore, we’re building the future of AI compute. We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacentre scale. As part of the SoftBank Group, backed by significant long‑term investment, we are delivering key technology into the fast‑growing SoftBank AI ecosystem. To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world. We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence.

Job Summary

We are looking for a highly experienced Lead Engineer Support Linux Engineer to guide and develop a small group supporting engineering systems in a fast‑paced AI‑centered environment. The position requires strong Linux skills combined with leadership, automation, and DevOps approaches to maintain systems that are reliable, scalable, and easy to support at scale. An important responsibility is developing and managing a configuration‑as‑code environment. In this setup, system configuration and operations are handled through automation, pipelines, and source control rather than manual intervention. You will be responsible for leading incident response, driving operational improvements, and setting standards for how Linux systems are managed and supported across the organization. While the role includes leadership responsibilities, it will initially require a hands‑on approach, including direct involvement in troubleshooting, system support, and automation efforts, while building team capability and scaling processes. Collaborating intimately with engineering groups, platform engineers, and infrastructure experts, you will guarantee systems stay stable, efficient, and consistent with changing business and product delivery requirements.

The Team

You’ll be joining a multi‑disciplinary team with strong technical skills and a very supportive culture. We work closely together, regularly share knowledge, and your skills will make a direct impact on our business. It’s an exciting and pivotal moment for us right now, with plenty of new projects ahead. If you're looking to solve interesting problems and see your work deliver real‑world results, this is the team for you!

Responsibilities and Duties

  • Guide, mentor, and cultivate a team of Linux Engineering Support Engineers, defining clear roles, responsibilities, and methods of collaboration.
  • Own and oversee support for Linux‑based systems and engineering environments, ensuring stability, performance, and availability.
  • Act as a point of contact for complex technical issues and outages, providing hands‑on support when a customer concern arises.
  • Diagnose and resolve high‑impact system and interoperability issues across mixed and distributed environments.
  • Perform hands‑on investigation and troubleshooting to understand issues and drive effective solutions.
  • Direct incident response efforts, encompassing triage, coordination, and resolution.
  • Take responsibility for and lead Root Cause Analysis (RCA) processes, ensuring preventative improvements are identified and applied.
  • Establish and improve incident management processes, driving operational maturity and reliability.
  • Drive adoption of automation and configuration‑as‑code practices across Linux systems.
  • Ensure system changes are delivered through controlled, auditable processes wherever possible.
  • Oversee development and implementation of automation solutions for system management and operational tasks.
  • Promote and support use of workflows based on Git and CI/CD pipelines for configuration and operational processes.
  • Identify and prioritize opportunities to reduce manual effort through automation and improved tooling.
  • Collaborate with engineering teams to assist development environments and system requirements.
  • Act as a senior technical liaison between engineering teams and infrastructure/platform functions.
  • Support onboarding of new systems, services, and environments using standardized and automated approaches.
  • Ensure system configurations stay consistent and aligned with established standards and governance.
  • Oversee integration points (e.g., identity, CI/CD, tooling) and ensure issues are resolved effectively.
  • Identify and drive improvements in system performance, scalability, and maintainability.
  • Contribute to and enforce documentation, standards, and operational guidelines.
  • Ensure systems meet audit, compliance, and governance requirements, with full traceability of changes.

Candidate Profile

Essential

  • Extensive experience managing and maintaining Linux‑based systems in complex technical or engineering environments.
  • Strong troubleshooting skills across operating systems, networking, storage, and application layers.
  • Demonstrated ability to identify and solve intricate technical problems, including within diverse or distributed settings.
  • Demonstrated experience managing significant incidents and outages, including directing resolution efforts and participating in Root Cause Analysis (RCA).
  • Extensive background in automation and scripting (e.g., Bash, Python, or similar).
  • Extensive background in configuration management or infrastructure‑as‑code tools (e.g., Ansible, Terraform, Puppet, or similar).
  • Experience working with configuration‑as‑code practices and workflows managed through Git.
  • Experience building, managing, or assisting with CI/CD pipelines for configuration and operational processes.
  • Strong understanding of system interoperability across distributed environments.
  • Experience working within defined standards, governance frameworks, and controlled processes.
  • Strong communication skills and ability to collaborate closely with engineering, platform, and infrastructure teams.
  • Experience mentoring or supporting the development of other engineers.
  • Capability to work efficiently across different time zones within a dispersed organization.
  • Demonstrated capability to work autonomously, establish goals, and achieve results.

Desirable

  • Experience managing or coordinating incident response activities.
  • Experience working alongside DevOps, platform, or infrastructure engineering teams.
  • Experience with monitoring, observability, and logging systems.
  • Experience supporting AI/ML or high‑performance computing environments.
  • Understanding of identity and access management concepts.
  • Experience building or scaling operational processes or support functions.
  • Experience managing and maintaining Linux‑based systems in a technical or engineering environment.

We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments.

Lead Engineer Support Linux Engineer employer: graphcore

At Graphcore, we pride ourselves on being an exceptional employer, fostering a collaborative and innovative work culture that empowers our employees to make a tangible impact in the rapidly evolving field of AI. With a strong commitment to professional development, we offer numerous growth opportunities and encourage a hands-on approach to problem-solving within a supportive team environment. Located in a vibrant tech hub, our employees benefit from significant long-term investment and the chance to work alongside some of the brightest minds in the industry, all while contributing to groundbreaking advancements in AI technology.

graphcore

Contact Detail:

graphcore Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Lead Engineer Support Linux Engineer

Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions. This is a great way to demonstrate your expertise in Linux systems and automation to potential employers.

Tip Number 3

Prepare for interviews by brushing up on common technical questions and scenarios related to Linux support and automation. Practice explaining your thought process and problem-solving approach, as this will help you stand out during the interview.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our awesome team at Graphcore.

We think you need these skills to ace Lead Engineer Support Linux Engineer

Linux System Administration
Troubleshooting Skills
Automation and Scripting (Bash, Python)
Configuration Management (Ansible, Terraform, Puppet)
CI/CD Pipeline Management
Incident Management
Root Cause Analysis (RCA)

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that match the Lead Engineer Support Linux Engineer role. Highlight your Linux expertise, automation skills, and any leadership experience you have. We want to see how you can contribute to our team!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to tell us why you're passionate about AI and how your background makes you a perfect fit for Graphcore. Be genuine and let your personality come through – we love seeing the real you!

Showcase Your Problem-Solving Skills:In your application, don’t just list your skills; give us examples of how you've tackled complex technical issues in the past. We’re looking for someone who can think on their feet and drive effective solutions, so share those success stories!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, you’ll find all the details about the role and our company culture there!

How to prepare for a job interview at graphcore

Know Your Linux Inside Out

Make sure you brush up on your Linux skills before the interview. Be prepared to discuss your experience managing and maintaining Linux-based systems, as well as any troubleshooting you've done in complex environments. They’ll want to see that you can handle intricate technical problems with ease.

Showcase Your Automation Skills

Since the role involves a lot of automation and configuration-as-code practices, be ready to talk about your experience with tools like Ansible or Terraform. Bring examples of how you've implemented automation solutions in the past, and how they improved system management and operational tasks.

Demonstrate Leadership and Team Collaboration

This position requires guiding and mentoring a team, so think of examples where you've led a group or collaborated closely with others. Highlight your ability to define roles and responsibilities, and how you’ve driven operational improvements in previous roles.

Prepare for Incident Management Scenarios

Expect questions around incident response and Root Cause Analysis (RCA). Prepare to discuss specific incidents you've managed, how you triaged issues, and what steps you took to ensure stability and prevent future occurrences. This will show your hands-on approach and leadership capabilities.