Lead Staff Systems Reliability Engineer (Linux & Distributed Systems)

Lead Staff Systems Reliability Engineer (Linux & Distributed Systems)

Full-Time 80000 - 100000 € / year (est.) Home office (partial)
The Trade Desk

At a Glance

  • Tasks: Lead a team to manage and optimise systems at scale in a global ecosystem.
  • Company: The Trade Desk, a top-rated tech company focused on innovative advertising solutions.
  • Benefits: Inclusive culture, training opportunities, and the chance to work with cutting-edge technology.
  • Other info: Join a diverse team committed to fostering an inclusive environment.
  • Why this job: Make a real impact by solving complex problems and shaping the future of infrastructure.
  • Qualifications: Experience with Linux and a passion for learning; no prior experience required.

The predicted salary is between 80000 - 100000 € per year.

The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. Handling over 1 trillion queries per day, our platform operates at an unprecedented scale. We value the unique experiences and perspectives that each person brings to The Trade Desk, and we are committed to fostering inclusive spaces where everyone can bring their authentic selves to work every day.

We are looking to hire a Lead Systems Reliability Engineer to join our engineering team to continue building and maintaining our data-driven platform. We leverage technologies like Aerospike, MongoDB, and Kafka to perform many real-time activities, translating to a p99 latency under 1 millisecond on the back end.

What makes this role different:

  • First in the Industry: The Trade Desk is the first company to run over 5MM QPS to NVMe in Aerospike on a single node, forcing core software redesigns to achieve this scale.
  • Work on Cutting-Edge Hardware: Design clusters with nodes featuring 300TB of NVMe, 3TB RAM, and 512 cores, delivering a global 2,500GB/s throughput directly from flash.
  • Shape the Future of Infrastructure: Spec your own systems and collaborate directly with AMD and NoSQL vendors to run PoCs and optimize bleeding-edge technology for internet-scale workloads.
  • Deep Performance Engineering: Dive into kernel, hardware, and system interactions, leveraging tools like flamegraphs, NUMA counters, BIOS tuning, and synthetic testing to achieve world-class performance.
  • Push Hardware Endurance Limits: Build clusters engineered to withstand over 1 zettabyte of endurance.

What you’ll do:

  • Lead a team to influence, manage, and plan work streams, systems, and data structures at scale within a global ecosystem, spanning multiple infrastructure providers (cloud and traditional datacenters).
  • Encourage, improve, and build infrastructure automation in a way that works with stateful systems at scale.
  • Own operations for Linux-based systems running Aerospike, Kafka, and Mongo.
  • Serve as a point of contact to review new use cases, answer questions, and participate in on-call rotation.
  • Learn to be a NoSQL SME. You do not need experience to apply – we will train you.
  • Benchmark and analyze next generation hardware offerings.

Who you are:

  • Skills And Experience:
  • Linux operating system
  • Leadership experience and ability to mentor
  • Troubleshooting techniques for isolation, scientific method
  • Identify bottlenecks (Is it CPU? IO?)
  • Nice-To-Have experience:
  • Physical hardware (on-prem) internals, management, and operation
  • Performing testing and tuning
  • Databases (relational or NoSQL)
  • Ansible/PyInfra/Chef
  • Prometheus
  • Kubernetes
  • Python/Ruby/Rust/Bash/Golang/C#
  • Empathetic, Objective, Critical Thinker: Thinking beyond the task at hand to deeply understand the 'why' behind an objective. A welcoming of ideas, and understanding of, perspectives that are different from your own and an interest in seeking and building from a common ground.
  • You are a creative thinker, not bound by "the way things have always been done" but are thinking of the questions nobody has thought of and are "yet to be asked".
  • What you know is less important than how well you learn, innovate, collaborate, and adapt.
  • As a global team from many diverse backgrounds, experiences, and perspectives, you value and seek out paths for fostering diversity.

The Trade Desk is an equal opportunity employer. All aspects of employment will be based on merit, competence, performance, and business needs. We do not discriminate on the basis of race, colour, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, veteran status, or any other status protected under federal, state, or local law.

As an Equal Opportunity Employer, The Trade Desk is committed to creating an inclusive hiring experience where everyone has the opportunity to thrive. Please reach out to us at accommodations@thetradedesk.com to request an accommodation or discuss any accessibility needs you may require to access our Company Website or navigate any part of the hiring process.

Lead Staff Systems Reliability Engineer (Linux & Distributed Systems) employer: The Trade Desk

The Trade Desk is an exceptional employer that champions innovation and inclusivity, making it a fantastic place for a Lead Staff Systems Reliability Engineer to thrive. With a commitment to employee growth, cutting-edge technology, and a collaborative work culture, team members are empowered to tackle complex challenges while shaping the future of infrastructure. Located in a dynamic environment, The Trade Desk offers unique opportunities to work with industry-leading hardware and engage in meaningful projects that drive the evolution of digital advertising.

The Trade Desk

Contact Detail:

The Trade Desk Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Lead Staff Systems Reliability Engineer (Linux & Distributed Systems)

Tip Number 1

Network like a pro! Reach out to current employees at The Trade Desk on LinkedIn or other platforms. Ask them about their experiences and any tips they might have for landing a role like the Lead Systems Reliability Engineer.

Tip Number 2

Prepare for technical interviews by brushing up on your Linux skills and distributed systems knowledge. Practice troubleshooting scenarios and be ready to discuss how you would handle real-world problems at scale.

Tip Number 3

Showcase your passion for innovation! During interviews, share examples of how you've approached complex problems creatively. Highlight any experience with cutting-edge technologies or automation that aligns with what The Trade Desk is doing.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in being part of our team at The Trade Desk.

We think you need these skills to ace Lead Staff Systems Reliability Engineer (Linux & Distributed Systems)

Linux Operating System
Leadership Experience
Mentoring
Troubleshooting Techniques
Performance Testing
Automation
NoSQL Databases

Some tips for your application 🫡

Show Your Passion:When writing your application, let your enthusiasm for solving complex problems shine through. We want to see that you're genuinely excited about the role and the impact you can make at The Trade Desk.

Tailor Your Experience:Make sure to highlight your relevant skills and experiences that align with the job description. We love seeing how your background in Linux, distributed systems, or performance engineering can contribute to our team.

Be Authentic:We value diverse perspectives, so don’t be afraid to show your true self in your application. Share your unique experiences and how they shape your approach to problem-solving and teamwork.

Apply Through Our Website:For the best chance of success, make sure to submit your application through our website. This helps us keep everything organised and ensures your application gets the attention it deserves!

How to prepare for a job interview at The Trade Desk

Know Your Tech Inside Out

Make sure you’re well-versed in the technologies mentioned in the job description, like Linux, Aerospike, and Kafka. Brush up on your knowledge of distributed systems and be ready to discuss how you've used these technologies in past projects.

Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled complex problems at scale. Think about times when you identified bottlenecks or improved performance, and be ready to explain your thought process and the outcomes.

Demonstrate Leadership and Mentorship

Since this role involves leading a team, be prepared to discuss your leadership style and experiences. Share examples of how you've mentored others or influenced a team’s direction, highlighting your ability to foster collaboration and inclusivity.

Ask Insightful Questions

Prepare thoughtful questions that show your interest in the company and the role. Inquire about their approach to infrastructure automation or how they handle performance testing. This not only demonstrates your enthusiasm but also helps you gauge if the company is the right fit for you.