At a Glance
- Tasks: Lead a team to manage and optimise systems at scale in a global ecosystem.
- Company: The Trade Desk, a top-rated tech company focused on creating a better internet.
- Benefits: Inclusive culture, training opportunities, and the chance to work with cutting-edge technology.
- Why this job: Make a real impact by solving complex problems and shaping the future of infrastructure.
- Qualifications: Experience with Linux and a passion for learning; no prior experience required.
- Other info: Join a diverse team committed to innovation and collaboration.
The predicted salary is between 36000 - 60000 £ per year.
The Trade Desk is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising. Handling over 1 trillion queries per day, our platform operates at an unprecedented scale. We value the unique experiences and perspectives that each person brings to The Trade Desk, and we are committed to fostering inclusive spaces where everyone can bring their authentic selves to work every day.
We are looking to hire a Lead Systems Reliability Engineer to join our engineering team to continue building and maintaining our data-driven platform. We leverage technologies like Aerospike, MongoDB, and Kafka to perform many real-time activities, translating to a p99 latency under 1 millisecond on the back end.
What makes this role different:
- First in the Industry: The Trade Desk is the first company to run over 5MM QPS to NVMe in Aerospike on a single node, forcing core software redesigns to achieve this scale.
- Work on Cutting-Edge Hardware: Design clusters with nodes featuring 300TB of NVMe, 3TB RAM, and 512 cores, delivering a global 2,500GB/s throughput directly from flash.
- Shape the Future of Infrastructure: Spec your own systems and collaborate directly with AMD and NoSQL vendors to run PoCs and optimize bleeding-edge technology for internet-scale workloads.
- Deep Performance Engineering: Dive into kernel, hardware, and system interactions, leveraging tools like flamegraphs, NUMA counters, BIOS tuning, and synthetic testing to achieve world-class performance.
- Push Hardware Endurance Limits: Build clusters engineered to withstand over 1 zettabyte of endurance.
What you’ll do:
- Lead a team to influence, manage, and plan work streams, systems, and data structures at scale within a global ecosystem, spanning multiple infrastructure providers (cloud and traditional datacenters).
- Encourage, improve, and build infrastructure automation in a way that works with stateful systems at scale.
- Own operations for Linux-based systems running Aerospike, Kafka, and Mongo.
- Serve as a point of contact to review new use cases, answer questions, and participate in on-call rotation.
- Learn to be a NoSQL SME. You do not need experience to apply – we will train you.
- Benchmark and analyze next generation hardware offerings.
Who you are:
- Skills And Experience:
- Linux operating system
- Leadership experience and ability to mentor
- Troubleshooting techniques for isolation, scientific method
- Identify bottlenecks (Is it CPU? IO?)
- Nice-To-Have experience:
- Physical hardware (on-prem) internals, management, and operation
- Performing testing and tuning
- Databases (relational or NoSQL)
- Ansible/PyInfra/Chef
- Prometheus
- Kubernetes
- Python/Ruby/Rust/Bash/Golang/C#
- Empathetic, Objective, Critical Thinker: Thinking beyond the task at hand to deeply understand the 'why' behind an objective. A welcoming of ideas, and understanding of, perspectives that are different from your own and an interest in seeking and building from a common ground. You are a creative thinker, not bound by "the way things have always been done" but are thinking of the questions nobody has thought of and are "yet to be asked". What you know is less important than how well you learn, innovate, collaborate, and adapt. As a global team from many diverse backgrounds, experiences, and perspectives, you value and seek out paths for fostering diversity.
Lead Systems Reliability Engineer (Linux & Distributed Systems) employer: The Trade Desk
Contact Detail:
The Trade Desk Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Lead Systems Reliability Engineer (Linux & Distributed Systems)
✨Tip Number 1
Network like a pro! Reach out to current employees at The Trade Desk on LinkedIn or other platforms. Ask them about their experiences and any tips they might have for landing the Lead Systems Reliability Engineer role.
✨Tip Number 2
Prepare for technical interviews by brushing up on your Linux and distributed systems knowledge. Practice troubleshooting scenarios and be ready to discuss how you would handle real-world problems at scale.
✨Tip Number 3
Show off your passion for innovation! During interviews, share examples of how you've pushed boundaries in previous roles or projects. Highlight your creative thinking and willingness to explore new ideas.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining our team at The Trade Desk.
We think you need these skills to ace Lead Systems Reliability Engineer (Linux & Distributed Systems)
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter for the Lead Systems Reliability Engineer role. Highlight your experience with Linux, distributed systems, and any relevant technologies like Aerospike or Kafka. We want to see how your unique skills fit into our mission!
Showcase Your Problem-Solving Skills: In your application, share examples of how you've tackled complex problems in the past. Whether it's tuning performance or troubleshooting issues, we love to see how you approach challenges and what innovative solutions you've come up with.
Be Authentic: We value diverse perspectives, so don’t be afraid to let your personality shine through in your application. Share your passion for technology and how you can contribute to building a better media ecosystem. Authenticity goes a long way with us!
Apply Through Our Website: For the best chance of getting noticed, make sure to apply directly through our website. This helps us keep track of applications and ensures you’re considered for the role. Plus, it’s super easy to do!
How to prepare for a job interview at The Trade Desk
✨Know Your Tech Stack
Familiarise yourself with the technologies mentioned in the job description, like Aerospike, Kafka, and MongoDB. Be ready to discuss how you've used similar tools in past projects or how you would approach learning them.
✨Showcase Problem-Solving Skills
Prepare examples of how you've tackled complex problems, especially in distributed systems or Linux environments. Use the STAR method (Situation, Task, Action, Result) to structure your responses and highlight your critical thinking.
✨Demonstrate Leadership and Mentorship
Since this role involves leading a team, think of instances where you've mentored others or led projects. Be prepared to discuss your leadership style and how you encourage collaboration and innovation within a team.
✨Ask Insightful Questions
Prepare thoughtful questions about the company's culture, the team you'll be working with, and the challenges they face. This shows your genuine interest in the role and helps you assess if it's the right fit for you.