At a Glance
- Tasks: Join our Inference team to build and maintain systems serving millions of users worldwide.
- Company: Innovative tech company focused on AI and machine learning breakthroughs.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Encouraging diverse applicants; strong candidates may not meet every qualification.
- Why this job: Make a real impact in AI while tackling complex distributed systems challenges.
- Qualifications: Experience in software engineering, particularly with distributed systems and machine learning.
The predicted salary is between 70000 - 90000 € per year.
Requirements
- Have significant software engineering experience, particularly with distributed systems
- Are results-oriented, with a bias towards flexibility and impact
- Pick up slack, even if it goes outside your job description
- Want to learn more about machine learning systems and infrastructure
- Thrive in environments where technical excellence directly drives both business results and research breakthroughs
- Care about the societal impacts of your work
- (Desirable) High-performance, large-scale distributed systems
- (Desirable) Implementing and deploying machine learning systems at scale
- (Desirable) Load balancing, request routing, or traffic management systems
- (Desirable) LLM inference optimization, batching, and caching strategies
- (Desirable) Kubernetes and cloud infrastructure (AWS, GCP)
- (Desirable) Python or Rust
Education requirements:
We require at least a Bachelor's degree in a related field or equivalent experience. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work.
What the job involves
Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute-agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet-wide orchestration across diverse AI accelerators.
The team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, while enabling breakthrough research by giving our scientists the high-performance inference infrastructure they need to develop next-generation models. We tackle complex, distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms.
- Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators
- Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads
- Building production-grade deployment pipelines for releasing new models to millions of users
- Integrating new AI accelerator platforms to maintain our hardware-agnostic competitive advantage
- Contributing to new inference features (e.g., structured sampling, prompt caching)
- Supporting inference for new model architectures
- Analyzing observability data to tune performance based on real-world production workloads
- Managing multi-region deployments and geographic routing for global customers
Senior Software Engineer (Inference) in London employer: Deepstreamtech
As a Senior Software Engineer (Inference) at our company, you will join a dynamic team dedicated to pushing the boundaries of machine learning systems and infrastructure. We pride ourselves on fostering a collaborative work culture that values technical excellence and innovation, offering ample opportunities for professional growth and development. Located in a vibrant tech hub, we provide a stimulating environment where your contributions directly impact millions of users while also addressing the societal implications of our work.
StudySmarter Expert Advice🤫
We think this is how you could land Senior Software Engineer (Inference) in London
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to distributed systems or machine learning. This gives potential employers a taste of what you can do beyond your CV.
✨Tip Number 3
Prepare for technical interviews by brushing up on your coding skills and system design principles. Practice common interview questions and consider mock interviews with friends or mentors to build confidence.
✨Tip Number 4
Don’t hesitate to apply through our website! Even if you don’t tick every box, we value diverse experiences and perspectives. If you're passionate about the role, go for it – you might just surprise yourself!
We think you need these skills to ace Senior Software Engineer (Inference) in London
Some tips for your application 🫡
Show Off Your Experience:Make sure to highlight your significant software engineering experience, especially with distributed systems. We want to see how your past projects align with what we do at StudySmarter, so don’t hold back!
Be Results-Oriented:We love candidates who are results-oriented and flexible. In your application, share examples of how you've made an impact in previous roles. This will show us that you thrive in environments where technical excellence drives business results.
Don’t Sweat the Small Stuff:If you don’t meet every single qualification, don’t let that stop you! We encourage you to apply anyway. Just be honest about your skills and experiences, and let us know how you're eager to learn more about machine learning systems.
Apply Through Our Website:We recommend applying through our website for a smoother process. It’s the best way for us to keep track of your application and ensure it gets the attention it deserves. We can’t wait to hear from you!
How to prepare for a job interview at Deepstreamtech
✨Know Your Stuff
Make sure you brush up on your knowledge of distributed systems and machine learning infrastructure. Be ready to discuss your past experiences with high-performance systems, load balancing, and any relevant projects you've worked on. This will show that you're not just a fit for the role but also genuinely interested in the work.
✨Show Flexibility and Impact
During the interview, highlight instances where you've picked up slack or tackled challenges outside your usual responsibilities. Companies love candidates who are results-oriented and can adapt to changing needs, so share examples that demonstrate your flexibility and the impact of your contributions.
✨Ask Smart Questions
Prepare thoughtful questions about the team’s current projects, challenges they face, and how they measure success. This not only shows your interest in the role but also gives you insight into whether the company aligns with your values, especially regarding societal impacts.
✨Be Yourself
Don’t worry if you don’t meet every single qualification listed. Be honest about your skills and experiences, and express your eagerness to learn. Remember, companies appreciate authenticity and passion, so let your personality shine through during the conversation!