Staff Software Engineer, Inference Infrastructure in London
Staff Software Engineer, Inference Infrastructure

Staff Software Engineer, Inference Infrastructure in London

London Full-Time 48000 - 72000 ÂŁ / year (est.) No home office possible
Go Premium
C

At a Glance

  • Tasks: Build high-performance AI systems and deploy cutting-edge NLP models.
  • Company: Join a pioneering tech company shaping the future of AI.
  • Benefits: Enjoy flexible remote work, generous vacation, and comprehensive health benefits.
  • Why this job: Make a real impact in AI while working with innovative technologies.
  • Qualifications: 5+ years in engineering, experience with Kubernetes and cloud platforms.
  • Other info: Inclusive culture with opportunities for personal and professional growth.

The predicted salary is between 48000 - 72000 ÂŁ per year.

Who are we? Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

Why this role? Are you energized by building high-performance, scalable and reliable machine learning systems? Do you want to help define and build the next generation of AI platforms powering advanced NLP applications? We are looking for Members of Technical Staff to join the Model Serving team at Cohere. The team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints. In this role, you will work closely with many teams to deploy optimized NLP models to production in low latency, high throughput, and high availability environments. You will also get the opportunity to interface with customers and create customized deployments to meet their specific needs.

You May Be a Good Fit If You Have:

  • 5+ years of engineering experience running production infrastructure at a large scale
  • Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters
  • Experience with Kubernetes dev and production coding and support
  • Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving
  • Experience in designing, deploying, supporting, and troubleshooting in complex Linux-based computing environments
  • Experience in compute/storage/network resource and cost management
  • Excellent collaboration and troubleshooting skills to build mission-critical systems, and ensure smooth operations and efficient teamwork
  • The grit and adaptability to solve complex technical challenges that evolve day to day
  • Familiarity with computational characteristics of accelerators (GPUs, TPUs, and/or custom accelerators), especially how they influence latency and throughput of inference
  • Strong understanding or working experience with distributed systems
  • Experience in Golang, C++ or other languages designed for high-performance scalable servers

If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees At Cohere Enjoy These Perks:

  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote‐flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co‐working stipend
  • 6 weeks of vacation (30 working days!)

Staff Software Engineer, Inference Infrastructure in London employer: Cohere

Cohere is an exceptional employer that fosters an open and inclusive culture, making it a fantastic place for Staff Software Engineers to thrive. With a focus on cutting-edge AI research, employees enjoy generous benefits such as a weekly lunch stipend, full health and dental coverage, and a remarkable six weeks of vacation. The company also prioritises personal growth with enrichment benefits and offers flexible remote work options across major cities like Toronto, New York, and London, ensuring a balanced and rewarding work experience.
C

Contact Detail:

Cohere Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Staff Software Engineer, Inference Infrastructure in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with current employees at Cohere. A friendly chat can sometimes lead to opportunities that aren’t even advertised!

✨Tip Number 2

Show off your skills! If you’ve got a GitHub or portfolio showcasing your projects, make sure to highlight them during interviews. It’s a great way to demonstrate your experience with high-performance systems and distributed architectures.

✨Tip Number 3

Prepare for technical interviews by brushing up on your coding skills and system design knowledge. Practice common algorithms and system architecture questions, especially those related to Kubernetes and cloud services.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our mission to scale intelligence for humanity.

We think you need these skills to ace Staff Software Engineer, Inference Infrastructure in London

Machine Learning Systems
NLP Applications
Kubernetes
GPU Workloads
GCP
Azure
AWS
OCI
Linux-based Computing Environments
Resource Management
Collaboration Skills
Troubleshooting Skills
Adaptability
Distributed Systems
Golang
C++

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the role. Highlight your experience with distributed systems, Kubernetes, and any relevant cloud platforms. We want to see how you can contribute to our mission!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for AI and how your background makes you a great fit for the Staff Software Engineer position. Let us know why you're excited about building high-performance machine learning systems.

Showcase Your Projects: If you've worked on any relevant projects, make sure to include them! Whether it's deploying models or optimising infrastructure, we love seeing real-world applications of your skills. It helps us understand your hands-on experience.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you're serious about joining our team at StudySmarter!

How to prepare for a job interview at Cohere

✨Know Your Tech Inside Out

Make sure you brush up on your knowledge of distributed systems, Kubernetes, and the specific cloud platforms mentioned in the job description. Be ready to discuss your past experiences with these technologies and how they relate to building scalable AI systems.

✨Showcase Your Problem-Solving Skills

Prepare to share examples of complex technical challenges you've faced in previous roles. Highlight how you approached these problems, the solutions you implemented, and the outcomes. This will demonstrate your grit and adaptability, which are key for this role.

✨Understand the Business Impact

Familiarise yourself with how AI platforms can transform businesses. Be prepared to discuss how your work can contribute to the mission of scaling intelligence to serve humanity, and how you can help deliver high-performance models that meet customer needs.

✨Practice Collaboration Scenarios

Since this role involves working closely with various teams, think of examples where you've successfully collaborated in the past. Be ready to discuss how you handle teamwork, communication, and troubleshooting in a fast-paced environment.

Staff Software Engineer, Inference Infrastructure in London
Cohere
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

C
  • Staff Software Engineer, Inference Infrastructure in London

    London
    Full-Time
    48000 - 72000 ÂŁ / year (est.)
  • C

    Cohere

    50-100
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>