Staff Software Engineer, Inference Infrastructure
Staff Software Engineer, Inference Infrastructure

Staff Software Engineer, Inference Infrastructure

Full-Time 36000 - 60000 £ / year (est.) Home office (partial)
Go Premium
C

At a Glance

  • Tasks: Build high-performance AI systems and deploy cutting-edge NLP models.
  • Company: Join a pioneering tech company focused on scaling intelligence for humanity.
  • Benefits: Enjoy competitive pay, health perks, remote flexibility, and generous vacation time.
  • Why this job: Make a real impact in the AI space while working with innovative technologies.
  • Qualifications: 5+ years in engineering, experience with Kubernetes, and strong collaboration skills.
  • Other info: Inclusive culture with excellent career growth and personal enrichment opportunities.

The predicted salary is between 36000 - 60000 £ per year.

Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

Why this role? Are you energized by building high-performance, scalable and reliable machine learning systems? Do you want to help define and build the next generation of AI platforms powering advanced NLP applications? We are looking for Members of Technical Staff to join the Model Serving team at Cohere. The team is responsible for developing, deploying, and operating the AI platform delivering Cohere's large language models through easy to use API endpoints. In this role, you will work closely with many teams to deploy optimized NLP models to production in low latency, high throughput, and high availability environments. You will also get the opportunity to interface with customers and create customized deployments to meet their specific needs.

You May Be a Good Fit If You Have:

  • 5+ years of engineering experience running production infrastructure at a large scale
  • Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters
  • Experience with Kubernetes dev and production coding and support
  • Experience with GCP, Azure, AWS, OCI, multi-cloud on-prem / hybrid serving
  • Experience in designing, deploying, supporting, and troubleshooting in complex Linux-based computing environments
  • Experience in compute/storage/network resource and cost management
  • Excellent collaboration and troubleshooting skills to build mission-critical systems, and ensure smooth operations and efficient teamwork
  • The grit and adaptability to solve complex technical challenges that evolve day to day
  • Familiarity with computational characteristics of accelerators (GPUs, TPUs, and/or custom accelerators), especially how they influence latency and throughput of inference
  • Strong understanding or working experience with distributed systems
  • Experience in Golang, C++ or other languages designed for high-performance scalable servers

If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees At Cohere Enjoy These Perks:

  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote‑flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co‑working stipend
  • 6 weeks of vacation (30 working days!)

Staff Software Engineer, Inference Infrastructure employer: Cohere

Cohere is an exceptional employer that fosters an open and inclusive culture, making it a fantastic place for Staff Software Engineers to thrive. With a focus on cutting-edge AI research, employees benefit from generous perks such as a weekly lunch stipend, comprehensive health and dental benefits, and a remarkable six weeks of vacation. The company prioritises personal growth and well-being, offering enrichment benefits and a flexible remote work environment across major cities like Toronto, New York, and London.
C

Contact Detail:

Cohere Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Staff Software Engineer, Inference Infrastructure

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those at Cohere. A friendly chat can open doors and give you insights that a job description just can't.

✨Tip Number 2

Show off your skills! If you've got a project or two that showcases your experience with Kubernetes or distributed systems, make sure to highlight them in conversations. Real-world examples speak volumes!

✨Tip Number 3

Prepare for technical interviews by brushing up on your problem-solving skills. Practice coding challenges related to high-performance systems and be ready to discuss your thought process. We love seeing how you tackle complex challenges!

✨Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in joining our mission to scale intelligence for humanity.

We think you need these skills to ace Staff Software Engineer, Inference Infrastructure

High-performance machine learning systems
Scalable and reliable infrastructure
Distributed systems design
Kubernetes
GPU workloads
GCP
Azure
AWS
OCI
Linux-based computing environments
Compute/storage/network resource management
Collaboration skills
Troubleshooting skills
Golang
C++

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the role. Highlight your experience with distributed systems, Kubernetes, and any relevant cloud platforms. We want to see how you can contribute to our mission!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for AI and how your background makes you a great fit for the Staff Software Engineer role. Don’t forget to mention specific projects or achievements that showcase your expertise.

Showcase Your Problem-Solving Skills: In your application, give examples of complex technical challenges you've tackled in the past. We love seeing how you approach problems and what solutions you've implemented, especially in high-performance environments.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team at Cohere!

How to prepare for a job interview at Cohere

✨Know Your Tech Inside Out

Make sure you’re well-versed in the technologies mentioned in the job description, especially Kubernetes and cloud platforms like GCP, AWS, or Azure. Brush up on your experience with distributed systems and be ready to discuss specific projects where you’ve implemented these technologies.

✨Showcase Your Problem-Solving Skills

Prepare to share examples of complex technical challenges you've faced and how you tackled them. Highlight your adaptability and grit, as these qualities are crucial for the role. Use the STAR method (Situation, Task, Action, Result) to structure your responses.

✨Understand the Company’s Mission

Familiarise yourself with Cohere's mission to scale intelligence for humanity. Be prepared to discuss how your skills and experiences align with their goals, particularly in building high-performance AI systems. This shows that you’re not just looking for a job, but are genuinely interested in contributing to their vision.

✨Ask Insightful Questions

Prepare thoughtful questions about the team dynamics, the challenges they face, and how success is measured in this role. This not only demonstrates your interest but also helps you gauge if the company culture and expectations align with your career goals.

Staff Software Engineer, Inference Infrastructure
Cohere
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

C
  • Staff Software Engineer, Inference Infrastructure

    Full-Time
    36000 - 60000 £ / year (est.)
  • C

    Cohere

    50-100
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>