At a Glance
- Tasks: Lead the evaluation and improvement of chat quality for Elastic's AI platform.
- Company: Join Elastic, a leader in search AI, empowering businesses with real-time data insights.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Collaborative environment with mentorship opportunities and a focus on innovation.
- Why this job: Shape the future of conversational AI and make a tangible impact on user experiences.
- Qualifications: 8+ years in data science/ML with expertise in NLP and AI systems.
The predicted salary is between 80000 - 100000 £ per year.
Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI.
What is The Role
The Search Conversational Experiences team builds Elastic’s new conversational and agentic platform that lets customers chat with their own data in Elasticsearch. We build the core quality layer for RAG, agents and tools, retrieval and citations, streaming, memory, and the evaluation signals that turn open-ended questions into grounded, reliable answers. As a Principal Data Scientist, you will help set the technical direction for how we evaluate, improve, and scale chat quality across Elastic’s agentic platform. You will define the evaluation strategy that guides product decisions, including which models we standardize on, how we route requests across agents, which tools we enable and when, and how we tailor agents to different Elastic use cases in search and beyond. You will work closely with backend engineering, product, UX, and other data scientists to turn ambiguous, cutting-edge problems into measurable product improvements. You’ll help lead work on frontier problems such as folding RAG and vector search into an agent’s knowledge base, dynamically enriching model context to improve groundedness, shaping reasoning strategies and tool-selection policies, lighting up agent-driven visualizations on top of Elasticsearch data, and exploring multimodality where it can create meaningful user value. This is an applied leadership role: you will prototype, evaluate, influence roadmap direction, and help teams ship improvements that customers can feel.
What You Will Be Doing
- Define the evaluation strategy for conversational and agentic search, including offline and online evaluation, golden datasets, rubrics, LLM-as-judge calibration, groundedness and citation checks, and A/B testing.
- Lead the design of quality metrics and decision frameworks for RAG, agents, tools, model selection, agent routing, prompt behavior, and cost/latency trade-offs.
- Build, compare, and guide improvements across retrieval and re-ranking approaches, including sparse and dense retrieval, vector search, query understanding, semantic rewrites, and context enrichment.
- Turn experimental results into product and business decisions: which models to use, how to route requests efficiently, which tools should be exposed, and how agents should be customized for different Elastic use cases.
- Partner with engineering to productionize evaluation pipelines, telemetry, dashboards, CI guardrails, and regression detection for chat quality, helpfulness, dedication, latency, and cost.
- Influence the roadmap by identifying the highest-leverage quality gaps, proposing practical solutions, and communicating trade-offs clearly to product, engineering, and leadership.
- Mentor other data scientists and engineers in experiment design, evaluation methodology, statistical rigor, and practical approaches to improving LLM-powered systems.
- Share outcomes through clear docs, notebooks, PRs, dashboards, technical proposals, and cross-functional reviews.
What You Bring
- 8+ years of applied DS/ML experience, with deep expertise in IR, NLP, ranking, semantic search, RAG, or LLM-powered product experiences.
- Strong track record defining and leading evaluation for production AI/ML systems, including offline metrics, online experimentation, LLM-as-judge approaches, groundedness, citation quality, and model comparison.
- Experience influencing product and technical strategy through data, especially in ambiguous or emerging domains where the “right” metric or approach is not obvious at the start.
- Hands-on ability with Python, PyTorch/Transformers, Pandas, notebooks, reproducible experiments, versioned datasets, and clean, reviewable code.
- Strong understanding of retrieval systems, including dense and sparse retrieval, re-ranking, vector search, query understanding, and evaluation metrics such as nDCG, MRR, Recall@k, precision, and latency/cost trade-offs.
- Experience collaborating closely with engineering teams to move from prototype to production, including telemetry design, dashboards, CI guardrails, and quality regression tracking.
- Practical Elasticsearch experience, or experience with similar search and distributed data systems. ES|QL familiarity is a plus.
- Excellent written and verbal communication, with the ability to explain complex scientific and technical trade-offs to engineering, product, design, and leadership audiences.
- A collaborative, low-ego style and a strong ability to mentor, raise standards, and develop transparency for others in a distributed team.
Principal Data Scientist - Agent Builder employer: Elastic
Elastic is an exceptional employer that fosters a culture of innovation and collaboration, making it an ideal place for a Principal Data Scientist to thrive. With a commitment to employee growth, Elastic offers ample opportunities for professional development and mentorship, ensuring that team members can continuously enhance their skills in a dynamic environment. Located in a vibrant tech hub, employees benefit from a stimulating work atmosphere that encourages creativity and the pursuit of cutting-edge solutions in AI and search technology.
StudySmarter Expert Advice🤫
We think this is how you could land Principal Data Scientist - Agent Builder
✨Tip Number 1
Network like a pro! Reach out to folks in your industry on LinkedIn or at meetups. We can’t stress enough how important it is to make connections that could lead to job opportunities.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions. This gives potential employers a taste of what you can do, especially in data science and AI.
✨Tip Number 3
Prepare for interviews by practising common questions and scenarios related to data science. We recommend doing mock interviews with friends or using online platforms to get comfortable with the process.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are proactive about their job search.
We think you need these skills to ace Principal Data Scientist - Agent Builder
Some tips for your application 🫡
Tailor Your CV:Make sure your CV reflects the skills and experiences that align with the Principal Data Scientist role. Highlight your expertise in AI/ML, evaluation strategies, and any relevant projects you've worked on that showcase your ability to tackle complex problems.
Craft a Compelling Cover Letter:Use your cover letter to tell us why you're passionate about the role and how your background makes you a perfect fit. Be specific about your experience with conversational AI and how you can contribute to our team at Elastic.
Showcase Your Technical Skills:Don’t forget to mention your hands-on experience with Python, PyTorch, and any other relevant tools. We want to see how you’ve applied these skills in real-world scenarios, especially in relation to retrieval systems and evaluation metrics.
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about our company culture and values!
How to prepare for a job interview at Elastic
✨Know Your Stuff
Make sure you brush up on your knowledge of data science, machine learning, and specifically the areas mentioned in the job description like IR, NLP, and RAG. Be ready to discuss your past experiences and how they relate to the role at Elastic.
✨Prepare for Technical Questions
Expect to dive deep into technical discussions. Prepare to explain your approach to evaluation strategies, model selection, and how you've tackled similar challenges in previous roles. Practise articulating your thought process clearly.
✨Show Your Collaborative Spirit
Elastic values collaboration, so be prepared to share examples of how you've worked with cross-functional teams. Highlight your mentoring experiences and how you've helped others grow in their roles, as this will resonate well with the interviewers.
✨Communicate Clearly
Since you'll need to explain complex concepts to various stakeholders, practise simplifying your explanations. Use clear, concise language when discussing your past projects and the outcomes, ensuring you can convey your ideas effectively.