Generative AI Data Scientist (LLMs & Synthetic Data)

Generative AI Data Scientist (LLMs & Synthetic Data)

Full-Time 60000 - 80000 £ / year (est.) Home office (partial)
Cint

At a Glance

  • Tasks: Develop next-gen AI solutions and collaborate on innovative data science projects.
  • Company: Cint, a pioneer in research technology with a global reach.
  • Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
  • Other info: Dynamic environment with a focus on innovation and career advancement.
  • Why this job: Join a cutting-edge team and make an impact in the world of Generative AI.
  • Qualifications: 2-4 years in Data Science, strong skills in Python and LLMs.

The predicted salary is between 60000 - 80000 £ per year.

About Cint

Cint is a pioneer in research technology (ResTech). Our customers use the Cint platform to post questions and get answers from real people to build business strategies, confidently publish research, accurately measure the impact of digital advertising, and more. The Cint platform is built on a programmatic marketplace, which is the world’s largest, with nearly 300 million respondents in over 150 countries who consent to sharing their opinions, motivations, and behaviours.

As a Data Scientist at Cint, you will play a pivotal role in developing next‑generation AI solutions that power our product portfolio. Collaborating closely with Product and Engineering teams, you will bridge the gap between traditional research data and synthetic intelligence. You will focus on the research, validation, and delivery of models—including Large Language Models (LLMs)—that augment high‑quality human signals across the Cint Exchange. This role involves advanced data mining, robust data validation, and the development of sophisticated statistical and machine learning methodologies.

Responsibilities

  • Contribute to the research, discovery, and development of machine learning models – specifically focused on synthetic row generation, open‑ended text generation, and data augmentation.
  • Execute statistical tests and experiments to validate LLM performance and synthetic modeling hypotheses.
  • Develop logic for on‑demand and dynamic boosting capabilities, collaborating with Engineering to integrate these models into Cint Exchange fielding workflows.
  • Design and refine sophisticated profiling taxonomies, leveraging large‑scale datasets to create syndicated audiences.
  • Manage technical workflows and development cycles with guidance.
  • Collaborate with Product and Engineering teams to support integration.
  • Create clear, effective prototypes and deliverables that explain and defend complex Generative AI concepts to both technical and non‑technical audiences.

Qualifications Required

  • Minimum 2–4 years of experience in a Data Science capacity, with experience delivering end‑to‑end data science solutions.
  • A Master’s degree (or equivalent) in Statistics, Data Science, or a related quantitative field.
  • Deep understanding of Generative AI and LLMs, particularly for applications in text generation and data synthesis.
  • Advanced knowledge of statistical techniques: hypothesis testing, sampling theory, experimental design, and causal inference.
  • Strong knowledge of a variety of ML techniques (e.g., clustering, regression, neural networks, etc.) and their real‑world trade‑offs.
  • Expert proficiency in Python (DS/ML stack) and experience with frameworks used for LLM development and fine‑tuning.
  • Advanced SQL skills and experience working with large‑scale databases.
  • Ability to research and adopt new methods.

Essential Qualities

  • Highly accountable self‑starter and quick learner, consistently motivated to deliver high‑quality, impactful results.
  • Strong data‑driven mindset with the ability to translate abstract business requests into actionable AI initiatives and solutions.
  • Excellent written and verbal communication skills, with the ability to communicate technical findings clearly.

Nice to Have

  • Direct experience with Synthetic Data Generation techniques and the evaluation of synthetic data quality/utility.
  • Experience with Prompt Engineering, RAG (Retrieval‑Augmented Generation), or fine‑tuning open‑source LLMs for open‑end generation.
  • Experience with probabilistic modeling, or advanced profiling techniques.
  • Familiarity with online market research or survey exchange platforms.
  • Experience using Databricks, Spark, or PySpark for large‑scale workflows.

Generative AI Data Scientist (LLMs & Synthetic Data) employer: Cint

Cint is an exceptional employer that fosters a dynamic and innovative work culture, particularly for those passionate about advancing AI technologies. As a Data Scientist, you will have the opportunity to collaborate with talented teams, engage in cutting-edge research, and contribute to meaningful projects that shape the future of market research. With a strong emphasis on employee growth and development, Cint offers a supportive environment where your skills can flourish, making it an ideal place for professionals eager to make a significant impact in the field of Generative AI.

Cint

Contact Details:

Cint Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Generative AI Data Scientist (LLMs & Synthetic Data)

Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with Cint employees on LinkedIn. A friendly chat can sometimes lead to opportunities that aren’t even advertised!

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects related to Generative AI and LLMs. This will give you an edge and demonstrate your hands-on experience when you get that interview.

Tip Number 3

Prepare for technical interviews by brushing up on your Python and SQL skills. Practice common data science problems and be ready to discuss your thought process. We want to see how you tackle challenges!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining the Cint team!

We think you need these skills to ace Generative AI Data Scientist (LLMs & Synthetic Data)

Data Science
Generative AI
Large Language Models (LLMs)
Statistical Techniques
Hypothesis Testing
Experimental Design
Machine Learning Techniques

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the role of Generative AI Data Scientist. Highlight your experience with LLMs and synthetic data, and don’t forget to showcase your Python and SQL skills. We want to see how your background aligns with what we’re looking for!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you’re passionate about generative AI and how your skills can contribute to our team at Cint. Keep it engaging and relevant to the job description.

Showcase Your Projects:If you’ve worked on any projects related to machine learning or data science, make sure to include them in your application. We love seeing real-world applications of your skills, especially those involving LLMs or synthetic data!

Apply Through Our Website:Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re serious about joining our team!

How to prepare for a job interview at Cint

Know Your Generative AI Inside Out

Make sure you brush up on your knowledge of Generative AI and Large Language Models (LLMs). Be prepared to discuss specific applications, like text generation and data synthesis, and how they relate to the role. This will show that you're not just familiar with the concepts but can also apply them in a practical context.

Showcase Your Data Science Skills

Be ready to talk about your experience with statistical techniques and machine learning methodologies. Prepare examples of past projects where you executed statistical tests or developed models. Highlight your proficiency in Python and SQL, as well as any frameworks you've used for LLM development.

Communicate Clearly and Effectively

Since you'll need to explain complex AI concepts to both technical and non-technical audiences, practice articulating your thoughts clearly. Use simple language to describe your work and be ready to answer questions about your findings. Good communication can set you apart from other candidates.

Prepare for Collaboration Questions

As this role involves working closely with Product and Engineering teams, think about your past collaborative experiences. Be prepared to discuss how you’ve managed technical workflows and integrated models into existing systems. Show that you can bridge the gap between different teams effectively.