At a Glance
- Tasks: Design and develop innovative data architectures to support cutting-edge AI and machine learning workflows.
- Company: Join GSK, a global biopharma leader dedicated to advancing health through science and technology.
- Benefits: Enjoy competitive salary, flexible working options, and opportunities for professional growth.
- Why this job: Make a real impact in medical discovery while working with top-tier talent and technology.
- Qualifications: 5+ years in data architecture and experience with big data platforms required.
- Other info: Collaborative environment focused on innovation and career development.
The predicted salary is between 43200 - 72000 £ per year.
The Onyx Research Data Tech organization is GSK’s Research data ecosystem which has the capability to bring together, analyze, and power the exploration of data at scale. We partner with scientists across GSK to define and understand their challenges and develop tailored solutions that meet their needs. The goal is to ensure scientists have the right data and insights when they need it to give them a better starting point for and accelerate medical discovery. Ultimately, this helps us get ahead of disease in more predictive and powerful ways.
Onyx is a full-stack shop consisting of product and portfolio leadership, data engineering, infrastructure and DevOps, data / metadata / knowledge platforms, and AI/ML and analysis platforms, all geared toward:
- Building a next-generation, metadata- and automation-driven data experience for GSK’s scientists, engineers, and decision-makers, increasing productivity and reducing time spent on “data mechanics”
- Providing best-in-class AI/ML and data analysis environments to accelerate our predictive capabilities and attract top-tier talent
- Aggressively engineering our data at scale, as one unified asset, to unlock the value of our unique collection of data and predictions in real-time
The Onyx Data Architecture team sits within the Data Engineering team, which is responsible for the design, delivery, support, and maintenance of industrialized automated end-to-end data services and pipelines. They apply standardized data models and mapping to ensure data is accessible for end users in end-to-end user tools through use of APIs. They define and embed best practices and ensure compliance with Quality Management practices and alignment to automated data governance. They also acquire and process internal and external, structured and unstructured data in line with Product requirements.
As a Data Architect II, you'll apply your expertise in big data and AI/GenAI workflows to support GSK's complex, regulated R&D environment. You'll contribute to designing Data Mesh/Data Fabric architectures while enabling modern AI and machine learning capabilities across our platform.
You will be responsible for:
- Partnering with the Scientific Knowledge Engineering team to develop physical data models to build fit-for-purpose data products
- Designing data architecture aligned with enterprise-wide standards to promote interoperability
- Collaborating with the platform teams and data engineers to maintain architecture principles, standards, and guidelines
- Designing data foundations that support GenAI workflows including RAG (Retrieval-Augmented Generation), vector databases, and embedding pipelines
- Working across business areas and stakeholders to ensure consistent implementation of architecture standards
- Leading reviews and maintaining architecture documentation and best practices for Onyx and our stakeholders
- Adopting security-first design with robust authentication and resilient connectivity
- Providing best practices and leadership, subject matter, and GSK expertise to architecture and engineering teams composed of GSK FTEs, strategic partners, and software vendors.
Basic Qualifications:
- Bachelor’s degree in computer science, engineering, Data Science or similar discipline
- 5+ years of experience in data architecture, data engineering, or related fields in pharma, healthcare, or life sciences R&D.
- 3+ years’ experience of defining architecture standards, patterns on Big Data platforms
- 3+ years’ experience with data warehouse, data lake, and enterprise big data platforms
- 3+ years’ experience with enterprise cloud data architecture (preferably Azure or GCP) and delivering solutions at scale
- 3+ years of hands-on relational, dimensional, and/or analytic experience (using RDBMS, dimensional, NoSQL data platform technologies, and ETL and data ingestion protocols)
Preferred Qualifications:
- Master's or PhD in computer science, engineering, Data Science or similar discipline
- Deep knowledge and use of at least one common programming language: e.g., Python, Scala, Java
- Experience with AI/ML data workflows: feature stores, vector databases, embedding pipelines, model serving architectures
- Familiarity with GenAI/LLM data patterns: RAG architectures, prompt engineering data requirements, fine-tuning data preparation
- Experience with GCP data/analytics stack: Spark, Dataflow, Dataproc, GCS, BigQuery
- Experience with enterprise data tools: Ataccama, Collibra, Acryl
- Experience with Agile frameworks: SAFe, Jira, Confluence, Azure DevOps
- Experience applying CI/CD principles to data solution
- Experience with Spark and RAG-based architectures for data science and ML use cases
- Strong communication skills—ability to explain technical concepts to non-technical stakeholders
- Pharmaceutical, healthcare, or life sciences background
GSK is a global biopharma company with a purpose to unite science, technology and talent to get ahead of disease together. We aim to positively impact the health of 2.5 billion people by the end of the decade, as a successful, growing company where people can thrive. We get ahead of disease by preventing and treating it with innovation in specialty medicines and vaccines. We focus on four therapeutic areas: respiratory, immunology and inflammation; oncology; HIV; and infectious diseases – to impact health at scale.
People and patients around the world count on the medicines and vaccines we make, so we’re committed to creating an environment where our people can thrive and focus on what matters most. Our culture of being ambitious for patients, accountable for impact and doing the right thing is the foundation for how, together, we deliver for patients, shareholders and our people.
GSK is an Equal Opportunity Employer. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, genetic information (including family medical history), military service or any basis prohibited under federal, state or local law.
We believe in an agile working culture for all our roles. If flexibility is important to you, we encourage you to explore with our hiring team what the opportunities are.
Data Architect II in Stevenage employer: 1054 GlaxoSmithKline Services Unlimited
Contact Detail:
1054 GlaxoSmithKline Services Unlimited Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Data Architect II in Stevenage
✨Tip Number 1
Network like a pro! Reach out to current employees at GSK or in the data architecture field on LinkedIn. A friendly chat can give you insider info and might just lead to a referral.
✨Tip Number 2
Prepare for your interview by brushing up on your technical skills and understanding GSK's mission. Show how your experience aligns with their goals, especially in AI/ML workflows and data architecture.
✨Tip Number 3
Don’t forget to showcase your soft skills! Communication is key, especially when explaining complex concepts to non-tech folks. Practice articulating your thoughts clearly and confidently.
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining the GSK team and contributing to their mission.
We think you need these skills to ace Data Architect II in Stevenage
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the Data Architect II role. Highlight your expertise in data architecture, big data platforms, and any relevant projects you've worked on. We want to see how you can contribute to our mission!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about data architecture and how your background fits with GSK's goals. Be sure to mention any experience with AI/ML workflows or cloud data architecture, as these are key for us.
Showcase Your Technical Skills: Don’t forget to highlight your technical skills, especially in programming languages like Python or Scala, and your experience with data tools. We love seeing candidates who can bridge the gap between technical and non-technical stakeholders, so make that clear!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re serious about joining our team at GSK!
How to prepare for a job interview at 1054 GlaxoSmithKline Services Unlimited
✨Know Your Data Architecture Inside Out
Make sure you’re well-versed in data architecture principles, especially those relevant to big data platforms. Brush up on your knowledge of data mesh and data fabric architectures, as well as how they apply to AI/ML workflows. Being able to discuss these concepts confidently will show that you’re the right fit for the role.
✨Showcase Your Technical Skills
Prepare to demonstrate your expertise in programming languages like Python or Scala, and be ready to discuss your experience with cloud data architecture, particularly Azure or GCP. Bring examples of past projects where you’ve successfully implemented data solutions at scale, as this will highlight your hands-on experience.
✨Understand GSK’s Mission and Values
Familiarise yourself with GSK’s goals and how the Onyx Research Data Tech organisation fits into their mission. Be prepared to discuss how your work can contribute to getting ahead of disease and improving medical discovery. This shows that you’re not just looking for a job, but are genuinely interested in making an impact.
✨Prepare for Scenario-Based Questions
Expect to face scenario-based questions that assess your problem-solving skills and ability to collaborate across teams. Think of examples from your past experiences where you’ve had to lead architecture reviews or maintain compliance with quality management practices. This will demonstrate your leadership and teamwork capabilities.