At a Glance
- Tasks: Lead the design and implementation of cutting-edge ML systems for a new matching platform.
- Company: Join the London Stock Exchange Group, a leader in financial technology.
- Benefits: Competitive salary, flexible working options, and opportunities for professional growth.
- Other info: Dynamic team environment with a focus on innovation and collaboration.
- Why this job: Make a real impact in ML engineering while shaping the future of finance.
- Qualifications: Expertise in AWS SageMaker, Python, and ML frameworks required.
The predicted salary is between 80000 - 100000 € per year.
We are seeking a Principal Machine Learning Engineer (SageMaker, MLOps, Model Governance & Explainability) to provide technical leadership across the full lifecycle of machine learning systems powering a new matching platform. This role is accountable for defining ML architecture, establishing engineering standards, driving MLOps maturity, and ensuring that our models are scalable, secure, explainable, and governed to enterprise‑grade standards. You will contribute to the strategic direction of our ML platform—spanning data pipelines, model development, deployment automation, inference runtime design, telemetry, drift detection, and cross‑account productionisation. You will mentor engineers, influence product and architectural decisions, and ensure that our ML systems operate reliably at scale, underpinned by a robust governance and compliance framework. This is a highly hands‑on, highly technical, principal‑level role that combines architectural vision with deep practical expertise in ML engineering and AWS-native MLOps.
Key Responsibilities
- Technical Leadership & Architecture: Define the end‑to‑end ML architecture for the matching platform, including data pipelines, model training workflows, inference runtimes, and telemetry ecosystems. Lead adoption of best‑in‑class MLOps patterns, platform tooling, and AWS SageMaker capabilities across training, processing, registry, monitoring, and deployment. Partner with platform, security, and data engineering teams to implement scalable data lakehouse oriented feature architectures and enterprise‑grade ML governance. Champion engineering standards for model quality, documentation, observability, and platform resilience.
- Feature Engineering & Data Architecture: Architect highly scalable, production‑ready feature pipelines within Lakehouse environments. Set the technical direction for fallback and resilience strategies (e.g., fallback pipelines). Establish and enforce data‑quality guardrails, validation schemas, and monitoring frameworks. Drive adoption and standards for enterprise feature stores.
- Model Development & Technical Excellence: Lead the design of ranking, scoring, and similarity models tailored to the matching platform requirements. Define model calibration, scoring logic, confidence thresholds, and optimisation strategies. Mentor teams on advanced ML techniques using Model frameworks such as PyTorch, TensorFlow, and XGBoost. Review and approve technical designs for complex modeling workflows.
- Explainability & Regulatory-Grade Reasoning: Establish explainability standards across the ML stack, using SHAP or equivalent frameworks. Define patterns to generate regulator‑ready reason codes, aligned with compliance requirements. Ensure explainability artefacts are accurate, robust, and traceable across model versions.
- ML Deployment & Automation (MLOps): Architect automated training, deployment, and retraining pipelines using AWS SageMaker. Set standards for model registry usage, automated approvals, and rollback orchestration. Drive infrastructure-as-code and CI/CD maturity for ML systems across multiple environments. Lead design of enterprise‑wide weight‑update patterns and lineage‑aware deployment strategies.
- Inference Runtime & Cross‑Account Productionisation: Architect low‑latency, high‑throughput inference services that meet strict matching platform SLAs. Lead the design of secure cross‑account IAM patterns for model consumption. Own end‑to‑end telemetry design, including scoring metrics, latency, error analytics, and SLOs. Partner with platform teams to optimise cost, scale, and reliability of inference endpoints.
- Monitoring, Drift Detection & Observability: Define observability standards for feature drift, concept drift, performance degradation, and data integrity. Lead the creation of dashboards, benchmarks, and automated alerting across the ML ecosystem. Ensure telemetry pipelines adhere to privacy, data minimisation, and compliance policies. Drive adoption of proactive failover, shadow-mode testing, and continuous validation patterns.
- Security, Compliance & ML Governance: Set and enforce ML-specific security standards including data minimisation, encryption, and PII handling. Oversee creation of Model Cards, lineage artefacts, and compliance documentation. Ensure ML systems meet governance standards for auditability, reproducibility, versioning, and traceability. Collaborate with InfoSec and Risk teams to define ML governance frameworks and secure cross‑environment workflows.
- Testing, Validation & Performance Engineering: Lead validation strategies using golden datasets, behavioural tests, and benchmark suites. Architect performance testing for latency‑sensitive inference paths and model hot paths. Establish standards for A/B testing, shadow deployments, canary rollouts, and controlled experiments.
Qualifications
Essential: Proven track record architecting and delivering production ML systems at scale in enterprise environments. Deep expertise with AWS SageMaker (training, processing, pipelines, endpoints, registry) and complementary AWS services. Expert‑level Python and ML Model frameworks (e.g. PyTorch, TensorFlow, XGBoost). Strong thought leadership in MLOps automation, CI/CD for ML, and model lifecycle management. Advanced experience designing explainability systems, reason codes, and governance artefacts. Expertise in low‑latency inference architectures and real‑time model serving. Strong grounding in drift detection, telemetry pipelines, observability patterns, and model QA. Experience shaping ML security practices, including cross‑account IAM, data minimisation, and PII-safe design. Ability to influence architecture, mentor senior engineers, and set long‑term technical direction.
Nice to Have: Experience building or leading feature store adoption. Background in ranking, search relevance, entity matching, or similarity modelling. Experience designing or governing multi‑account AWS ML platforms. Knowledge of distributed training, GPU/accelerator optimisation, and scaling strategies. Bachelors in a STEM subject, e.g. mathematics, physics, engineering, computer science, or adjacent degrees. Masters or PhD or equivalent experience in STEM desirable but not essential.
Principal Machine Learning Engineer employer: NLP PEOPLE
At London Stock Exchange Group, we pride ourselves on being an exceptional employer that fosters a culture of innovation and collaboration. As a Principal Machine Learning Engineer, you will not only lead cutting-edge projects but also benefit from a supportive environment that prioritises professional growth and mentorship. Our commitment to employee development, coupled with the dynamic atmosphere of London, makes this an ideal place for those seeking meaningful and rewarding careers in technology.
StudySmarter Expert Advice🤫
We think this is how you could land Principal Machine Learning Engineer
✨Tip Number 1
Network like a pro! Get out there and connect with folks in the industry. Attend meetups, webinars, or conferences related to machine learning and MLOps. You never know who might be looking for someone just like you!
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those involving AWS SageMaker and ML systems. This is your chance to demonstrate your technical prowess and make a lasting impression.
✨Tip Number 3
Prepare for interviews by brushing up on your knowledge of explainability and governance in ML. Be ready to discuss how you've tackled challenges in previous roles and how you can contribute to our ML platform's strategic direction.
✨Tip Number 4
Don't forget to apply through our website! It’s the best way to ensure your application gets noticed. Plus, it shows you're genuinely interested in joining our team at StudySmarter.
We think you need these skills to ace Principal Machine Learning Engineer
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter to highlight your experience with AWS SageMaker, MLOps, and ML governance. We want to see how your skills align with the role, so don’t hold back on showcasing your relevant projects!
Show Off Your Technical Skills:This role is all about technical leadership, so be sure to include specific examples of your work with ML systems, Python, and model frameworks like PyTorch or TensorFlow. We love seeing hands-on experience that demonstrates your expertise!
Be Clear and Concise:When writing your application, keep it straightforward and to the point. Use bullet points for key achievements and avoid jargon unless it’s necessary. We appreciate clarity and want to easily understand your qualifications.
Apply Through Our Website:We encourage you to submit your application through our website. It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it’s super easy to do!
How to prepare for a job interview at NLP PEOPLE
✨Know Your ML Architecture Inside Out
Make sure you can clearly articulate your understanding of end-to-end ML architecture, especially in relation to data pipelines and model training workflows. Be prepared to discuss how you've implemented scalable solutions in the past, particularly using AWS SageMaker.
✨Showcase Your MLOps Expertise
Highlight your experience with MLOps patterns and automation. Be ready to share specific examples of how you've driven CI/CD maturity for ML systems and the impact it had on project outcomes. This will demonstrate your hands-on technical leadership.
✨Explainability is Key
Familiarise yourself with explainability standards and frameworks like SHAP. Be prepared to discuss how you've ensured compliance and traceability in your previous projects, as this role places a strong emphasis on regulatory-grade reasoning.
✨Prepare for Technical Challenges
Expect to face technical questions or challenges during the interview. Brush up on your knowledge of Python and ML frameworks like PyTorch and TensorFlow. Practising coding problems related to model development and deployment can give you an edge.