At a Glance
- Tasks: Shape the future of AI/ML data platforms and mentor innovative teams.
- Company: Join JPMorgan Chase, a leader in tech and finance with a diverse culture.
- Benefits: Competitive salary, health coverage, tuition reimbursement, and mental health support.
- Other info: Diverse workplace valuing inclusion and offering excellent growth opportunities.
- Why this job: Make a real impact in AI/ML while advancing your career in a collaborative environment.
- Qualifications: Experience in site reliability, Python, and mentoring team members.
The predicted salary is between £80,000 and £100,000 per year.
Join us to shape the future of AI/ML data platforms, where your expertise will help create resilient and market‑leading solutions. You will have the opportunity to collaborate with innovators across our global network, driving strategic change and mentoring others. We value your skills in solving complex challenges and fostering a culture of reliability and growth. At JPMorgan Chase, your impact will reach far beyond your team, opening doors to career advancement and meaningful relationships.
As a Site Reliability Engineer in the AI/ML Data Platforms team, you will play a key role in building scalable and resilient data solutions. You will engage in root cause analysis, production changes, and operational improvements, while supporting budgetary and staffing decisions. You will mentor team members and partner with colleagues across the organization to drive strategic change. Your contributions will help shape a collaborative, innovative, and high‑performing team culture.
Job Responsibilities
- Demonstrate expertise in application development and support across technologies such as Databricks, Snowflake, AWS, and Kubernetes.
- Coordinate incident management coverage to ensure effective resolution of application issues.
- Collaborate with cross‑functional teams to perform root cause analysis and implement production changes.
- Develop and support AI/ML solutions for troubleshooting and incident resolution.
- Mentor and guide team members to foster growth and drive strategic change.
- Build and maintain scalable, resilient, and market‑leading data solutions.
- Support budgetary and staffing considerations to optimize team performance.
- Engage in operational stability and disaster recovery planning.
- Implement automation tools to reduce toil and improve efficiency.
- Ensure compliance with risk controls and company‑wide standards.
- Build meaningful relationships across teams to achieve common goals.
Required Qualifications
- Proficient in site reliability culture and principles, with experience implementing site reliability within applications or platforms.
- Skilled in running production incident calls and managing incident resolution.
- Experienced in observability, including white‑box and black‑box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
- Strong understanding of SLI/SLO/SLA and Error Budgets.
- Proficient in Python or PySpark for AI/ML modeling.
- Able to reduce toil by building automation tools for repeated tasks.
- Hands‑on experience in system design, resiliency, testing, operational stability, and disaster recovery.
- Awareness of risk controls and compliance with departmental and company‑wide standards.
- Collaborative team player with the ability to build meaningful relationships.
Preferred Qualifications
- Experience in an SRE or production support role with AWS Cloud, Databricks, Snowflake, or similar technologies.
- AWS and Databricks certifications.
- Advanced knowledge of AI/ML troubleshooting and incident resolution.
- Familiarity with budgetary and staffing optimization.
- Experience mentoring and guiding team members.
- Strong communication and interpersonal skills.
- Demonstrated ability to drive strategic change across teams.
Benefits
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission‑based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on‑site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more.
Equal Opportunity Employer
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs.
Senior Lead Software Engineering - AI/ML Engineer in London employer: Fairygodboss
At JPMorgan Chase, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration. As a Senior Lead Software Engineer in our AI/ML Data Platforms team, you will not only have the chance to work with cutting-edge technologies but also enjoy comprehensive benefits, including health care coverage, tuition reimbursement, and mental health support. Our commitment to employee growth and diversity ensures that your contributions will be valued and your career can flourish in a supportive environment.
StudySmarter Expert Advice🤫
We think this is how you could land Senior Lead Software Engineering - AI/ML Engineer in London
✨Tip Number 1
Network like a pro! Reach out to current employees at JPMorgan Chase through LinkedIn or other platforms. A friendly chat can give you insights into the company culture and might even lead to a referral.
✨Tip Number 2
Prepare for those interviews by brushing up on your technical skills. Make sure you can confidently discuss AI/ML solutions and site reliability principles. Practice common interview questions and scenarios related to the role.
✨Tip Number 3
Showcase your problem-solving skills! Be ready to share examples of how you've tackled complex challenges in previous roles. This will demonstrate your ability to contribute to building resilient data solutions.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets noticed. Plus, it shows that you’re genuinely interested in being part of the team at JPMorgan Chase.
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with AI/ML and site reliability. We want to see how your skills align with the role, so don’t hold back on showcasing your relevant projects!
Showcase Your Technical Skills: When applying, emphasise your proficiency in tools like Databricks, AWS, and Python. We’re looking for candidates who can demonstrate their technical expertise, so include specific examples of how you've used these technologies in past roles.
Highlight Collaboration Experience: Since this role involves working with cross-functional teams, share instances where you’ve successfully collaborated with others. We value teamwork, so let us know how you’ve built meaningful relationships in your previous positions.
Apply Through Our Website: We encourage you to submit your application through our website for a smoother process. It’s the best way for us to receive your details and get you into our system quickly!
How to prepare for a job interview at Fairygodboss
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, like Databricks, Snowflake, AWS, and Kubernetes. Brush up on your Python or PySpark skills, as you'll likely be asked to demonstrate your expertise in AI/ML solutions during the interview.
✨Showcase Your Problem-Solving Skills
Prepare to discuss specific examples of how you've tackled complex challenges in previous roles. Be ready to explain your approach to root cause analysis and incident management, as these are key responsibilities for the role.
✨Emphasise Collaboration
Since this role involves working with cross-functional teams, highlight your experience in building relationships and mentoring others. Share stories that showcase your ability to drive strategic change and foster a collaborative team culture.
✨Understand Site Reliability Principles
Familiarise yourself with site reliability culture and principles, especially if you have experience in an SRE or production support role. Be prepared to discuss how you’ve implemented these principles in past projects and how they can benefit the team at JPMorgan Chase.