At a Glance
- Tasks: Develop and support AI/ML solutions while collaborating with talented teams.
- Company: Join JPMorgan Chase, a leader in financial innovation and technology.
- Benefits: Competitive salary, health coverage, tuition reimbursement, and mental health support.
- Other info: Diverse and inclusive workplace with excellent growth opportunities.
- Why this job: Make a real impact on AI/ML platforms and drive strategic change.
- Qualifications: Experience in site reliability and proficiency in Python or PySpark.
The predicted salary is between £60,000 and £80,000 per year.
Join us to shape the future of AI/ML data platforms and make a real impact on how we deliver market‑leading solutions. You will collaborate with talented colleagues, solve complex challenges, and help drive strategic change across our organization. At JPMorgan Chase, you’ll find opportunities for growth, mentorship, and the chance to work with cutting‑edge technologies. Your contributions will help us deliver resilient and innovative data solutions that power our business.
As a Site Reliability Engineer in the AI/ML Data Platforms team, you will play a key role in building and supporting scalable, resilient data solutions. You will perform root cause analysis, implement production changes, and collaborate with cross‑functional teams to drive improvements. You will also mentor team members and partner with colleagues across our global network. Your work will directly impact the reliability and performance of our AI/ML platforms.
Job Responsibilities
- Develop and support AI/ML solutions for troubleshooting and incident resolution
- Coordinate incident management coverage to ensure effective resolution of application issues
- Collaborate with cross‑functional teams to perform root cause analysis and implement production changes
- Apply expertise in application development and support using technologies such as Databricks, Snowflake, AWS, and Kubernetes
- Mentor and guide team members to drive strategic change
- Build tools to automate repeated tasks and reduce operational toil
- Ensure compliance with risk controls and company standards
- Contribute to system design, resiliency, testing, operational stability, and disaster recovery
- Foster a collaborative team environment to achieve common goals
Required Qualifications, Capabilities, and Skills
- Proficient in site reliability culture and principles, with experience implementing them within applications or platforms
- Skilled in running production incident calls and managing incident resolution
- Experience with observability, including monitoring, alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, or Splunk
- Strong understanding of SLI/SLO/SLA and error budgets
- Proficiency in Python or PySpark for AI/ML modeling
- Ability to automate tasks and reduce toil through tool development
- Hands‑on experience in system design, resiliency, testing, operational stability, and disaster recovery
- Awareness of risk controls and compliance with organizational standards
- Ability to work collaboratively and build meaningful relationships
Preferred Qualifications, Capabilities, and Skills
- Experience in an SRE or production support role with AWS Cloud, Databricks, Snowflake, or similar technologies
- AWS and Databricks certifications
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission‑based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on‑site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs.
Software Engineering III - AMDP employer: J.P. Morgan
At JPMorgan Chase, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters collaboration and innovation. Our commitment to employee growth is evident through mentorship opportunities and access to cutting-edge technologies, ensuring that you can make a meaningful impact in the AI/ML data platforms space. With comprehensive benefits, including health care coverage, retirement plans, and mental health support, we prioritise the well-being of our employees while championing diversity and inclusion across our global workforce.
StudySmarter Expert Advice🤫
We think this is how you could land Software Engineering III - AMDP
✨Tip Number 1
Network like a pro! Reach out to current employees at JPMorganChase on LinkedIn or through mutual connections. A friendly chat can give you insider info and might just get your foot in the door.
✨Tip Number 2
Prepare for those technical interviews! Brush up on your Python or PySpark skills, and be ready to discuss your experience with AWS, Databricks, and Kubernetes. Practice common SRE scenarios to show off your problem-solving chops.
✨Tip Number 3
Show your passion for AI/ML! Be ready to share your thoughts on the latest trends in data platforms and how you can contribute to innovative solutions at JPMorganChase. Your enthusiasm can set you apart from other candidates.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re serious about joining the team and making an impact.
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter for the Software Engineering III role. Highlight your experience with AI/ML solutions and any relevant technologies like AWS or Databricks. We want to see how your skills align with what we’re looking for!
Showcase Your Problem-Solving Skills: In your application, don’t just list your skills; give us examples of how you've tackled complex challenges in the past. We love seeing real-world applications of your expertise, especially in incident resolution and root cause analysis.
Be Authentic: Let your personality shine through! We value diversity and want to know who you are beyond your technical skills. Share your passion for site reliability engineering and how you collaborate with teams to drive strategic change.
Apply Through Our Website: We encourage you to submit your application directly through our website. It’s the best way to ensure it gets into the right hands. Plus, you’ll find all the details about the role and our company culture there!
How to prepare for a job interview at J.P. Morgan
✨Know Your Tech Stack
Familiarise yourself with the technologies mentioned in the job description, like Databricks, Snowflake, AWS, and Kubernetes. Be ready to discuss your experience with these tools and how you've used them in past projects.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of how you've tackled complex challenges in previous roles. Highlight your experience with root cause analysis and incident management, as these are key aspects of the Site Reliability Engineer role.
✨Emphasise Collaboration
Since this role involves working with cross-functional teams, be prepared to discuss how you've successfully collaborated with others in the past. Share examples that demonstrate your ability to build meaningful relationships and mentor team members.
✨Understand SLI/SLO/SLA Concepts
Brush up on your knowledge of service level indicators, objectives, and agreements. Be ready to explain how you've applied these concepts in your work, as they are crucial for ensuring the reliability and performance of AI/ML platforms.
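If you want something concrete to practise with, here is a minimal Python sketch of the arithmetic behind an availability SLO and its error budget. The function names, the 30-day window, and the 99.9% target are illustrative assumptions, not anything specified in the job description; treat it as a way to rehearse the reasoning rather than a reference implementation.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction such as 0.999 (hypothetical example value).
    """
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent.

    The SLI here is good_events / total_events. A result of 1.0 means the
    budget is untouched, 0.0 means it is exhausted, and a negative value
    means the SLO has been breached.
    """
    if total_events == 0:
        return 1.0  # no traffic, so nothing has been spent
    allowed_bad = (1.0 - slo_target) * total_events
    actual_bad = total_events - good_events
    if allowed_bad == 0:
        # A 100% target leaves no budget at all.
        return 1.0 if actual_bad == 0 else float("-inf")
    return 1.0 - actual_bad / allowed_bad


if __name__ == "__main__":
    print(f"Budget: {error_budget_minutes(0.999):.1f} minutes per 30 days")
    print(f"Remaining: {budget_remaining(0.999, 999_500, 1_000_000):.0%}")
```

With a 99.9% target over 30 days, the budget works out to about 43.2 minutes of downtime. In the second call, 500 failed requests out of 1,000,000 against an allowance of roughly 1,000 leaves about half the budget unspent, which is the kind of figure you might use to explain whether a risky release should go ahead.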