At a Glance
- Tasks: Develop and support AI/ML solutions while collaborating with talented teams.
- Company: Join JPMorganChase, a leader in innovative financial solutions.
- Benefits: Competitive salary, health coverage, tuition reimbursement, and mental health support.
- Other info: Diverse and inclusive workplace with excellent growth opportunities.
- Why this job: Make a real impact on AI/ML platforms and drive strategic change.
- Qualifications: Experience in site reliability and proficiency in Python or PySpark.
The predicted salary is between £60,000 and £80,000 per year.
Join us to shape the future of AI/ML data platforms and make a real impact on how we deliver market-leading solutions. You will collaborate with talented colleagues, solve complex challenges, and help drive strategic change across our organization. At JPMorganChase, you'll find opportunities for growth, mentorship, and the chance to work with cutting‑edge technologies. Your contributions will help us deliver resilient and innovative data solutions that power our business.
As a Site Reliability Engineer in the AI/ML Data Platforms team, you will play a key role in building and supporting scalable, resilient data solutions. You will perform root cause analysis, implement production changes, and collaborate with cross‑functional teams to drive improvements. You will also mentor team members and partner with colleagues across our global network. Your work will directly impact the reliability and performance of our AI/ML platforms.
Job Responsibilities
- Develop and support AI/ML solutions for troubleshooting and incident resolution
- Coordinate incident management coverage to ensure effective resolution of application issues
- Collaborate with cross‑functional teams to perform root cause analysis and implement production changes
- Apply expertise in application development and support using technologies such as Databricks, Snowflake, AWS, and Kubernetes
- Mentor and guide team members to drive strategic change
- Build tools to automate repeated tasks and reduce operational toil
- Ensure compliance with risk controls and company standards
- Contribute to system design, resiliency, testing, operational stability, and disaster recovery
- Foster a collaborative team environment to achieve common goals
Qualifications
- Proficient in site reliability culture and principles, with experience implementing them within applications or platforms
- Skilled in running production incident calls and managing incident resolution
- Experience with observability, including monitoring, alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, or Splunk
- Strong understanding of SLI/SLO/SLA and error budgets
- Proficiency in Python or PySpark for AI/ML modeling
- Ability to automate tasks and reduce toil through tool development
- Hands‑on experience in system design, resiliency, testing, operational stability, and disaster recovery
- Awareness of risk controls and compliance with organizational standards
- Ability to work collaboratively and build meaningful relationships
- Experience in an SRE or production support role with AWS Cloud, Databricks, Snowflake, or similar technologies
- AWS and Databricks certifications
Software Engineering III - AMDP in London employer: Ccgmag
Contact Detail:
Ccgmag Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Software Engineering III - AMDP role in London
✨Tip Number 1
Network like a pro! Reach out to current employees at JPMorganChase on LinkedIn or through mutual connections. A friendly chat can give you insider info and might just get your foot in the door.
✨Tip Number 2
Show off your skills! Prepare a portfolio or GitHub repository showcasing your projects, especially those related to AI/ML and site reliability. This is your chance to demonstrate your expertise beyond the CV.
✨Tip Number 3
Ace the interview by practising common technical questions and situational scenarios. Use platforms like StudySmarter to brush up on your knowledge and rehearse your answers with friends or mentors.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in being part of the JPMorganChase team.
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with AI/ML solutions and site reliability principles. We want to see how your skills align with the role, so don’t hold back on showcasing your relevant projects!
Showcase Your Technical Skills: When applying, emphasise your proficiency in tools like Databricks, Snowflake, and AWS. We’re keen on seeing your hands-on experience, so include specific examples of how you’ve used these technologies to solve complex challenges.
Highlight Collaboration Experience: Since this role involves working with cross-functional teams, share instances where you’ve successfully collaborated with others. We love to see how you’ve contributed to team goals and driven strategic change in previous roles.
Apply Through Our Website: Don’t forget to submit your application through our official website! It’s the best way for us to receive your details and ensure you’re considered for this exciting opportunity. We can’t wait to hear from you!
How to prepare for a job interview at Ccgmag
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, like Databricks, Snowflake, AWS, and Kubernetes. Brush up on your Python or PySpark skills, as these will be crucial for AI/ML modelling.
✨Showcase Your SRE Experience
Be prepared to discuss your experience with site reliability principles and how you've implemented them in past roles. Highlight any incidents you've managed and the outcomes of those situations to demonstrate your problem-solving skills.
✨Prepare for Collaboration Questions
Since this role involves working with cross-functional teams, think of examples where you’ve successfully collaborated with others. Be ready to explain how you fostered a team environment and drove strategic change.
✨Understand Incident Management
Familiarise yourself with incident management processes and be ready to discuss how you would handle production issues. Knowing tools for observability like Grafana or Datadog will also give you an edge in the conversation.