Principal ML Platform Reliability Engineer

Principal ML Platform Reliability Engineer

Full-Time No working from home possible
Dormont Manufacturing Co

Dormont Manufacturing Co is seeking a Principal Software Engineer for our ML Platform to ensure reliability and scalability in biotech AI innovation. You will architect solutions focused on GPU and TPU infrastructure while leading the reliability of our global job scheduler and optimizing inference services.

The ideal candidate possesses strong skills in AI/ML workload management, cloud design, and Kubernetes orchestration while supporting a culture of collaboration and curiosity in a hybrid work environment.

#J-18808-Ljbffr
Dormont Manufacturing Co

Contact Details:

Dormont Manufacturing Co Recruitment Team