Job Description
**Overview**
Microsoft AI is seeking an experienced High Performance Computing Operations Engineering Manager to join our infrastructure team on the MAI SuperIntelligence Team. In this role, you’ll lead a team of Site Reliability Engineers who blend software engineering and systems engineering to keep our large-scale distributed AI infrastructure reliable and efficient. You’ll work closely with ML researchers, data engineers, and product developers to design and operate the platforms that power training, fine-tuning, and serving generative AI models.
Microsoft Superintelligence Team
Microsoft Superintelligence Team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive...