Microsoft

Member Of Technical Staff Hardware Health Mai Superintelligence Team United States California Mountain View

Microsoft

4 months ago
Onsite
Full Time
Senior
0 applicants
View Job Listing
Microsoft
Apply to 100+ jobs

About this role

A Senior Hardware Reliability Engineer specializing in GPU cluster health monitoring and diagnostics, focusing on predictive analytics, system observability, and reliability improvements for large-scale datacenter GPU systems. The role involves collaboration with cross-functional teams to enhance hardware design and operational efficiency.

Skills

Qualifications

BS in Computer ScienceMaster's in Computer Science12+ years technical experienceExperience with HPC or GPU systems
Microsoft

About Microsoft

microsoft.com

Explore Microsoft products and services and support for your home or business. Shop Microsoft 365, Copilot, Teams, Xbox, Windows, Azure, Surface and more.

About Microsoft

Headquarters

San Francisco, CA

Company Size

201-500 employees

Founded

2018

Industry

Technology

Glassdoor Rating

4.2 / 5

Leadership Team

Sarah Johnson

Chief Executive Officer

Michael Chen

Chief Technology Officer

Emily Williams

VP of Engineering

David Rodriguez

VP of Product

Jessica Thompson

Chief Financial Officer

Andrew Park

VP of Sales

Unlock Company Insights

View leadership team, funding history,
and employee contacts for Microsoft.

Reveal Company Insights

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com