Senior Site Reliability Engineer, HPC and LSF
NVIDIA
About this role
A Senior Site Reliability Engineer on NVIDIA’s Hardware Infrastructure Farm leads the design and operation of large-scale compute clusters that support silicon development and datacenter acceleration. The role focuses on building and operating high-reliability, high-performance infrastructure while driving automation and process improvements to increase engineers’ productivity. It involves collaborating with domain experts to improve how chip development utilizes compute resources and contributing to time-to-market improvements for next-generation chips.
Skills
Qualifications
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
Recent company news
Nvidia Swears Off an Earnings Crutch, Putting Pressure on Other Tech Companies
11 hours ago
Nvidia to invest $4 billion into photonics companies Coherent and Lumentum
2 days ago
Tech stocks today: Nvidia CEO Jensen Huang suggests end of OpenAI investments, Apple unveils MacBook Neo
1 hour ago
NVIDIA Announces Strategic Partnership With Lumentum to Develop State-of-the-Art Optics Technology
2 days ago
CEO Of Tiny Company Tells Jim Cramer They’ve Outperformed NVIDIA Since 2015
2 days ago
About NVIDIA
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Unlock Company Insights
View leadership team, funding history,
and employee contacts for NVIDIA.
Salary
$170k – $227k
per year