NVIDIA

Senior System Architect, Infrastructure Reliability

NVIDIA (posted 1 day ago)

Hybrid | Full Time | Senior | $184,000 - $356,500 | Systems Engineering

About this role

NVIDIA is hiring a Senior System Architect specializing in Heterogeneous EDA Systems to develop an automated framework for failure attribution in high-performance computing environments. The role involves designing scalable diagnostic tools that analyze system telemetry to identify root causes of job failures across CPU, GPU, and system infrastructure.

Required Skills

  • C++
  • Python
  • Distributed Systems
  • Linux
  • Cluster Management
  • Telemetry
  • GPU Monitoring
  • System Diagnostics
  • Machine Learning
  • High-Performance Computing

About NVIDIA

nvidia.com

NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
