Senior AI-HPC Cluster Engineer - MLOps
NVIDIA (1 month ago)
About this role
NVIDIA is hiring an experienced engineer to design and implement GPU compute clusters for deep learning and high-performance computing. The role focuses on building scalable automation and tooling, supporting researchers running AI/HPC workloads, and improving the GPU-accelerated computing ecosystem. It emphasizes collaboration across teams and strategic mentorship on managing large-scale compute, networking, and storage infrastructure.
Required Skills
- HPC Systems
- GPU Computing
- Cluster Management
- Slurm
- Kubernetes
- MPI
- NCCL
- Linux
- Containers
- Python
Qualifications
- Bachelor’s degree in Computer Science, Electrical Engineering, or related field
About NVIDIA
nvidia.com
NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.