Senior Platform and EngOps Engineer - Cluster Operations
NVIDIA(2 months ago)
About this role
NVIDIA is hiring EngOps and Platform Engineers to join teams that develop and maintain software enabling GPU communication for artificial intelligence, high-performance computing, and visualization. The role centers on supporting and improving large, interconnected GPU clusters (NVLink and InfiniBand) that power advanced AI research and products. This position contributes to NVIDIA's mission to accelerate the next wave of AI and discovery.
Required Skills
- Ansible
- Python
- Shell Scripting
- Linux
- Networking
- Cluster Management
- DevOps
- Troubleshooting
- Monitoring
- Firmware Updates
Qualifications
- BS in Computer Science
- MS in Computer Science
- BS in Computer Engineering
- MS in Computer Engineering
- BS in Electrical Engineering
- MS in Electrical Engineering
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Assoc. Dir. DDIT IES Cloud Engineering
Jack and Jen Child Care Center(2 months ago)
GPU Cluster Architect
Nebius(6 months ago)
Senior HPC Cluster Engineer
Nebius(11 months ago)
Opportunistic Role
SF Compute(2 months ago)
Senior HPC Cluster Engineer
Nebius(1 year ago)
AI Senior Staff Systems Engineer
BETA CAE Greece(1 month ago)