Senior Storage Production Engineer - DGX Cloud
NVIDIA (1 month ago)
About this role
A Storage Production Engineer at NVIDIA is responsible for ensuring GPU cloud storage services meet reliability, availability, and performance expectations for AI/ML and HPC workloads. The role focuses on driving automation, optimizing data access efficiency, and improving storage system lifecycle and scalability. It supports cross-team coordination to enable safe changes and long-term system reliability.
Required Skills
- Distributed Storage
- Storage Networking
- Linux
- Kubernetes
- Automation
- Monitoring
- Performance Tuning
- Capacity Planning
- Python
- C++
+5 more
Qualifications
- BS in Computer Science
About NVIDIA
nvidia.com
NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
Similar Jobs
Engineering Manager, HPC Kubernetes Platform
NorthMark Strategies (2 months ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI (6 days ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI (21 days ago)
Senior System Engineer
Backbase (7 days ago)
Storage Admin / Linux Admin
BTI Solutions (2 months ago)
Staff Engineer, Distributed Storage and HPC & AI Infrastructure
Together AI (1 month ago)