Senior Site Reliability Engineer - Storage
NVIDIA(1 month ago)
About this role
A Senior Site Reliability Engineer at NVIDIA focused on high-performance computing (HPC) storage and cloud-augmented on-prem infrastructure. The role supports NVIDIA’s AI and HPC initiatives by shaping storage architecture and enabling scalable, reliable infrastructure for engineering teams. The position sits within NVIDIA’s infrastructure engineering organization and contributes to cutting-edge computing projects.
Required Skills
- NFS
- NVMe/TCP
- S3
- Lustre
- Kubernetes
- Python
- Go
- Automation
- Monitoring
- Configuration Management
+9 more
Qualifications
- BS in Computer Science
- MS
- Ph.D.
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
HPC & AI Cloud Architect
Leonardo(1 month ago)
Staff Engineer, Distributed Storage and HPC & AI Infrastructure
Together AI(1 month ago)
Manager, HPC Storage Engineer
Runpod, Inc. (26 days ago)
Assoc. Dir. DDIT IES Cloud Engineering
Jack and Jen Child Care Center(2 months ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI(14 days ago)
Senior Site Reliability Engineer
Sustainable Talent(1 year ago)