Senior Site Reliability Engineer - DGX Cloud
NVIDIA(1 month ago)
About this role
A Senior Site Reliability Engineer at NVIDIA ensures the reliability, availability, and performance of large-scale GPU cloud services by overseeing production system design and long-term operational health. The role emphasizes cross-functional collaboration and continuous improvement of systems to support both internal and external users.
Required Skills
- Kubernetes
- OpenStack
- Python
- Go
- Linux
- Networking
- Containers
- Automation
- Monitoring
- Logging
+5 more
Qualifications
- BS in Computer Science or Related Field
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Site Reliability Engineer
McKesson(2 months ago)
Site Reliability Engineer II
Fivetran (1 month ago)
Senior Site Reliability Engineering (SRE) Manager
Swift(1 month ago)
Senior Site Reliability Engineer
Cabify(1 month ago)
Site Reliability Engineer III
Genuine Parts Company(27 days ago)
Site Reliability Engineer
Boomi (1 month ago)