Senior Site Reliability Engineer - GPU Cloud
NVIDIA(7 months ago)
About this role
A Senior Site Reliability Engineer position on NVIDIA's SRE team supporting the NVIDIA GPU Cloud platform for internal R&D and external AI/ML customers. The role is part of the infrastructure organization focused on High-Performance and Distributed Computing across on-prem and cloud environments.
Required Skills
- Infrastructure Automation
- Distributed Systems
- Terraform
- Kubernetes
- Cloud Administration
- Go
- Python
- C++
- Debugging
- Troubleshooting
+2 more
Qualifications
- M.Sc
- B.E in Computer Science
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Site Reliability Architect
HHAeXchange(20 days ago)
GPU Software Engineer
CAE(15 days ago)
Senior Site Reliability Engineer
Nebius(11 months ago)
Senior Site Reliability Engineer
Nebius(1 year ago)
Senior Software Engineer - Cloud Traffic
Confluent(18 days ago)
Senior Site Reliability Engineer
Nebius(11 months ago)