Manager, Site Reliability Engineer - DGX Cloud
NVIDIA(1 month ago)
About this role
NVIDIA is a leader in AI and high-performance computing, building cloud platforms that power advanced GPUs and AI applications. The Senior Manager of SRE is a senior engineering leader responsible for the SRE organization that supports NVIDIA’s cloud offerings and long-term platform reliability strategy.
Required Skills
- Kubernetes
- Cloud
- Automation
- Observability
- Incident Management
- Leadership
- Terraform
- Python
- Linux
- SRE Principles
Qualifications
- BS in Computer Science
- MS in Computer Science
- BS in Electrical Engineering
- MS in Electrical Engineering
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Lead Cloud Site Reliability Engineer
NICE(11 days ago)
Sr Site Reliability Engineer
NATIONAL(2 months ago)
Site Reliability Engineer (SRE)
Encora(1 month ago)
Principal Site Reliability Engineer
LI Test Company(1 month ago)
Senior MTS, Site Reliability Engineering
Aviatrix(1 month ago)
Staff Site Reliability Engineer
BuildOps(3 days ago)