Senior Site Reliability Engineer
NVIDIA(9 days ago)
About this role
NVIDIA is seeking a seasoned Senior Site Reliability Engineer (SRE) to support its complex infrastructure and internal CI/CD systems for GPU and Tegra systems. The role involves managing on-premises data center infrastructure, deploying applications on Kubernetes, and ensuring system reliability and security across NVIDIA's various business units.
Required Skills
- Kubernetes
- Docker
- Prometheus
- Grafana
- Ansible
- Redfish
- KVM
- IPMI
- SQL
- Automation
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Lead Site Reliability Engineer
GetGround(2 months ago)
Site Reliability Engineer
Euronext(2 days ago)
Site Reliability Engineer
NTT DATA,(2 days ago)
Staff Site Reliability Engineer
PathAI(1 month ago)
Principal Site Reliability Engineer (Platform Tribe)
Playson(15 days ago)
Site Reliability Engineer
NTT DATA,(1 month ago)