Site Reliability Engineer - Hardware Infrastructure
NVIDIA(1 month ago)
About this role
A Site Reliability Engineer at NVIDIA helps define, develop, and support large-scale production systems to ensure high efficiency, availability, and uptime. The role combines software and systems engineering to enable developer velocity while maintaining reliable, fault-tolerant services. The team emphasizes collaboration, creativity, and automation to minimize toil and sustain system performance.
Required Skills
- Incident Management
- Postmortems
- Root Cause Analysis
- SLOs
- Monitoring
- Alerting
- Automation
- Generative AI
- On-Call
- Python
+5 more
Qualifications
- Degree in Computer Science or Related Field
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Site Reliability Engineer - Intermediate
Equifax India(1 month ago)
SRE, Site Reliability Engineering
Klaviyo(24 days ago)
Site Reliability Engineer - Vice President
iCapital(20 days ago)
Engineer - Site Reliability Engineering
LSEG(7 months ago)
Site Reliability Engineer, Cloud Infrastructure
Quizlet(1 month ago)
Site Reliability Engineer - Intermediate
Equifax India(1 month ago)