NVIDIA

Senior Site Reliability Engineer - Observability and Telemetry Platform

NVIDIA

2 months ago
United States
Onsite
Full Time
Senior
0 applicants
View Job Listing
NVIDIA
Apply to 100+ jobs

About this role

A Site Reliability Engineer at NVIDIA is responsible for ensuring high availability and efficient operation of large-scale GPU cloud services. The role focuses on designing and improving production systems and observability platforms to support performance, capacity, and developer velocity. It emphasizes automation, reliability engineering practices, and continuous system improvement in a collaborative engineering environment.

Skills

Qualifications

BS in Computer Science or Related Field
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

About NVIDIA

Headquarters

San Francisco, CA

Company Size

201-500 employees

Founded

2018

Industry

Technology

Glassdoor Rating

4.2 / 5

Leadership Team

Sarah Johnson

Chief Executive Officer

Michael Chen

Chief Technology Officer

Emily Williams

VP of Engineering

David Rodriguez

VP of Product

Jessica Thompson

Chief Financial Officer

Andrew Park

VP of Sales

Unlock Company Insights

View leadership team, funding history,
and employee contacts for NVIDIA.

Reveal Company Insights

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com