NVIDIA

Senior Site Reliability Engineer, Observability

NVIDIA(1 month ago)

HybridFull TimeSenior$184,000 - $356,500Site Reliability / Platform Engineering
Apply Now

About this role

A Site Reliability Engineer role at NVIDIA focused on the global telemetry and observability backbone for AI and data platforms. The role is part of the data and observability teams that support large-scale AI, data, and platform services and contributes to the design and evolution of NVIDIA’s telemetry systems. This position operates at the intersection of AI infrastructure and platform engineering, supporting visibility across metrics, logs, traces, and profiling data.

View Original Listing

Required Skills

  • Observability
  • Prometheus
  • Thanos
  • Mimir
  • Loki
  • OpenSearch
  • Tempo
  • Jaeger
  • OpenTelemetry
  • Python

+18 more

Qualifications

  • Bachelor's Degree in Computer Science or Related Field
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

View more jobs at NVIDIA

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com