Senior Software Engineer, Observability
NVIDIA(1 month ago)
About this role
A Senior/Staff Engineer on NVIDIA's Observability team will lead the technical design and delivery of a next-generation, multi-region observability platform that supports NVIDIA’s AI and data ecosystems at extreme scale. This architecture-heavy, code-first role focuses on creating a unified telemetry stack for metrics, logs, traces, profiles, and analytics. The position collaborates broadly across GPU, ML infra, networking, and cloud teams to shape NVIDIA’s global observability strategy.
Required Skills
- Prometheus
- Thanos
- Mimir
- Grafana
- OpenTelemetry
- Fluent Bit
- Vector
- Loki
- ELK
- OpenSearch
+19 more
Qualifications
- BS or MS in EE, ECE, CS or Equivalent Experience
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Observability Engineer (Cloud Engineer)
Fair Isaac Corporation(2 months ago)
Observability Engineer
LSEG(4 months ago)
Observability Engineer
LSEG(1 month ago)
Senior Site Reliability Engineer (Observability)
Iterable(2 months ago)
Staff Platform Site Reliability Specialist (Observability & Kubernetes)
Everbridge (23 days ago)
Solutions Architect
OpenObserve(1 month ago)