NVIDIA

Senior Site Reliability Engineer - HPC

NVIDIA(1 day ago)

HybridFull TimeSenior$152,000 - $287,500Compute Farm / Systems Engineering
Apply Now

About this role

NVIDIA is seeking a Senior Site Reliability Engineer to help build and operate its global service platform, with a focus on ensuring high availability and operational excellence. The role involves designing scalable solutions in hybrid multi-cloud environments, automating infrastructure provisioning, and collaborating across teams to maintain critical systems that support NVIDIA's innovative technologies in AI and high-performance computing.

View Original Listing

Required Skills

  • Kubernetes
  • IaC
  • Monitoring
  • Scripting
  • Observability
  • Cloud Infrastructure
  • Capacity Planning
  • Automation
  • Reliability Engineering
  • Data-Driven Operations
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

View more jobs at NVIDIA

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com