NVIDIA

Senior Deep Learning Inference Performance Architect

NVIDIA(1 month ago)

United States, Durham, NC, North CarolinaOnsiteFull TimeSenior$184,000 - $356,500Engineering
Apply Now

About this role

A Senior Deep Learning Inference Performance Architect at NVIDIA focuses on advancing GPU and system-level approaches to improve AI inference performance and efficiency. The role centers on evaluating production large language model performance techniques, informing future GPU architecture decisions, and bridging hardware and software development. It supports the Inference Performance Architecture team in delivering real-time, cost-effective AI inference platforms.

View Original Listing

Required Skills

  • Deep Learning
  • GPU Computing
  • CUDA
  • C++
  • Python
  • Performance Modeling
  • Profiling
  • HPC
  • Computer Architecture
  • Kernel Development

Qualifications

  • MS in CS/EE/Math
  • PhD in CS/EE/Math
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

View more jobs at NVIDIA

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com