NVIDIA

Engineering Manager, Deep Learning Inference

NVIDIA (1 month ago)

United States, California, Santa Clara, CA | Onsite | Full Time | Manager | $224,000 - $431,250 | Engineering

About this role

A Manager, Deep Learning Inference Software at NVIDIA leads an engineering team building and advancing inference frameworks that enable deployment of large language models and multimodal generative AI on NVIDIA GPUs. The role focuses on shaping software such as SGLang, vLLM, and FlashInfer to make AI deployment scalable and efficient across datacenter and edge environments.

Required Skills

  • C/C++
  • Python
  • CUDA
  • Triton
  • CUTLASS
  • GPU Programming
  • Performance Optimization
  • Multi-GPU
  • Profiling
  • Model Deployment

Qualifications

  • MS in Computer Science or Electrical/Computer Engineering
  • PhD in Computer Science or Electrical/Computer Engineering

About NVIDIA

nvidia.com

NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
