Engineering Manager, Deep Learning Inference
NVIDIA (1 month ago)
About this role
A Manager, Deep Learning Inference Software at NVIDIA leads an engineering team building and advancing inference frameworks that enable deployment of large language models and multimodal generative AI on NVIDIA GPUs. The role focuses on shaping software such as SGLang, vLLM, and FlashInfer to make AI deployment scalable and efficient across datacenter and edge environments.
Required Skills
- C/C++
- Python
- CUDA
- Triton
- CUTLASS
- GPU Programming
- Performance Optimization
- Multi-GPU
- Profiling
- Model Deployment
Qualifications
- MS in Computer Science or Electrical/Computer Engineering
- PhD in Computer Science or Electrical/Computer Engineering
About NVIDIA
nvidia.com
NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
Similar Jobs
AI Software Architect
Intel (2 months ago)
Senior ML Engineer (Token Factory)
Nebius (1 month ago)
SWE, Inference Performance, Onboard
Wayve (11 months ago)
Staff Machine Learning Performance Engineer, Inference Optimisation
Wayve (3 months ago)
Senior Software Engineer - Model Performance
Inference (1 month ago)
AI Infrastructure Engineer
NIO (5 months ago)