Senior Deep Learning Software Engineer, Inference
NVIDIA(1 month ago)
California, United States, Santa Clara, CAOnsiteFull TimeSenior$152,000 - $287,500Software Engineering
Apply NowAbout this role
A Senior Software Engineer in Deep Learning Inference at NVIDIA will design, build, and optimize GPU-accelerated software that powers large-scale AI model serving. The role focuses on advancing and maintaining inference frameworks such as SGLang and vLLM to enable efficient deployment of state-of-the-art language and generative models.
Required Skills
- C++
- Python
- CUDA
- GPU Programming
- Deep Learning
- Inference
- Performance Optimization
- Model Serving
- Multi-GPU
- NCCL
+3 more
Qualifications
- Master's Degree in Computer Science or related
- PhD in Computer Science or related
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →