Senior Deep Learning Software Engineer, Inference
NVIDIA(1 month ago)
About this role
A Senior Software Engineer focused on Deep Learning Inference responsible for designing, building, and optimizing GPU-accelerated software that powers advanced AI applications. The role centers on developing and improving inference frameworks (such as SGLang and vLLM) and enabling efficient deployment of large-scale language and generative models across NVIDIA accelerators. The position collaborates with the deep learning community and internal teams to deliver high-performance model serving solutions.
Required Skills
- Deep Learning
- C++
- CUDA
- GPU Programming
- Performance Optimization
- Profiling
- Model Serving
- NCCL
- Python
- Software Design
Qualifications
- Master's Degree in Computer Science
- PhD in Computer Science
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
AI Software Architect
Intel(2 months ago)
Senior Software Engineer - Model Performance
Inference(1 month ago)
SWE, Inference Performance, Onboard
Wayve(11 months ago)
Machine Learning Engineer – HPC
Meshy(5 months ago)
Embedded Software Engineer with Deep Learning experience, Lund
Axis Communications(1 month ago)
Research Engineer (LLM Training and Performance)
JetBrains(3 months ago)