Senior Software Engineer, AI Inference Systems
NVIDIA (1 month ago)
About this role
This Senior Software Engineer role at NVIDIA focuses on building highly efficient AI inference systems for large-scale models. The work centers on advancing inference stacks, GPU kernels, and compiler infrastructure to maximize performance and scalability across multi-GPU, multi-node, and multi-cloud environments. It also involves contributing to industry benchmarks and publishing research that advances ML systems performance.
Required Skills
- Python
- C++
- CUDA
- GPU Programming
- Kernel Optimization
- Compiler Development
- Profiling
- Distributed Systems
- Kubernetes
- Docker
Qualifications
- BS in Computer Science/Engineering/Software Engineering
- MS in Computer Science/Engineering/Software Engineering
- PhD in ML Systems/GPU Architecture/High-Performance Computing
About NVIDIA
nvidia.com
NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.