Senior System Software Engineer - AI Performance and Efficiency Tools
NVIDIA (1 month ago)
About this role
A software engineering role at NVIDIA focused on building internal analysis, profiling, and debugging tools for AI workloads running on GPU clusters. The position works with architecture and software teams to provide insights that improve performance, power efficiency, and system reliability for AI training and inference.
Required Skills
- C++
- Python
- PyTorch
- TensorFlow
- Distributed Training
- Slurm
- Kubernetes
- CUDA
- NCCL
- Profiling
Qualifications
- BS in Computer Science or a related field
About NVIDIA
nvidia.com
NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.