Senior HPC and AI Networking Performance Research and Analysis Engineer
NVIDIA(26 days ago)
About this role
A Performance Research and Analysis Engineer at NVIDIA’s Performance group focuses on characterizing and understanding performance of large-scale AI workloads on GPU and CPU clusters used for distributed deep learning LLM training and inference. The role centers on analyzing communication and system behavior across hardware and software stacks and developing tools and methodologies for deep performance investigation. The position operates at the intersection of hardware platforms (HCAs, switches, CPUs, GPUs) and software layers to drive insight into performance expectations and limitations.
Required Skills
- Performance Analysis
- Benchmarking
- Profiling
- RDMA
- MPI
- NCCL
- CUDA
- Deep Learning
- Python
- C
+5 more
Qualifications
- B.Sc in Computer Science or Software Engineering
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
AI SW Runtime/Networking Engineer
Intel(21 days ago)
HPC System Engineer
Nebius(2 months ago)
Principal Software Architect- High Performance Computing
Applied Materials(12 days ago)
Principal Software Engineer – Scale-Up Networking (GPU-Centric)
Hewlett Packard Enterprise(19 days ago)
Senior HPC Developer - GPU and Networking
Clockwork.io(27 days ago)
Software Engineer — GPU Networking & Distributed Systems
Baseten(11 hours ago)