Senior Software Architect, AI Networking
NVIDIA(2 months ago)
About this role
An Architect in NVIDIA’s E2E Architecture group will shape large language model inference infrastructure, working across software and hardware to design scalable systems for generative AI on advanced GPU clusters. The role focuses on defining how AI models are deployed and scaled in production and collaborating with engineers, researchers, and partners to deliver high-performance inference solutions.
Required Skills
- LLM Inference
- Distributed Systems
- GPU Acceleration
- CUDA
- C++
- Python
- Memory Orchestration
- Compute Scheduling
- Networking
- Profiling
+7 more
Qualifications
- BS in Computer Science
- MS in Computer Science
- PhD in Computer Science
- BS in Electrical Engineering
- MS in Electrical Engineering
- PhD in Electrical Engineering
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Machine Learning Intern - Dynamic KV-Cache Modeling for Efficient LLM Inference
d-Matrix(28 days ago)
Senior Software Engineer - Model Performance
Inference(1 month ago)
Machine Learning Engineer – HPC
Meshy(5 months ago)
Machine Learning Engineer - Infra
TechCrunch(15 days ago)
System Engineer (Token Factory)
Nebius(7 months ago)
AI Senior Staff Systems Engineer
BETA CAE Greece(1 month ago)