Senior GenAI Algorithms Engineer — Post-Training Optimizations
NVIDIA(1 month ago)
About this role
A Senior Deep Learning Algorithms Engineer on NVIDIA’s Algorithmic Model Optimization Team focuses on optimizing generative AI models (LLMs, VLMs, and multimodal models) for maximal inference efficiency and streamlined deployment on NVIDIA hardware. The role spans research and engineering across the AI software stack, working with both NVIDIA SDKs and open-source frameworks to achieve strong accuracy–performance tradeoffs through software–hardware co-design.
Required Skills
- Model Optimization
- Quantization
- Speculative Decoding
- Sparsity
- Knowledge Distillation
- Pruning
- NAS
- Deployment
- PyTorch
- Python
+9 more
Qualifications
- Master's in Computer Science or Related Field
- PhD in Computer Science or Related Field
- Equivalent Experience
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Staff AI Software Engineer - Edge Model Optimization & Deployment
Field AI(21 days ago)
Senior Software Engineer - Model Performance
Inference(1 month ago)
Member of Technical Staff - Efficient ML
Moonlake AI(2 months ago)
Staff Machine Learning Engineer, Inference Optimisation
Wayve(5 months ago)
Senior ML Scientist, GenAI
Picsart(25 days ago)
Principal Machine Learning Researcher, On-Device Optimization
HP(1 month ago)