Senior GenAI Algorithms Engineer — Post-Training Optimizations
NVIDIA(24 days ago)
About this role
A Senior Deep Learning Algorithms Engineer on NVIDIA’s Algorithmic Model Optimization Team focuses on optimizing generative AI models (LLMs, VLMs, and multi‑modal models) for maximal inference efficiency and deployability on NVIDIA hardware. The role bridges research and engineering to design, implement, and productionize model optimization algorithms and software within NVIDIA’s AI stack and open-source frameworks. It emphasizes software–hardware co‑design to improve compute and memory efficiency while balancing accuracy–performance tradeoffs.
Required Skills
- Model Optimization
- Quantization
- Speculative Decoding
- Sparsity
- Distillation
- Pruning
- NAS
- Python
- PyTorch
- CUDA
+5 more
Qualifications
- Master's Degree in Computer Science or Related Field
- PhD in Computer Science or Related Field
- Equivalent Experience
- 5+ Years Relevant Experience
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Member of Technical Staff - Efficient ML
Moonlake AI(2 months ago)
Machine Learning Scientist (L4/L5) - Multi-modal Algorithms for Games
Netflix(18 days ago)
Staff Machine Learning Engineer, Inference Optimisation
Wayve(5 months ago)
Internship / Thesis Student for Edge AI Optimization Research & Engineering (f/m/d)
NXP Semiconductors(19 days ago)
Senior Software Engineer - Model Performance
Inference(1 month ago)
Matterport - Senior ML Ops Engineer
CoStar News(5 months ago)