Senior Deep Learning Software Engineer, Inference and Model Optimization
NVIDIA
About this role
A Senior Deep Learning Software Engineer on NVIDIA's Algorithmic Model Optimization Team works at the intersection of applied research and software engineering to improve inference efficiency for generative AI models (LLMs and diffusion models). The role contributes to the TRT Model Optimizer platform and supports internal and external teams by enabling scalable, automated deployment of optimized models. It spans high-level frameworks like PyTorch and HuggingFace as well as low-level kernel and deployment work in CUDA, Triton, and TensorRT.
Skills
Qualifications
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
Recent company news
NVIDIA Ignites the Next Industrial Revolution in Knowledge Work With Open Agent Development Platform
2 days ago
Nvidia CEO Says Company Is Firing Up H200 Production for China
1 day ago
Nvidia CEO Huang says company sees more than $1 trillion in sales through 2027
1 day ago
Nvidia's one of the fastest growing companies with one of the lowest valuations, says Jim Cramer
14 hours ago
Nvidia is reskinning games with AI. Gamers are angry about it, and wrong
1 day ago
About NVIDIA
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Unlock Company Insights
View leadership team, funding history,
and employee contacts for NVIDIA.
Salary
$152k – $288k
per year
More jobs at NVIDIA
Similar Jobs
LLM Algorithmic Optimization Engineer - Intern
NIO
Neural Network Optimization Engineer
Recraft
Manager, Engineering - Hardware Acceleration (CUDA)
Torc Robotics
LLM Algorithmic Optimization Engineer
NIO
Deep Learning Intern, Model Optimization
Intrinsic
Lead Machine Learning Engineer (GenAI & Multimodal Systems) – Digital Innovation Agency
Truelogic Software