Senior Deep Learning Software Engineer, Inference and Model Optimization
NVIDIA
About this role
A Senior Deep Learning Software Engineer on NVIDIA's Algorithmic Model Optimization Team will help advance automated inference and deployment solutions for generative AI models, including LLMs and diffusion models. The role involves developing the TRT Model Optimizer platform used internally and externally to improve model inference efficiency and scalability. This position sits at the intersection of applied research and software engineering, contributing to NVIDIA's leadership in AI inference technologies.
Skills
Qualifications
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
Recent company news
NVIDIA Ignites the Next Industrial Revolution in Knowledge Work With Open Agent Development Platform
2 days ago
Nvidia CEO Says Company Is Firing Up H200 Production for China
1 day ago
Nvidia CEO Huang says company sees more than $1 trillion in sales through 2027
1 day ago
Nvidia's one of the fastest growing companies with one of the lowest valuations, says Jim Cramer
14 hours ago
Nvidia is reskinning games with AI. Gamers are angry about it, and wrong
1 day ago
About NVIDIA
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Unlock Company Insights
View leadership team, funding history,
and employee contacts for NVIDIA.
Salary
$184k – $357k
per year
More jobs at NVIDIA
Similar Jobs
Member of Technical Staff, Inference
Ashby
Gen AI Model Artist
DNEG
ML Engineer, Large Language Models (LLM Training & Inference Optimization)
Nebius
Generative AI Inference Engineer
Stability AI
Applied Machine Learning Scientist
Variational AI
Machine Learning and Generative AI Research Scientist
Hewlett Packard Enterprise