Senior Deep Learning Architect, LLM Inference
NVIDIA
About this role
NVIDIA is seeking a Senior Deep Learning Architect for LLM Inference to advance inference performance for large language models and preserve TRT-LLM's leadership. The role sits at the intersection of GPU hardware and deep learning software. It involves collaborating with internal teams and external partners to shape the direction of inference serving and to support GPU product launches.
About NVIDIA
nvidia.com
NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
Salary
$184k – $357k
per year