NVIDIA

Manager, Large Language Model Inference

NVIDIA

3 months ago
United States
Hybrid
Full Time
Manager
1 applicant
View Job Listing
NVIDIA
Apply to 100+ jobs

About this role

A hands-on Engineering Manager at NVIDIA leading the development of next-generation LLM/VLM inference software for the TensorRT platform. The role combines technical ownership and people leadership to architect and ship production-grade inference runtimes across enterprise and edge GPUs. It involves close collaboration with researchers, GPU architects, and cross-functional teams to accelerate AI deployment and performance.

Skills

Qualifications

MS in Computer Science, Computer Engineering, AI or related fieldPhD in Computer Science, Computer Engineering, AI or related fieldEquivalent Experience
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

About NVIDIA

Headquarters

San Francisco, CA

Company Size

201-500 employees

Founded

2018

Industry

Technology

Glassdoor Rating

4.2 / 5

Leadership Team

Sarah Johnson

Chief Executive Officer

Michael Chen

Chief Technology Officer

Emily Williams

VP of Engineering

David Rodriguez

VP of Product

Jessica Thompson

Chief Financial Officer

Andrew Park

VP of Sales

Unlock Company Insights

View leadership team, funding history,
and employee contacts for NVIDIA.

Reveal Company Insights

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com