NVIDIA

Senior Deep Learning Software Engineer, Inference and Model Optimization

NVIDIA

1 month ago
Santa Clara, CA
Onsite
Full Time
Senior
0 applicants
View Job Listing
NVIDIA
Apply to 100+ jobs

About this role

A Senior Deep Learning Software Engineer on NVIDIA's Algorithmic Model Optimization Team works at the intersection of applied research and software engineering to improve inference efficiency for generative AI models (LLMs and diffusion models). The role contributes to the TRT Model Optimizer platform and supports internal and external teams by enabling scalable, automated deployment of optimized models. It spans high-level frameworks like PyTorch and HuggingFace as well as low-level kernel and deployment work in CUDA, Triton, and TensorRT.

Skills

Qualifications

Masters in Computer Science or related fieldPhD in Computer Science or related field
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

About NVIDIA

Headquarters

San Francisco, CA

Company Size

201-500 employees

Founded

2018

Industry

Technology

Glassdoor Rating

4.2 / 5

Leadership Team

Sarah Johnson

Chief Executive Officer

Michael Chen

Chief Technology Officer

Emily Williams

VP of Engineering

David Rodriguez

VP of Product

Jessica Thompson

Chief Financial Officer

Andrew Park

VP of Sales

Unlock Company Insights

View leadership team, funding history,
and employee contacts for NVIDIA.

Reveal Company Insights

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com