NVIDIA

Senior Deep Learning Software Engineer, Inference and Model Optimization

NVIDIA(1 month ago)

United States, California, Santa Clara, CAOnsiteFull TimeSenior$184,000 - $356,500Engineering
Apply Now

About this role

A Senior Deep Learning Software Engineer on NVIDIA's Algorithmic Model Optimization Team will help advance automated inference and deployment solutions for generative AI models, including LLMs and diffusion models. The role involves developing the TRT Model Optimizer platform used internally and externally to improve model inference efficiency and scalability. This position sits at the intersection of applied research and software engineering, contributing to NVIDIA's leadership in AI inference technologies.

View Original Listing

Required Skills

  • Deep Learning
  • Python
  • PyTorch
  • HuggingFace
  • CUDA
  • TensorRT
  • Triton
  • Kernel Development
  • Profiling
  • Model Optimization

+5 more

Qualifications

  • MS in Computer Science, AI, or Applied Math
  • PhD in Computer Science, AI, or Applied Math
  • Equivalent Experience
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

View more jobs at NVIDIA

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com