NVIDIA

AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026

NVIDIA(1 month ago)

HybridFull TimeMedior$124,000 - $218,500Engineering
Apply Now

About this role

A Software Engineer, Performance Analysis and Optimization for LLM Inference at NVIDIA focuses on improving the efficiency and scalability of large language model inference on NVIDIA computing platforms. The role centers on advancing compiler and kernel infrastructure to shape runtime behavior and hardware utilization for next-generation LLM deployments across data center and embedded platforms. The position requires close collaboration with compiler, hardware, kernel, and framework teams and influences performance of deployed models.

View Original Listing

Required Skills

  • C++
  • Python
  • Compiler Optimization
  • IR
  • Graph Transformations
  • Kernel Tuning
  • Profiling
  • CUDA
  • Deep Learning
  • Performance Analysis

+3 more

Qualifications

  • MS in Computer Science or Computer Engineering
  • PhD in Computer Science or Computer Engineering
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

View more jobs at NVIDIA

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com