AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026
NVIDIA(1 month ago)
About this role
A Software Engineer, Performance Analysis and Optimization for LLM Inference at NVIDIA focuses on improving the efficiency and scalability of large language model inference on NVIDIA computing platforms. The role centers on advancing compiler and kernel infrastructure to shape runtime behavior and hardware utilization for next-generation LLM deployments across data center and embedded platforms. The position requires close collaboration with compiler, hardware, kernel, and framework teams and influences performance of deployed models.
Required Skills
- C++
- Python
- Compiler Optimization
- IR
- Graph Transformations
- Kernel Tuning
- Profiling
- CUDA
- Deep Learning
- Performance Analysis
+3 more
Qualifications
- MS in Computer Science or Computer Engineering
- PhD in Computer Science or Computer Engineering
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Software Engineer, AI Compiler
Normal Computing(5 days ago)
AI Compiler Engineer
Intel(1 month ago)
Compiler Engineer
Cerebras Systems(1 month ago)
AI Software Development Engineer
Intel(1 month ago)
High Level Synthesis Compiler Engineer
France Cars SAS AAA(2 months ago)
AI Compiler Engineer
Intel(1 month ago)