Compute Architecture Software Engineer
NVIDIA (4 months ago)
About this role
An LLM Inference Software Engineer at NVIDIA works on the TensorRT-LLM (TRTLLM) project to accelerate large language model inference on GPUs, in environments ranging from single PCs to large GPU clusters. The role sits within a collaborative engineering team focused on advancing AI infrastructure and performance.
Required Skills
- GPU Programming
- LLM Inference
- Python
- C++
- CUDA
- Deep Learning
- Performance Optimization
- Problem Solving
- Collaboration
Qualifications
- BS or Above in Computer Science, Engineering, or Related Field
About NVIDIA
nvidia.com
NVIDIA invented the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.