AI Computing Software Development Engineer, TensorRT-LLM
NVIDIA(4 months ago)
About this role
A Software Development Engineer on NVIDIA's TensorRT-LLM team focused on building GPU-accelerated inference software for large language models. The role contributes to the deep learning inference platform used across NVIDIA products and works closely with research, software, and product teams to advance LLM inference technology.
Required Skills
- Python
- C++
- Software Design
- PyTorch
- HuggingFace
- Performance Optimization
- GPU Programming
- CUDA
- LLM Inference
- Deep Learning
Qualifications
- Master's Degree in Computer Engineering, Computer Science, or Applied Mathematics
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
AI Infrastructure Engineer
NIO(5 months ago)
Senior Software Engineer - Model Performance
Inference(1 month ago)
ML Platform Engineer
eBay(27 days ago)
LLM Inference Engineer
Hippocratic AI(3 months ago)
LLM Algorithmic Optimization Engineer
NIO(1 year ago)
Manager, Engineering - Hardware Acceleration (CUDA)
Torc Robotics(2 months ago)