Senior Deep Learning Software Engineer, Inference and Model Optimization
NVIDIA(1 month ago)
About this role
A Senior Deep Learning Software Engineer on NVIDIA's Algorithmic Model Optimization Team will help advance automated inference and deployment solutions for generative AI models, including LLMs and diffusion models. The role involves developing the TRT Model Optimizer platform used internally and externally to improve model inference efficiency and scalability. This position sits at the intersection of applied research and software engineering, contributing to NVIDIA's leadership in AI inference technologies.
Required Skills
- Deep Learning
- Python
- PyTorch
- HuggingFace
- CUDA
- TensorRT
- Triton
- Kernel Development
- Profiling
- Model Optimization
+5 more
Qualifications
- MS in Computer Science, AI, or Applied Math
- PhD in Computer Science, AI, or Applied Math
- Equivalent Experience
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Neural Network Optimization Engineer
Recraft(3 months ago)
Staff AI Software Engineer - Edge Model Optimization & Deployment
Field AI(20 days ago)
Senior Software Engineer - Model Performance
Inference(1 month ago)
Principal Machine Learning Researcher, On-Device Optimization
HP(1 month ago)
Staff Machine Learning Performance Engineer, Inference Optimisation
Wayve(3 months ago)
Senior Engineer, AI Systems
Samsung Research America(26 days ago)