Software Engineer, LLM Inference
NVIDIA(5 months ago)
About this role
An LLM inference framework developer engineer at NVIDIA based in Shanghai focused on GPU-accelerated inferencing software for Auto Driving and AI City. The role supports development and scaling of inference frameworks and contributes to advancing NVIDIA's AI stack by staying current with academic and industry ML developments.
Required Skills
- C++
- Software Design
- Debugging
- Performance Analysis
- Optimization
- LLMs
- Deep Learning
- PyTorch
- Collaboration
- Communication
Qualifications
- Masters or higher in Computer Engineering, Computer Science, or Applied Mathematics
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Senior Machine Learning Scientist
Dream Sports(1 month ago)
AI Software Architect
Intel(2 months ago)
SWE, Inference Performance, Onboard
Wayve(1 year ago)
ML Engineer, Large Language Models (LLM Training & Inference Optimization)
Nebius(11 months ago)
Principal Associate, Data Scientist - LLM Customization Team
Capital(1 month ago)
AI Inference Center Product Manager
Gruve(2 months ago)