Solutions Architect, Inference Deployments
NVIDIA(1 month ago)
About this role
A Solutions Architect (Inference Focus) at NVIDIA focuses on advancing and demonstrating GPU-accelerated AI inference solutions using Kubernetes to help enterprises deploy generative AI and large language models in production. The role supports customer adoption and scale of inference platforms while showcasing NVIDIA GPU technologies and related tooling.
Required Skills
- Kubernetes
- GPU Orchestration
- TensorRT
- TensorRT-LLM
- Triton
- NVIDIA NIM
- GPU Operator
- MIG
- Model Optimization
- Performance Tuning
+6 more
Qualifications
- BS in Computer Science or Engineering
- NVIDIA Certified AI Engineer
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Generative AI Inference Engineer
Stability AI(3 months ago)
Senior Software Engineer I, Inference
CoreWeave(1 month ago)
Senior Site Reliability Engineer — Token Factory (Inference Platform)
Nebius(8 months ago)
Senior Software Engineer - Model Performance
Inference(1 month ago)
Senior Software Engineer, Cluster Orchestration
CoreWeave(1 month ago)
AI Inference Engineer - Speech
Zoom(1 month ago)