NVIDIA

Solutions Architect, Inference Deployments

NVIDIA(1 month ago)

HybridFull TimeSenior$152,000 - $218,500Solutions Architecture
Apply Now

About this role

A Solutions Architect (Inference Focus) at NVIDIA focuses on advancing and demonstrating GPU-accelerated AI inference solutions using Kubernetes to help enterprises deploy generative AI and large language models in production. The role supports customer adoption and scale of inference platforms while showcasing NVIDIA GPU technologies and related tooling.

View Original Listing

Required Skills

  • Kubernetes
  • GPU Orchestration
  • TensorRT
  • TensorRT-LLM
  • Triton
  • NVIDIA NIM
  • GPU Operator
  • MIG
  • Model Optimization
  • Performance Tuning

+6 more

Qualifications

  • BS in Computer Science or Engineering
  • NVIDIA Certified AI Engineer
NVIDIA

About NVIDIA

nvidia.com

NVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.

View more jobs at NVIDIA

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com