Senior ML Platform Engineer - Lepton
NVIDIA(1 month ago)
About this role
An ML Platform Engineer at NVIDIA will design, build, and scale high-performance machine learning infrastructure to enable researchers and engineers to train and deploy advanced models on large GPU systems. The role focuses on creating reliable, automated, and reproducible platforms using modern Infrastructure-as-Code and software engineering practices to support large-scale distributed GPU clusters.
Required Skills
- Ansible
- Terraform
- SRE
- Python
- Go
- Kubernetes
- Docker
- Linux
- Networking
- Monitoring
+4 more
Qualifications
- BS in Computer Science or Engineering
- MS in Computer Science or Engineering
- Equivalent Experience
About NVIDIA
nvidia.comNVIDIA invents the GPU and drives advances in AI, HPC, gaming, creative design, autonomous vehicles, and robotics.
View more jobs at NVIDIA →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at NVIDIA
Similar Jobs
Site Reliability Engineer, AI/ML Infrastructure
Boson AI(6 days ago)
Senior Platform Engineer
Pluralis Research(2 months ago)
Senior Software Engineer - AI/ML Infra
GEICO(3 months ago)
Engineering Manager, HPC Kubernetes Platform
NorthMark Strategies(3 months ago)
Senior Platform Engineer, Machine Learning
Monzo(6 months ago)
Machine Learning Engineer - ML Training Platform
Pluralis Research(1 day ago)