Technical Product Manager (Cluster Experience)
Nebius(1 month ago)
About this role
A Product Manager on the Cluster Experience team at Nebius defines how customers experience GPU clusters for large-scale ML training and inference. The role focuses initially on reliability, performance, and observability for multi-node distributed systems and expands into UX, operational tooling, and advanced cluster workflows. It is a deeply technical PM role suited to candidates from ML infrastructure, distributed systems, SRE, or cloud engineering backgrounds who want to grow into product.
Required Skills
- Reliability
- Performance
- Observability
- Product Direction
- Cross Functional
- Customer Research
- Distributed Systems
- ML Infrastructure
- Orchestrators
- Performance Tuning
+2 more
About Nebius
nebius.comNebius is a cloud platform for AI explorers that provides GPU‑accelerated infrastructure to build, tune, and run machine learning models and applications. It offers access to top‑tier NVIDIA GPUs and tooling designed to maximize efficiency and performance for training, fine‑tuning, and inference. Nebius focuses on simplifying ML workflows so researchers, developers, and teams can iterate faster without managing hardware.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Nebius
Similar Jobs
Senior Software Engineer - AI Infra Visibility
Clockwork.io(12 days ago)
Software Engineer - AI Infra Visibility
Clockwork.io(12 days ago)
Senior Software Engineer, Cluster Orchestration
CoreWeave(11 days ago)
Senior Software Engineer – Backend
Vizcom(1 month ago)
Senior HPC Developer - GPU and Networking
Clockwork.io(7 days ago)
ML Cluster Operations Engineer
TensorWave(2 months ago)