Senior HPC Cluster Engineer
Nebius(1 year ago)
About this role
A Senior HPC Cluster Engineer at Nebius will join the GPU & InfiniBand team to help develop and optimize the company’s hyperscaler cloud platform for AI workloads. The role operates within a global R&D organization and focuses on ensuring high-performance, secure multi-GPU and InfiniBand-based HPC environments while collaborating with hardware virtualization and cloud teams. The position offers flexible working arrangements and opportunities for professional growth.
Required Skills
- GPU Clusters
- InfiniBand
- Performance Tuning
- Troubleshooting
- Hardware Integration
- Automation
- Monitoring
- Device Management
- System Software
- Linux
+15 more
About Nebius
nebius.comNebius is a cloud platform for AI explorers that provides GPU‑accelerated infrastructure to build, tune, and run machine learning models and applications. It offers access to top‑tier NVIDIA GPUs and tooling designed to maximize efficiency and performance for training, fine‑tuning, and inference. Nebius focuses on simplifying ML workflows so researchers, developers, and teams can iterate faster without managing hardware.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Nebius
Similar Jobs
Senior HPC Developer - GPU and Networking
Clockwork.io(7 days ago)
Senior Systems Engineer - AI Infrastructure
Clockwork.io(6 days ago)
Senior HPC Operations Engineer
Lambda(2 months ago)
Staff Software Engineer, GPU Infrastructure (HPC)
Cohere(2 months ago)
Site Reliability Engineer, AI/ML Infrastructure
Boson AI(8 days ago)
HPC (High-Performance Computing)
Talent Worx(10 months ago)