Nebius

HPC System Engineer

Nebius(1 month ago)

HybridFull TimeSenior$159,271 - $214,151 (estimated)Engineering
Apply Now

About this role

The Systems Engineer (Cloudmeter) will support benchmarking and evaluation of GPU platforms for machine learning and AI workloads, enabling data-driven optimization of hardware and software stacks. The role collaborates with hardware and development teams to perform acceptance testing, run experiments across diverse GPU configurations, and guide platform decisions for performance and scalability. The position contributes to next-generation hardware development by validating performance, stability, and compatibility of GPU clusters.

View Original Listing

Required Skills

  • Unix/Linux
  • Python
  • Bash
  • CUDA
  • NCCL
  • Drivers
  • Troubleshooting
  • Docker
  • Kubernetes
  • GPU Benchmarking

+6 more

Nebius

About Nebius

nebius.com

Nebius is a cloud platform for AI explorers that provides GPU‑accelerated infrastructure to build, tune, and run machine learning models and applications. It offers access to top‑tier NVIDIA GPUs and tooling designed to maximize efficiency and performance for training, fine‑tuning, and inference. Nebius focuses on simplifying ML workflows so researchers, developers, and teams can iterate faster without managing hardware.

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com