Senior ML Engineer (Token Factory)
Nebius (13 days ago)
About this role
This position on the Token Factory team at Nebius Cloud focuses on building an inference platform that makes foundation models fast, reliable, and easy to deploy at massive scale. The role is part of Nebius’s global AI cloud infrastructure, which supports tens of thousands of GPUs and serves customers across text, vision, audio, and multimodal AI. It sits within the company’s engineering and R&D hubs and works to advance AI cloud compute capabilities.
Required Skills
- C++
- GPU Programming
- Kernel Development
- Runtime Components
- Performance Optimization
- Profiling
- Debugging
- Memory Management
- CUDA
- ROCm
+2 more
About Nebius
nebius.com
Nebius is a cloud platform for AI explorers that provides GPU‑accelerated infrastructure to build, tune, and run machine learning models and applications. It offers access to top‑tier NVIDIA GPUs and tooling designed to maximize efficiency and performance for training, fine‑tuning, and inference. Nebius focuses on simplifying ML workflows so researchers, developers, and teams can iterate faster without managing hardware.