Senior ML Engineer (Token Factory)
Nebius(5 months ago)
About this role
The role is part of Token Factory within Nebius Cloud, building an inference and fine-tuning platform for foundation models (text, vision, audio, multimodal) at massive scale on a large GPU cloud. The position focuses on making training and deployment fast, reliable, and efficient across tens of thousands of GPUs. It requires deep technical expertise in distributed LLM training and inference and involves close collaboration with engineering and research teams.
Required Skills
- Fine-Tuning
- LoRA
- Inference Optimization
- Speculative Decoding
- Quantization
- JAX
- Distributed Training
- Model Reimplementation
- ML Theory
- Reinforcement Learning
+10 more
About Nebius
nebius.comNebius is a cloud platform for AI explorers that provides GPU‑accelerated infrastructure to build, tune, and run machine learning models and applications. It offers access to top‑tier NVIDIA GPUs and tooling designed to maximize efficiency and performance for training, fine‑tuning, and inference. Nebius focuses on simplifying ML workflows so researchers, developers, and teams can iterate faster without managing hardware.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Nebius
Similar Jobs
Generative AI - ML System Engineering
Meshy(1 year ago)
Member of Technical Staff - Efficient ML
Moonlake AI(2 months ago)
Founding Lead Machine Learning Engineer
BJAK(1 month ago)
ML Research Intern in Agentic Runtime Systems
Constructor Knowledge(11 months ago)
Multimodal AI Engineer, Document Understanding
LlamaIndex(2 months ago)
Founding Lead Machine Learning Engineer
BJAK(1 month ago)