System Engineer (Token Factory)
Nebius
About this role
Nebius Token Factory is building a large-scale AI inference platform on Nebius Cloud to make foundation models fast, reliable, and easy to deploy. This engineering role contributes to the low-level runtime and kernel stack that powers GPU-based inference across diverse hardware. The position operates within the engineering organization and works closely with ML and backend teams to deliver performant, production-grade inference services.
Skills
About Nebius
nebius.comNebius is a cloud platform for AI explorers that provides GPU‑accelerated infrastructure to build, tune, and run machine learning models and applications. It offers access to top‑tier NVIDIA GPUs and tooling designed to maximize efficiency and performance for training, fine‑tuning, and inference. Nebius focuses on simplifying ML workflows so researchers, developers, and teams can iterate faster without managing hardware.
Recent company news
NVIDIA and Nebius Partner to Scale Full-Stack AI Cloud
1 day ago
Trending tickers: Nebius Group, Hims & Hers, BMW, Savills and On The Beach
5 hours ago
Nvidia Invests $2B in Nebius (NBIS) Stock. What It Means for CoreWeave, AI Trade
17 hours ago
Where Will Nebius Group Be in 5 Years?
4 hours ago
Nvidia invests US$2 billion in AI cloud firm Nebius
4 hours ago
About Nebius
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Unlock Company Insights
View leadership team, funding history,
and employee contacts for Nebius.
Salary
$169k – $227k
per year
More jobs at Nebius
Similar Jobs
Senior Software Engineer, Deep Learning Inference
NVIDIA
Engineering Manager, Inference ML Runtime
Cerebras Systems
DL Algorithms Engineer - Cosmos - New College Graduate 2026
NVIDIA
Research Engineering, Inference
Bitdeer
Deep Learning Software Engineer, FlashInfer - New College Grad 2025
NVIDIA
Forward Deployed ML Engineer
Modal