Member of Technical Staff - Efficient ML
Moonlake AI (2 months ago)
About this role
The Member of Technical Staff - Efficient ML at Moonlake optimizes machine learning training efficiency and inference. Training work emphasizes techniques such as gradient checkpointing; inference work covers low-latency serving and optimization strategies including quantization and pruning. The role also includes improving GPU and kernel performance through advanced profiling, and keeping infrastructure reliable through multi-node job management and GPU failure handling. The position is on-site and involves collaborating with the team to develop real-time interactive content solutions.
Required Skills
- Dataloaders
- Fusion
- Gradient Checkpointing
- FSDP
- ZeRO
- Tensor Parallel
- NCCL Tuning
- Nsight Profiling
- Triton
- CUDA
+16 more
About Moonlake AI
moonlakeai.com
Moonlake AI is a platform that lets users create and code interactive games and virtual worlds. The company focuses on intuitive tools that help users bring their creative visions to life, making game development accessible to a broader audience. By integrating advanced AI technology into its services, Moonlake AI positions itself as a pioneer in reshaping how games and interactive experiences are conceived and built.