Head of Inference Kernels
Etched(3 months ago)
About this role
The Head of Inference Kernels at Etched leads a high-performance team to develop optimized kernels and inference stacks for state-of-the-art transformer models, aiming for over 10x performance enhancement compared to existing benchmarks. The role encompasses architecting best-in-class inference performance, co-designing innovative algorithmic improvements, and ensuring alignment across cross-functional teams. The ideal candidate will possess extensive experience in designing GPU kernels, a deep understanding of transformer architectures, and a demonstrated track record in managing effective engineering teams.
Required Skills
- Inference Performance
- Inference Mega Kernels
- Model Mapping
- Algorithmic Innovation
- Team Building
- Performance Alignment
- GPU Kernel Optimization
- Deep Learning
- Transformer Architecture
- Roofline Models
+7 more
About Etched
www.etched.comEtched is at the forefront of advanced computing technology with its groundbreaking product, Sohu, the world's first transformer ASIC. By etching transformer architecture directly into silicon, Etched delivers server solutions that provide dramatically faster and more cost-effective AI model inference compared to traditional GPU-based systems. Their innovative technology is designed to optimize the performance of AI applications, enabling unprecedented processing capabilities for next-generation models. With a commitment to pushing the boundaries of what's possible in AI, Etched is poised to revolutionize the industry.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Etched
Similar Jobs
Staff Machine Learning Performance Engineer, Inference Optimisation
Wayve(2 months ago)
Software Engineer, Staff - SIMD Kernels
d-Matrix(1 month ago)
Senior Staff ML Researcher - LLM Algorithmic Optimization
d-Matrix(2 months ago)
Senior Staff ML Researcher - LLM Algorithmic Optimization
d-Matrix(3 months ago)
Compiler Architect
d-Matrix(1 month ago)
Software Engineer, Senior Staff - SIMD Kernels
d-Matrix(1 month ago)