Compiler Architect
d-Matrix(1 month ago)
About this role
The Compiler Architect at d-Matrix is responsible for designing and implementing a scalable MLIR-based compiler framework focused on cloud-based AI inference, specifically for large-scale NLP and transformer models. This role includes architecting the end-to-end software pipeline for efficient mapping of AI models to distributed compute environments, optimizing for inference latency and throughput. The Compiler Architect will lead the development of compiler strategies, mentor a team of engineers, and collaborate with cross-functional teams to ensure seamless integration and performance in cloud deployments. A strong background in compiler design, ML inference, and cloud infrastructure is essential for success in this role.
Required Skills
- Compiler Architecture
- MLIR
- Cloud Inference
- Model Optimization
- Resource Provisioning
- Tensor Layout Optimization
- Distributed Execution
- AI Frameworks
- Leadership Skills
- Communication Skills
+6 more
Qualifications
- BS in Computer Science or Electrical Engineering
- MS in Computer Science or Electrical Engineering
- PhD in Computer Science or Electrical Engineering
- 12+ years of experience in Front End Compiler and systems software development
- Deep experience in designing or leading compiler efforts using MLIR, LLVM, Torch-MLIR, or similar frameworks
- Strong understanding of model optimization for inference
- Expertise in deploying ML models to heterogeneous compute environments
About d-Matrix
www.d-matrix.aid-Matrix is revolutionizing generative AI with its cutting-edge inference platform, Corsair™, designed for ultra-low latency and high throughput in data centers. The platform integrates memory-compute technology, enabling speeds of 60,000 tokens per second with just 1ms latency for advanced models, making it both efficient and sustainable. With a focus on scalability, d-Matrix's products cater to a wide range of enterprise needs, advancing the accessibility and performance of AI technologies. Additionally, d-Matrix is committed to sustainability, allowing organizations to achieve impressive performance while minimizing energy consumption.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at d-Matrix
Similar Jobs
Compiler Engineer
Cerebras Systems(11 days ago)
Sr. Software Engineer, AI Compiler
Tenstorrent(2 years ago)
Senior Compiler Engineer
Flux Computing(2 months ago)
Jr. LLVM Compiler Engineer
Cerebras Systems(21 days ago)
Staff Machine Learning Performance Engineer, Inference Optimisation
Wayve(2 months ago)
GCC Compiler Engineer
Tenstorrent(1 year ago)