d-Matrix

Compiler Architect

d-Matrix(1 month ago)

HybridFull TimeSenior$190,000 - $300,000R&D - CTO & Architecture
Apply Now

About this role

The Compiler Architect at d-Matrix is responsible for designing and implementing a scalable MLIR-based compiler framework focused on cloud-based AI inference, specifically for large-scale NLP and transformer models. This role includes architecting the end-to-end software pipeline for efficient mapping of AI models to distributed compute environments, optimizing for inference latency and throughput. The Compiler Architect will lead the development of compiler strategies, mentor a team of engineers, and collaborate with cross-functional teams to ensure seamless integration and performance in cloud deployments. A strong background in compiler design, ML inference, and cloud infrastructure is essential for success in this role.

View Original Listing

Required Skills

  • Compiler Architecture
  • MLIR
  • Cloud Inference
  • Model Optimization
  • Resource Provisioning
  • Tensor Layout Optimization
  • Distributed Execution
  • AI Frameworks
  • Leadership Skills
  • Communication Skills

+6 more

Qualifications

  • BS in Computer Science or Electrical Engineering
  • MS in Computer Science or Electrical Engineering
  • PhD in Computer Science or Electrical Engineering
  • 12+ years of experience in Front End Compiler and systems software development
  • Deep experience in designing or leading compiler efforts using MLIR, LLVM, Torch-MLIR, or similar frameworks
  • Strong understanding of model optimization for inference
  • Expertise in deploying ML models to heterogeneous compute environments
d-Matrix

About d-Matrix

www.d-matrix.ai

d-Matrix is revolutionizing generative AI with its cutting-edge inference platform, Corsair™, designed for ultra-low latency and high throughput in data centers. The platform integrates memory-compute technology, enabling speeds of 60,000 tokens per second with just 1ms latency for advanced models, making it both efficient and sustainable. With a focus on scalability, d-Matrix's products cater to a wide range of enterprise needs, advancing the accessibility and performance of AI technologies. Additionally, d-Matrix is committed to sustainability, allowing organizations to achieve impressive performance while minimizing energy consumption.

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com