Architecture Intern - Inference
Etched(1 month ago)
About this role
The Architecture Intern - Inference at Etched will contribute to the design and optimization of next-generation AI accelerators focused on transformer workloads. Responsibilities include porting state-of-the-art models, enhancing runtime capabilities, optimizing communication layers, and developing high-performance software components. The role requires proficiency in C++ or Rust, an understanding of distributed systems, and familiarity with transformer architectures. The intern will work on cutting-edge problems alongside industry leaders, directly influencing the infrastructure for AI performance.
Required Skills
- Model Porting
- Programming Abstractions
- Testing Capabilities
- Runtime Enhancement
- Multi-node Inference
- Intra-node Execution
- State Management
- Error Handling
- Routing Optimization
- Communication Layers
+24 more
Qualifications
- Bachelor’s degree in computer science, computer engineering, or a related field
- Master’s degree in computer science, computer engineering, or a related field
- PhD in computer science, computer engineering, or a related field
About Etched
www.etched.comEtched is at the forefront of advanced computing technology with its groundbreaking product, Sohu, the world's first transformer ASIC. By etching transformer architecture directly into silicon, Etched delivers server solutions that provide dramatically faster and more cost-effective AI model inference compared to traditional GPU-based systems. Their innovative technology is designed to optimize the performance of AI applications, enabling unprecedented processing capabilities for next-generation models. With a commitment to pushing the boundaries of what's possible in AI, Etched is poised to revolutionize the industry.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Etched
Similar Jobs
Staff Machine Learning Engineer, Inference Optimisation
Wayve(4 months ago)
Engineering Manager, GPU Kernel
Wayve(9 months ago)
Staff Machine Learning Performance Engineer, Inference Optimisation
Wayve(2 months ago)
Research Intern, Inference (Summer 2026)
Together AI(28 days ago)
Sr Software Engineer, Embedded Machine Learning
Cariad, Inc.(7 days ago)
Staff Software Engineer, Inference
Anthropic(1 day ago)