Performance Engineer - Inference
Cerebras Systems(13 days ago)
About this role
An engineer on the Inference Performance team at Cerebras works at the intersection of hardware and software to improve ML model inference speed and throughput on the Wafer Scale Engine. The role focuses on performance modeling, system-level analysis, and building tooling for performance projection and diagnostics. Team members contribute to advancing state-of-the-art inference capabilities for large-scale ML applications.
Required Skills
- Performance Modeling
- Kernel Optimization
- Compiler Algorithms
- Runtime Debugging
- Tooling Development
- Performance Profiling
- System Analysis
- Computer Architecture
- Simulator Experience
- LLM Math
+3 more
Qualifications
- Bachelors in Electrical Engineering
- Masters in Electrical Engineering
- PhD in Electrical Engineering
- Bachelors in Computer Science
- Masters in Computer Science
- PhD in Computer Science
About Cerebras Systems
cerebras.aiCerebras builds purpose‑built AI compute systems centered on its wafer‑scale processors to accelerate training and inference of large neural networks. Their integrated hardware‑and‑software platform delivers high throughput, low latency, and very large on‑chip memory/interconnect to shorten time‑to‑train for demanding AI workloads. Cerebras targets research labs and enterprises that need to scale experiments and deploy large models more quickly, pairing systems, tooling, and support to simplify large‑model development.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Cerebras Systems
Senior Technical Program Manager
Cerebras Systems(4 days ago)
Senior Software Development Engineer in Test (SDET) - AI Cluster
Cerebras Systems(4 days ago)
Senior Software Development Engineer in Test (SDET) - AI Cluster Networking and Security
Cerebras Systems(4 days ago)
Data Center Construction Project Manager
Cerebras Systems(4 days ago)
Similar Jobs
Staff Machine Learning Performance Engineer, Inference Optimisation
Wayve(2 months ago)
Senior ML Engineer (Token Factory)
Nebius(13 days ago)
System Engineer (Token Factory)
Nebius(6 months ago)
Lead ML Inference Engineer, Advertising
Roku(1 month ago)
Head of Inference Kernels
Etched(3 months ago)
System Engineer (Token Factory)
Nebius(21 days ago)