Audio Inference Engineer, Model Efficiency
Cohere(2 months ago)
About this role
The Audio Inference Engineer for Model Efficiency at Cohere is responsible for optimizing audio model serving efficiency by enhancing core metrics such as latency, throughput, and quality. This role involves identifying bottlenecks in existing systems and implementing innovative solutions for audio processing and streaming workloads. Collaboration with training and serving infrastructure teams is essential to ensure seamless integration of model development and real-time audio inference deployment. Candidates should have significant experience in high-performance audio or machine learning inference systems, with proficiency in C++ and Python, and a focus on achieving results.
Required Skills
- Machine Learning
- Audio Processing
- System Optimization
- C++
- Python
- Deep Learning
- Streaming Architecture
- Model Parallelization
- Inference Frameworks
- Sequence Modeling
+4 more
About Cohere
cohere.comSanity is a platform that provides flexible content management solutions tailored for developers, marketers, and content creators. By utilizing a real-time collaborative editor and structured content, it allows users to build and manage high-performance applications and websites. Sanity’s APIs and flexible data model enable seamless integration with various frameworks and technologies, empowering users to deliver customized content experiences. With features like query-driven content fetching and an extensible plugin system, Sanity is designed to enhance productivity and scalability for teams of all sizes.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Cohere
Similar Jobs
Lead ML Inference Engineer, Advertising
Roku(1 month ago)
Director of Engineering, Inference Services
CoreWeave(1 month ago)
Member of Technical Staff, Video Generation - Audio
xAI(1 month ago)
Senior Software Engineer, Data Platforms
Domino Data Lab(5 days ago)
Senior Software Engineer, Inference Platform
MongoDB(27 days ago)
Research Intern, Inference (Summer 2026)
Together AI(28 days ago)