Cohere

Staff Research Engineer, Model Efficiency

Cohere

4 months ago
New York, NY
Hybrid
Full Time
Senior
1 applicant
View Job Listing
Cohere
Apply to 100+ jobs

About this role

The Staff Research Engineer, Model Efficiency at Cohere is responsible for enhancing the inference efficiency of Large Language Models (LLMs) within AI systems. This role involves developing and deploying novel techniques to optimize model architecture, decoding algorithms, and software/hardware co-design for GPU acceleration. Candidates should possess a PhD in Machine Learning, significant experience in model efficiency techniques, strong software engineering skills, and a background in research publications.

Skills

Qualifications

PhD in Machine Learning or a related fieldPublications at top-tier conferences and venues (ICLR, ACL, NeurIPS)
Cohere

About Cohere

cohere.com

Cohere is an AI company that builds large language models and enterprise AI platforms for businesses and developers.

About Cohere

Headquarters

San Francisco, CA

Company Size

201-500 employees

Founded

2018

Industry

Technology

Glassdoor Rating

4.2 / 5

Leadership Team

Sarah Johnson

Chief Executive Officer

Michael Chen

Chief Technology Officer

Emily Williams

VP of Engineering

David Rodriguez

VP of Product

Jessica Thompson

Chief Financial Officer

Andrew Park

VP of Sales

Unlock Company Insights

View leadership team, funding history,
and employee contacts for Cohere.

Reveal Company Insights

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com