Member of Technical Staff, Model Efficiency

Cohere

4 months ago

2.9 on Glassdoor

New York, NY

Hybrid

Full Time

Senior

1 applicant

About this role

The Member of Technical Staff, Model Efficiency at Cohere improves the performance of large language models (LLMs) by implementing optimizations that reduce inference latency and increase throughput. The role involves deep technical work across the inference stack: diagnosing bottlenecks and collaborating with modeling and systems teams to deploy performance improvements. Candidates should have strong programming skills in C++ or Python, experience with LLM inference ecosystems, and a background in performance optimization, particularly on GPUs and distributed systems.

About Cohere

Cohere is an AI company that builds large language models and enterprise AI platforms for businesses and developers.

Recent company news

Nvidia-Backed Cohere Forms AI Alliance With Telecom Firm BCE

15 hours ago

Enterprise AI startup Cohere tops revenue target as momentum builds to IPO: Investor memo

1 month ago

Aston Martin F1 Team

Cohere joins Aston Martin Aramco as Official Generative AI Partner to help accelerate AI innovation

2 weeks ago

The AI Model Race May Have Slowed Down for Cohere

Nov 17, 2025

The Mobile Network

Cohere Technologies drives ahead with innovation vision

1 week ago

About Cohere

Headquarters

San Francisco, CA

Company Size

201-500 employees

Founded

2018

Industry

Technology

Glassdoor Rating

4.2 / 5

Salary

$206k – $274k per year

More jobs at Cohere

Head of Global Partner Ecosystem

Cohere

Solutions Architect - Defence and National Security

Cohere

Senior Product Designer, North

Cohere

Product Manager, Agent Harness & Modelling

Cohere

Similar Jobs

Senior Software Engineer - Model Performance

Inference

Member of Engineering (Pre-training and inference fault tolerance)

poolside

AI / ML Platform Engineer

Whatnot

Principal Software Engineer - AI Inference

NVIDIA

Manager, Large Language Model Inference

NVIDIA

Engineering Manager, Inference Routing and Performance

Anthropic
