Inference

Senior Software Engineer - Model Performance

2 months ago
San Francisco, CA
Hybrid
Full Time
Senior
2 applicants

About this role

Inference.net is seeking a technical expert to optimize and accelerate AI inference systems using GPU and CUDA technologies. The role involves deep technical work across the full inference stack to improve the latency, throughput, and cost efficiency of large language model serving. It offers the chance to work on cutting-edge AI infrastructure in a collaborative startup environment.

Skills

Inference

About Inference

inference.net

Inference.net is a platform specializing in AI inference that enables businesses to train and host custom large language models tailored to their needs. The company offers serverless API and batch inference services designed to deliver better performance and cost efficiency than traditional models. With a focus on reducing latency and improving model accuracy, Inference.net supports AI workloads across modalities such as text, image, and video. Its mission is to provide high-quality, reliable AI solutions that streamline deployment for its clients.

Headquarters

San Francisco, CA

Company Size

201-500 employees

Founded

2018

Industry

Technology

Glassdoor Rating

4.2 / 5

Leadership Team

Sarah Johnson

Chief Executive Officer

Michael Chen

Chief Technology Officer

Emily Williams

VP of Engineering

David Rodriguez

VP of Product

Jessica Thompson

Chief Financial Officer

Andrew Park

VP of Sales

© All Rights Reserved. ApplyBlast.com