Senior Software Engineer - Model Performance
Inference
About this role
Inference.net is seeking a technical expert to optimize and accelerate AI inference systems using GPU and CUDA technologies. The role involves deep technical work on the full inference stack, aiming to improve performance, latency, throughput, and cost efficiency of large language model serving. It offers an opportunity to work on cutting-edge AI infrastructure in a collaborative startup environment.
About Inference
Inference.net is an innovative platform that specializes in AI inference solutions, enabling businesses to effectively train and host custom large language models tailored to their specific needs. The company offers a range of services, including serverless API and batch inference capabilities, designed to deliver improved performance and cost-efficiency compared to traditional models. With a focus on reducing latency and enhancing model accuracy, Inference.net empowers organizations to leverage AI technologies across various modalities such as text, image, and video. Their mission is to provide high-quality, reliable AI solutions that optimize deployment processes and drive operational excellence for their clients.
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Salary
$220k – $320k per year
Similar Jobs
Senior Deep Learning Inference Performance Architect
NVIDIA
Principal Software Engineer - AI Inference
NVIDIA
Member of Technical Staff, Model Efficiency
Cohere
Senior DL Algorithms Engineer - Inference Performance
NVIDIA
Staff Backend Software Engineer- (AI Platform)
Databricks