Applied Machine Learning Engineer

Inference

2 months ago

San Francisco, CA

Hybrid

Full Time

Medior

5 applicants

View Job Listing

Apply to 100+ jobs

About this role

An Applied Machine Learning Engineer at Inference.net is responsible for building and enhancing the core ML systems for a custom model training platform, overseeing the entire training lifecycle from data intake to model delivery. The role involves creating and maintaining data processing pipelines, developing evaluation frameworks, and applying advanced ML techniques to ensure model quality at scale. This position requires a strong background in AI model training, particularly with PyTorch and transformer architectures, and includes collaboration with infrastructure teams to optimize training workflows on a GPU fleet.

Skills

About Inference

inference.net

Inference.net is an innovative platform that specializes in AI inference solutions, enabling businesses to effectively train and host custom large language models tailored to their specific needs. The company offers a range of services, including serverless API and batch inference capabilities, designed to deliver improved performance and cost-efficiency compared to traditional models. With a focus on reducing latency and enhancing model accuracy, Inference.net empowers organizations to leverage AI technologies across various modalities such as text, image, and video. Their mission is to provide high-quality, reliable AI solutions that optimize deployment processes and drive operational excellence for their clients.