Member of Technical Staff, Inference
OneCrew
About this role
An inference runtime engineer at Inferact works on optimizing AI inference engines to support large language models and diffusion models across diverse hardware and architectures. The role involves pushing the boundaries of LLM and diffusion model serving through core engine innovations.
Skills
Qualifications
About OneCrew
ashbyhq.comAshby is an all‑in‑one recruiting SaaS that consolidates applicant tracking (ATS), analytics, scheduling, and CRM into a single platform for hiring teams. It combines candidate tracking, people analytics, and automation to help companies source, evaluate, and hire more efficiently while improving interviewer and candidate experience. Built for ambitious startups through enterprise teams, Ashby emphasizes clean integrations, customizable workflows, and data‑driven hiring insights.
Recent company news
OneCrew Raises $7.5 Million in Series A
Aug 21, 2025
Belgian VC Entourage backs San Francisco-based SaaS firm OneCrew; eyes Europan expansion in 2025
Nov 11, 2024
OneCrew raises $7.5M to digitise the $150B paving industry with unified contractor platform — TFN
Aug 20, 2025
Getting a Smooth Takeoff to Using Paving Company Software
Jun 7, 2023
OneCrew Raises $3.25 Million in Seed Round
Nov 12, 2024
About OneCrew
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Unlock Company Insights
View leadership team, funding history,
and employee contacts for OneCrew.
Salary
$200k – $400k
per year
More jobs at OneCrew
Similar Jobs
AI Software Engineer, LLM Inference Performance Analysis - New College Grad 2026
NVIDIA
Member of Technical Staff, ML Engineer
Mirage
ML Platform Engineer
eBay
Senior Deep Learning Architect, LLM Inference
NVIDIA
Inference Technical Lead, On-Device Transformers
OpenAI
Member of Technical Staff, Model Efficiency
Cohere