Software Engineer, Model Performance Tooling
Baseten
About this role
A Software Engineer, Model Performance Tooling at Baseten develops automated benchmarking and diagnostic tools for high-performance computing and large language model (LLM) systems. The role covers performance benchmarking, infrastructure validation, and tool development to ensure systems are ready for production AI workloads. The engineer will analyze the performance of GPU clusters, build real-time monitoring systems, and automate testing processes, while also gaining expertise in GPU orchestration and LLM inference.
About Baseten
www.baseten.co
Baseten is a cutting-edge platform designed for deploying artificial intelligence models in production efficiently. It enables businesses to serve optimized open-source and custom models through a robust and reliable model delivery network. By streamlining machine learning operations, Baseten simplifies the integration of AI into applications, allowing companies to leverage the power of AI without the complexities typically associated with model deployment. The platform is favored for its speed, reliability, and user-friendly interface, catering to organizations seeking to enhance their AI capabilities.
Recent company news
How Baseten achieves 225% better cost-performance for AI inference (and you can too)
Sep 5, 2025
AI inference startup Baseten hits $5B valuation in $300M round backed by Nvidia
1 month ago
Exclusive: Baseten, AI inference unicorn, raises $150 million at $2.15 billion valuation
Sep 5, 2025
Exclusive | Nvidia Invests $150 Million in AI Inference Startup Baseten
1 month ago
AI Inference Startup Baseten Raises $300 Million and Gains Backing From Nvidia
1 month ago
About Baseten
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Salary
$95k – $146k
per year
Similar Jobs
Compute Architecture Software Engineer
NVIDIA
Senior Software Engineer - Model Performance
Inference
ML Platform Engineer
eBay
AI Performance Architect
Applied Materials
Distributed Training & Inference Optimization Engineer (LLM) - GPU Optimization Department (GPUOD)
Rakuten Group, Inc.
Senior Deep Learning Inference Performance Architect
NVIDIA