Software Engineer, Model Performance Tooling
Baseten (28 days ago)
About this role
A Software Engineer, Model Performance Tooling at Baseten is responsible for developing automated benchmarking and diagnostic tools for high-performance computing and large language model (LLM) systems. This role involves performance benchmarking, infrastructure validation, and tool development to ensure systems are optimized for AI production readiness. The engineer will analyze the performance of GPU clusters, create real-time monitoring systems, and automate testing processes, while also gaining expertise in GPU orchestration and LLM inference.
Required Skills
- Performance Benchmarking
- Infrastructure Validation
- Model Dev Experience
- Tool Development
- Deep Hardware Profiling
- Monitoring & Observability
- Continuous Integration
- Optimization Automation
- Systems Understanding
- Automation Mindset
About Baseten
www.baseten.co
Baseten is a cutting-edge platform designed for deploying artificial intelligence models in production efficiently. It enables businesses to serve optimized open-source and custom models through a robust and reliable model delivery network. By streamlining machine learning operations, Baseten simplifies the integration of AI into applications, allowing companies to leverage the power of AI without the complexities typically associated with model deployment. The platform is favored for its speed, reliability, and user-friendly interface, catering to organizations seeking to enhance their AI capabilities.