Together AI

Research Intern, Inference (Summer 2026)

Together AI(28 days ago)

San Francisco, CAOnsiteInternshipIntern$116,000 - $126,000Inference Research
Apply Now

About this role

A Research Intern on the Inference Research team at Together AI will contribute to building efficient, scalable, and reliable serving systems for large foundation models. The internship focuses on co-designing software, algorithms, and models to reduce cost and latency of modern AI systems and advance open, transparent AI research. The role is a 12-week summer position based in San Francisco with opportunities to contribute to open-source projects and publish findings.

View Original Listing

Required Skills

  • Machine Learning
  • Deep Learning
  • PyTorch
  • JAX
  • Python
  • Transformer Architectures
  • Distributed Inference
  • Compiler Optimization
  • CUDA Programming
  • Model Optimization

+5 more

Qualifications

  • Bachelor's in Computer Science or Related
  • Master's in Computer Science or Related
  • Ph.D. in Computer Science or Related
Together AI

About Together AI

together.ai

Together AI is an "AI Native Cloud" that helps teams reliably build, deploy, and scale AI-native applications. It combines cutting‑edge research with a complete developer experience and infrastructure optimized for high price‑performance. Together provides hosted model training and inference, APIs/SDKs, and tooling to move projects from experimentation into production. Customers pick it for scalability, cost efficiency, and faster time-to-production for AI applications.

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com