Research Intern, Inference (Summer 2026)
Together AI(28 days ago)
About this role
A Research Intern on the Inference Research team at Together AI will contribute to building efficient, scalable, and reliable serving systems for large foundation models. The internship focuses on co-designing software, algorithms, and models to reduce cost and latency of modern AI systems and advance open, transparent AI research. The role is a 12-week summer position based in San Francisco with opportunities to contribute to open-source projects and publish findings.
Required Skills
- Machine Learning
- Deep Learning
- PyTorch
- JAX
- Python
- Transformer Architectures
- Distributed Inference
- Compiler Optimization
- CUDA Programming
- Model Optimization
+5 more
Qualifications
- Bachelor's in Computer Science or Related
- Master's in Computer Science or Related
- Ph.D. in Computer Science or Related
About Together AI
together.aiTogether AI is an "AI Native Cloud" that helps teams reliably build, deploy, and scale AI-native applications. It combines cutting‑edge research with a complete developer experience and infrastructure optimized for high price‑performance. Together provides hosted model training and inference, APIs/SDKs, and tooling to move projects from experimentation into production. Customers pick it for scalability, cost efficiency, and faster time-to-production for AI applications.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Together AI
Similar Jobs
Architecture Intern - Inference
Etched(1 month ago)
ML Engineer, Large Language Models (LLM Training & Inference Optimization)
Nebius(10 months ago)
Staff Machine Learning Performance Engineer, Inference Optimisation
Wayve(2 months ago)
Machine Learning Engineer - Inference / Serving
Yobi(3 months ago)
Research Scientist (Embodied AI & World Models)
Graphcore(2 months ago)
Generative AI - ML System Engineering
Meshy(1 year ago)