Systems Research Engineer Intern- GPU Programming (Summer 2026)
Together AI(28 days ago)
About this role
A Systems Research Engineer Intern (GPU Programming) at Together AI will contribute to the development and optimization of GPU-accelerated kernels and algorithms for ML/AI systems. The intern will work with modeling, hardware, and software teams to co-design model architectures and efficient GPU solutions. This is a summer internship based in San Francisco for approximately 12 weeks, offering exposure to research-driven engineering and potential contributions to open-source projects.
Required Skills
- GPU Programming
- Parallel Computing
- CUDA
- Triton
- ML Models
- Performance Profiling
- Optimization
- Problem Solving
- Analytical Skills
- Collaboration
About Together AI
together.aiTogether AI is an "AI Native Cloud" that helps teams reliably build, deploy, and scale AI-native applications. It combines cutting‑edge research with a complete developer experience and infrastructure optimized for high price‑performance. Together provides hosted model training and inference, APIs/SDKs, and tooling to move projects from experimentation into production. Customers pick it for scalability, cost efficiency, and faster time-to-production for AI applications.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Together AI
Similar Jobs
Member of Technical Staff, GPU Optimization
Mirage(2 months ago)
Machine Learning Engineer – HPC
Meshy(5 months ago)
Member of Technical Staff (GPU Engineer)
Reka(26 days ago)
ML Solutions Engineer
TensorWave(2 months ago)
Senior HPC Developer - GPU and Networking
Clockwork.io(7 days ago)
ML Engineer, Large Language Models (LLM Training & Inference Optimization)
Nebius(10 months ago)