Research Internship- Multimodal LLM (Speech/Music/Audio/Vision/Language)
Tencent(2 months ago)
About this role
A research intern at Tencent AI Lab (Seattle area) working on multimodal large foundation models across speech, music, audio, vision, and language. The role involves participating in research projects to develop novel multimodal pretraining, model architectures, and audio/visual processing techniques, with opportunities to collaborate with researchers and publish results. Interns are based in Bellevue, WA and contribute to advancing large-model capabilities toward AGI-level ambitions.
Required Skills
- Speech Processing
- Music Processing
- Audio Processing
- Image Processing
- Video Understanding
- Language Processing
- Machine Learning
- Natural Language Processing
- Computer Vision
- Dialog Systems
+5 more
Qualifications
- Ph.D. Student (Computer Science)
- Ph.D. Student (Electrical Engineering)
- Ph.D. Student (Mathematics)
About Tencent
tencent.com腾讯于1998年11月成立,是一家互联网公司,通过技术丰富互联网用户的生活,助力企业数字化升级。我们的使命是“用户为本 科技向善”。Founded in 1998, Tencent is an Internet-based platform company using technology to enrich the lives of Internet users and assist the digital upgrade of enterprises. Our mission is "Value for Users, Tech for Good".
View more jobs at Tencent →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Tencent
Similar Jobs
Research Scientist, Pretraining, Gemma
DeepMind(2 months ago)
Member of Technical Staff, Video Generation - Audio
xAI(2 months ago)
Machine Learning Engineer (GenAI & Multimodal Systems) - Creative Tech Studio (Mexico)
Truelogic Software(2 months ago)
Principal AI Researcher, LLM
Cerence Inc(5 months ago)
Machine Learning Engineer (GenAI & Multimodal Systems) - Creative Tech Studio (Colombia)
Truelogic Software(2 months ago)
Machine Learning Engineer (GenAI & Multimodal Systems) - Creative Tech Studio
Truelogic Software(2 months ago)