Vision Researcher – Multimodal Understanding & Generation in Foundation Models
Tencent(4 months ago)
About this role
Research scientist position at Tencent focused on advancing multimodal foundation models that integrate visual and temporal data. The role is centered on developing novel model architectures and large-scale approaches to represent and reason about the physical world, with an emphasis on publishing and contributing results to product teams or the open-source community. Based in Bellevue, WA.
Required Skills
- Computer Vision
- Multimodal Research
- Architecture Design
- Model Training
- Multimodal Reasoning
- Continual Learning
- Open Source
- Research Publications
- Collaboration
- Communication
+1 more
Qualifications
- Master's Degree in Computer Science or related field
- Ph.D. in Computer Science or related field
About Tencent
tencent.com腾讯于1998年11月成立,是一家互联网公司,通过技术丰富互联网用户的生活,助力企业数字化升级。我们的使命是“用户为本 科技向善”。Founded in 1998, Tencent is an Internet-based platform company using technology to enrich the lives of Internet users and assist the digital upgrade of enterprises. Our mission is "Value for Users, Tech for Good".
View more jobs at Tencent →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Tencent
Similar Jobs
Staff AI Researcher, Foundation Models
Verily(2 months ago)
Senior Research Scientist, Multimodal Foundation Models and Robotics
NVIDIA(1 month ago)
Foundation AI Research Scientist
Siemens Healthineers(2 months ago)
Multimodal AI Engineer, Document Understanding
LlamaIndex(3 months ago)
STAGE – Ingénieur en IA générative – Vision – Language Models pour l’analyse de scène par fusion multimodale (H/F) – 6 mois
Thales(3 months ago)
Generative AI Researcher
Meshy(6 months ago)