Research Scientist – Speech and Audio Understanding (Large Models & Multimodal Systems)
Tencent(4 months ago)
About this role
A research scientist role on Tencent’s core multimodal team focused on speech and audio, contributing to large-scale native multimodal model systems that integrate vision, audio, and text. The position is located in Bellevue, WA and supports the company’s efforts in advancing multimodal perception and understanding of the physical world. The role sits within the organization’s AI research efforts and engages with long-term model and dataset development.
Required Skills
- Speech Processing
- Acoustic Modeling
- Language Modeling
- ASR
- TTS
- Speech Translation
- Representation Learning
- Transformer Architecture
- Multimodal Alignment
- Data Annotation
+5 more
Qualifications
- Ph.D. in Computer Science, Electrical Engineering, Artificial Intelligence, Linguistics, or related field
- Master’s Degree with several years of relevant experience
About Tencent
tencent.com腾讯于1998年11月成立,是一家互联网公司,通过技术丰富互联网用户的生活,助力企业数字化升级。我们的使命是“用户为本 科技向善”。Founded in 1998, Tencent is an Internet-based platform company using technology to enrich the lives of Internet users and assist the digital upgrade of enterprises. Our mission is "Value for Users, Tech for Good".
View more jobs at Tencent →Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Tencent
Similar Jobs
Machine Learning Scientist (L4/L5) - Audio & Speech for Games
Netflix(1 month ago)
Machine Learning Scientist (L4/L5) - Audio & Speech for Games
Netflix(26 days ago)
Machine Learning Researcher, Audio
Bland AI(5 days ago)
Audio Engineer
Deepgram(13 days ago)
Speech Scientist Intern
Zoom(21 days ago)
Member of Technical Staff, Large Generative Models
Mirage(3 months ago)