Research Scientist - Speech & Audio Understanding (Speech Generation)
Tencent (3 months ago)
About this role
A research-focused role leading technical R&D on voice foundation models to advance speech and multimodal voice capabilities. The position centers on developing next-generation speech and audio technologies and integrating text, speech, and vision to improve voice interaction experiences. Based in Bellevue, WA at Tencent, the role contributes to foundational model innovation and applied research in voice AI.
Required Skills
- Voice Models
- Speech Synthesis
- Speech Recognition
- Audio Generation
- Voice Conversion
- Speech Codec
- PyTorch
- Megatron
- DeepSpeed
- Model Pretraining
Qualifications
- Master's or Ph.D. in Computer Science, Artificial Intelligence, Electronic Engineering, or Signal Processing
About Tencent
tencent.com
Founded in November 1998, Tencent is an Internet-based platform company using technology to enrich the lives of Internet users and assist the digital upgrade of enterprises. Our mission is "Value for Users, Tech for Good".
Similar Jobs
Machine Learning Researcher, Audio
Bland AI (5 days ago)
PhD Audio AI Engineer (Speech Conversion, TTS & ASR)
Zoom (1 month ago)
Audio AI Engineer
Zoom (3 months ago)
Research Engineer, Audio
Anthropic (1 month ago)
Member of Technical Staff, Video Generation - Audio
xAI (2 months ago)
Audio English/Multilingual Tutor
xAI (1 month ago)