Principal Researcher – Reinforcement Learning for Large Foundation Models
Tencent
About this role
Tencent AI Lab is hiring an expert-level researcher to advance reinforcement learning methods for large foundation models. The role focuses on developing stable, efficient RL algorithms to improve complex reasoning, autonomous agent behavior, and continuous learning capabilities. The position will involve driving research that translates to real-world applications and influential publications.
Skills
Qualifications
About Tencent
tencent.com腾讯于1998年11月成立,是一家互联网公司,通过技术丰富互联网用户的生活,助力企业数字化升级。我们的使命是“用户为本 科技向善”。Founded in 1998, Tencent is an Internet-based platform company using technology to enrich the lives of Internet users and assist the digital upgrade of enterprises. Our mission is "Value for Users, Tech for Good".
Recent company news
Tencent Cloud Powers iyzico's European Expansion with Secure, Scalable Payment Infrastructure
5 hours ago
Capital World Investors Sells 1,835,986 Shares of Tencent Music Entertainment Group Sponsored ADR $TME
6 hours ago
Tencent Joins China’s AI Agent Race With ‘Top-Secret’ WeChat Project
2 days ago
China’s OpenClaw Frenzy Sends Minimax, Tencent Shares Soaring
2 days ago
WeChat explores building in-house AI model as Tencent ramps up AI push
8 hours ago
About Tencent
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Unlock Company Insights
View leadership team, funding history,
and employee contacts for Tencent.
Salary
$164k – $308k
per year
More jobs at Tencent
Similar Jobs
Reinforcement Learning Intern
DeKalb County Government
Reinforcement learning engineer
Dexmate
Machine Learning Engineer, Foundation Model
DiDi
AI Scientist (Reinforcement Learning)
Resaro AI
Senior Machine Learning Engineer (Reinforcement Learning)
Datatonic
Senior/Staff Deep Reinforcement Learning Engineer
DoorDash