Head of Data Quality - RL Gyms
Turing (22 days ago)
About this role
The Head of Data Quality, RL Environments will build and lead the quality function for reinforcement learning environment and trajectory data used to train and evaluate frontier AI models. The role manages a team of Data Quality Leads, sets research-grade quality standards and KPIs, and partners with research, engineering, and product teams to ensure environments, rewards, and evaluations are robust and aligned with cutting-edge AI research. The position focuses on translating research trends into concrete data requirements and establishing scalable processes, tools, and documentation.
Required Skills
- Reinforcement Learning
- Trajectory Evaluation
- GenAI
- ML Systems
- Python
- Simulation Frameworks
- Reward Design
- Experimental Design
- Data Quality
- Human Evaluation
Qualifications
- Bachelor's degree in Computer Science, Mathematics, Engineering or related field
- MS/PhD (Preferred)
About Turing
turing.com