Turing

Head of Data Quality - RL Gyms

Turing (22 days ago)

San Francisco, California, United States | Onsite | Full Time | Director | $213,553 - $285,030 (estimated) | Data Quality

About this role

The Head of Data Quality, RL Environments will build and lead the quality function for reinforcement learning environment and trajectory data used to train and evaluate frontier AI models. The role manages a team of Data Quality Leads, sets research-grade quality standards and KPIs, and partners with research, engineering, and product teams to ensure environments, rewards, and evaluations are robust and aligned with cutting-edge AI research. The position focuses on translating research trends into concrete data requirements and establishing scalable processes, tools, and documentation.

Required Skills

  • Reinforcement Learning
  • Trajectory Evaluation
  • GenAI
  • ML Systems
  • Python
  • Simulation Frameworks
  • Reward Design
  • Experimental Design
  • Data Quality
  • Human Evaluation

Qualifications

  • Bachelor's degree in Computer Science, Mathematics, Engineering or related field
  • MS/PhD (Preferred)

About Turing

turing.com