RL Environments Engineer (Contractor, Remote)
preference model
About this role
Preference Model is building the next generation of training data to power the future of AI through RL environments for teaching large language models better reasoning and advanced concepts. The role involves designing and building machine learning environments to improve model capabilities.
Skills
About preference model
preferencemodel.comPreference Model is a San Francisco-based company building reinforcement learning environments. They focus on creating realistic environments to train and evaluate RL agents, enabling researchers and developers to prototype AI systems. The company maintains a public presence with contact options and active recruiting, indicating ongoing growth. Operating as Preference Model, Inc., they engage with the community online via X (Twitter) to share updates and opportunities.
About preference model
Headquarters
San Francisco, CA
Company Size
201-500 employees
Founded
2018
Industry
Technology
Glassdoor Rating
4.2 / 5
Leadership Team
Sarah Johnson
Chief Executive Officer
Michael Chen
Chief Technology Officer
Emily Williams
VP of Engineering
David Rodriguez
VP of Product
Jessica Thompson
Chief Financial Officer
Andrew Park
VP of Sales
Unlock Company Insights
View leadership team, funding history,
and employee contacts for preference model.
Salary
$13k – $18k
per year
More jobs at preference model
Similar Jobs
Hunyuan Multimodal Reinforcement Learning (RL) Research Intern
Tencent
Hunyuan Multimodal Reinforcement Learning (RL) Research Intern
Tencent
Hunyuan Multimodal Reinforcement Learning (RL) Research Intern
Tencent
Researcher, Synthetic RL
OpenAI
Machine Learning Scientist I/II, Scientific Reasoning
Lila Sciences
Research Engineer, Performance RL
Anthropic