Senior Research Scientist, Reward Models
Anthropic (1 month ago)
About this role
A Senior Research Scientist on Anthropic's Reward Models team leads work to improve how models represent and optimize for human preferences, advancing AI alignment and safety. The role focuses on developing novel reward-model architectures and training methodologies for large language models and translating research into production improvements. The position provides access to frontier models and significant computational resources and involves close collaboration across research and engineering teams.
Required Skills
- Reward Modeling
- RLHF
- Preference Learning
- LLM Evaluation
- Experiment Design
- Robustness
- Production ML
- Research Publication
- Mentoring
- Collaboration
Qualifications
- Bachelor's Degree in Related Field
- Equivalent Experience
About Anthropic
anthropic.com
Anthropic is an AI safety and research company focused on building reliable, interpretable, and steerable AI systems. It develops large language models (branded as Claude) and offers APIs and enterprise products that let organizations integrate conversational AI with safety-focused controls, moderation, and privacy features. The company prioritizes interpretability and alignment research, publishes technical work, and engages with policymakers to reduce risks from advanced AI. Customers choose Anthropic for its safety-first approach, controllability tools, and research-driven models.
Similar Jobs
Research Scientist, Frontier, Zurich
DeepMind (18 days ago)
Machine Learning Scientist
LMArena (1 month ago)
Research Scientist, Sound and Audio
DeepMind (1 day ago)
Machine Learning Engineer, Reinforcement Learning & Reward Modeling
Wayve (7 months ago)
Research Engineer, Frontier Safety Risk Assessment
DeepMind (1 month ago)
Reward Program Manager
Referrals Only (1 month ago)