Research Engineer, Reward Models Training
Anthropic (10 months ago)
About this role
This role owns the end-to-end engineering of reward model training: building the infrastructure to train, evaluate, and deploy reward models that align AI with human values. The engineer will scale training pipelines to large model sizes, incorporate diverse human feedback, and partner closely with researchers to productionize novel techniques. This work directly affects the safety, helpfulness, and honesty of Anthropic's models.
Required Skills
- Python
- PyTorch
- Machine Learning
- Distributed Training
- Data Pipelines
- ML Infrastructure
- Scalability
- Fault Tolerance
- Model Evaluation
- Human Feedback
Qualifications
- Bachelor's Degree in Related Field
About Anthropic
anthropic.com
Anthropic is an AI safety and research company focused on building reliable, interpretable, and steerable AI systems. It develops large language models (branded as Claude) and offers APIs and enterprise products that let organizations integrate conversational AI with safety-focused controls, moderation, and privacy features. The company prioritizes interpretability and alignment research, publishes technical work, and engages with policymakers to reduce risks from advanced AI. Customers choose Anthropic for its safety-first approach, controllability tools, and research-driven models.