Research Engineer, Reward Models Platform
Anthropic(1 month ago)
About this role
This role builds scalable tooling and infrastructure to accelerate reward-signal development for Anthropic's fine-tuning teams, turning manual experimentation into fast, repeatable workflows. You will partner closely with researchers on the Rewards and Fine-Tuning teams to translate scientific needs into platform capabilities, while occasionally contributing to research. The work focuses on enabling rapid iteration across rubric design, human feedback experiments, reward robustness evaluation, and detecting reward pathologies.
Required Skills
- Python
- ML Workflows
- Data Pipelines
- Infrastructure
- Tooling
- Automation
- Monitoring
- Observability
- Experiment Tracking
- Reward Modeling
+11 more
Qualifications
- Bachelor's Degree
About Anthropic
anthropic.comAnthropic is an AI safety and research company focused on building reliable, interpretable, and steerable AI systems. It develops large language models (branded as Claude) and offers APIs and enterprise products that let organizations integrate conversational AI with safety-focused controls, moderation, and privacy features. The company prioritizes interpretability and alignment research, publishes technical work, and engages with policymakers to reduce risks from advanced AI. Customers choose Anthropic for its safety-first approach, controllability tools, and research-driven models.
Apply instantly with AI
Let ApplyBlast auto-apply to jobs like this for you. Save hours on applications and land your dream job faster.
More jobs at Anthropic
Similar Jobs
Reward Program Manager
Referrals Only(1 month ago)
Reward & Recognition (Compensation) Lead, 12 Month Fixed Term Contract
DeepMind(1 month ago)
Reward & Recognition (Compensation) Lead - 6 Month Fixed Term Contract
DeepMind(6 days ago)
Product Manager - Small Mobile Apps (Remote)
Mode Mobile(1 month ago)
Machine Learning Engineer, Reinforcement Learning & Reward Modeling
Wayve(7 months ago)
AI Evaluator - Evergreen Requisition
Sama(1 year ago)