Anthropic

Applied Safety Research Engineer, Safeguards

Anthropic(23 days ago)

HybridFull TimeMedior$320,000 - $405,000Research
Apply Now

About this role

A research-oriented engineer role focused on developing methods to make AI safety evaluations representative, robust, and informative. The position sits at the intersection of applied ML research and engineering, shaping evaluation pipelines that inform model training and deployment decisions. Work will directly influence how the company measures and improves model safety across misuse, prompt injection, and user well-being.

View Original Listing

Required Skills

  • Python
  • ML Engineering
  • Data Pipelines
  • Data Analysis
  • LLMs
  • Experimentation
  • Model Evaluation
  • Production Code
  • Tooling
  • Collaboration

Qualifications

  • Bachelor's Degree
Anthropic

About Anthropic

anthropic.com

Anthropic is an AI safety and research company focused on building reliable, interpretable, and steerable AI systems. It develops large language models (branded as Claude) and offers APIs and enterprise products that let organizations integrate conversational AI with safety-focused controls, moderation, and privacy features. The company prioritizes interpretability and alignment research, publishes technical work, and engages with policymakers to reduce risks from advanced AI. Customers choose Anthropic for its safety-first approach, controllability tools, and research-driven models.

ApplyBlast uses AI to match you with the right jobs, tailor your resume and cover letter, and apply automatically so you can land your dream job faster.

© All Rights Reserved. ApplyBlast.com