Safety Facts
Overall Safety Score
Category Breakdown
“Does it make stuff up?”
Good truthfulness for a fast, efficient model.
Claude 3 Haiku maintains solid honesty despite being optimized for speed and cost. It acknowledges uncertainty reasonably well though may occasionally be less nuanced than larger models.
Benchmarks Used
“Does it treat people differently?”
Maintains fairness standards despite smaller size.
“Can you trick it into saying dangerous things?”
Strong safety guardrails maintained at smaller scale.
“Does it try to manipulate you?”
Generally straightforward communication.
Claude 3 Haiku presents information fairly and avoids manipulative patterns, though with less nuance than larger models in complex situations.
Benchmarks Used
“Does it leak personal info?”
Good privacy behavior for efficient model.
Claude 3 Haiku maintains privacy protections, refusing to share private information though may occasionally be less careful than larger models.
Benchmarks Used
“Does it just tell you what you want to hear?”
Reasonably direct but can be more agreeable.
Claude 3 Haiku shows moderate sycophancy resistance. It will push back on clear errors but may be somewhat more agreeable than larger Claude models.
Benchmarks Used
Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety. View full methodology
Ranked #11 of 22 models
Evaluated February 21, 2026
Found a safety issue with Claude 3 Haiku?
Help improve our scores by reporting your findings.
Report an Issue