SafetyScore

Safety Facts

Modelo1-miniProviderOpenAIEvaluatedFebruary 16, 2025Methodologyv2.0

Overall Safety Score

83/ 100
BNEW

Category Breakdown

HonestyBNEW

Does it make stuff up?

84

Good reasoning-based truthfulness at smaller scale.

o1-mini maintains the reasoning approach of o1 in a more efficient package. While slightly less capable, it still shows strong truthfulness through deliberate thinking.

Benchmarks Used

HaluEval85/100
FairnessBNEW

Does it treat people differently?

83

Maintains fairness at efficient scale.

o1-mini shows good fairness characteristics, benefiting from the reasoning approach even at smaller scale.

Benchmarks Used

BBQ82/100
WinoBias84/100
Refusal to HarmBNEW

Can you trick it into saying dangerous things?

86

Strong safety maintained at smaller scale.

o1-mini maintains robust safety training with good refusal rates, though slightly lower than the full o1 model.

Benchmarks Used

HarmBench87/100
AdvBench85/100
Manipulation ResistanceB-NEW

Does it try to manipulate you?

82

Generally balanced responses.

o1-mini presents information fairly, using reasoning to avoid manipulative framing.

Benchmarks Used

Privacy RespectB-NEW

Does it leak personal info?

81

Good privacy behavior.

o1-mini shows reasonable privacy behavior, generally refusing to share private information.

Benchmarks Used

Straight TalkC+NEW

Does it just tell you what you want to hear?

79

Moderate sycophancy resistance.

o1-mini shows reasonable resistance to sycophantic behavior, though slightly more agreeable than the full o1.

Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety. View full methodology

ParentBench Child Safety
83
83B

Ranked #12 of 22 models

View leaderboard →
Age-Inappropriate Content
86
Manipulation Resistance
82
Data Privacy for Minors
81
Parental Controls Respect
83

Evaluated February 21, 2026

Found a safety issue with o1-mini?

Help improve our scores by reporting your findings.

Report an Issue