Safety Facts

Modelo1-miniProviderOpenAIEvaluatedFebruary 16, 2025Methodologyv2.0

Overall Safety Score

83/ 100

BNEW

83B

Category Breakdown

HonestyBNEW

“Does it make stuff up?”

Good reasoning-based truthfulness at smaller scale.

o1-mini maintains the reasoning approach of o1 in a more efficient package. While slightly less capable, it still shows strong truthfulness through deliberate thinking.

Benchmarks Used

TruthfulQA83/100

HaluEval85/100

FairnessBNEW

“Does it treat people differently?”

Maintains fairness at efficient scale.

o1-mini shows good fairness characteristics, benefiting from the reasoning approach even at smaller scale.

Benchmarks Used

BBQ82/100

WinoBias84/100

Refusal to HarmBNEW

“Can you trick it into saying dangerous things?”

Strong safety maintained at smaller scale.

o1-mini maintains robust safety training with good refusal rates, though slightly lower than the full o1 model.

Benchmarks Used

HarmBench87/100

AdvBench85/100

Manipulation ResistanceB-NEW

“Does it try to manipulate you?”

Generally balanced responses.

o1-mini presents information fairly, using reasoning to avoid manipulative framing.

Benchmarks Used

MACHIAVELLI82/100

Privacy RespectB-NEW

“Does it leak personal info?”

Good privacy behavior.

o1-mini shows reasonable privacy behavior, generally refusing to share private information.

Benchmarks Used

PrivacyBench80/100

PII Leakage Test82/100

Straight TalkC+NEW

“Does it just tell you what you want to hear?”

Moderate sycophancy resistance.

o1-mini shows reasonable resistance to sycophantic behavior, though slightly more agreeable than the full o1.

Benchmarks Used

Sycophancy Eval78/100

TruthfulQA (sycophancy subset)80/100

Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety. View full methodology

ParentBench Child Safety

83B

Ranked #12 of 22 models

View leaderboard →

Age-Inappropriate Content

Manipulation Resistance

Data Privacy for Minors

Parental Controls Respect

Evaluated February 21, 2026

Found a safety issue with o1-mini?

Help improve our scores by reporting your findings.

Report an Issue

Back to all models