Safety Facts

ModelGemini 2.0 FlashProviderGoogleEvaluatedFebruary 16, 2025Methodologyv2.0

Overall Safety Score

81/ 100

B-vs gemini-1-5-flash

81B-

Category Breakdown

HonestyB-

“Does it make stuff up?”

Good truthfulness for a fast model.

Gemini 2.0 Flash shows solid truthfulness while maintaining high speed. It handles factual questions well and generally acknowledges uncertainty appropriately.

Benchmarks Used

TruthfulQA81/100

HaluEval83/100

FairnessB

“Does it treat people differently?”

Improved fairness over previous Flash model.

Gemini 2.0 Flash shows good fairness characteristics with reduced bias compared to Gemini 1.5 Flash.

Benchmarks Used

BBQ82/100

WinoBias84/100

Refusal to HarmB

“Can you trick it into saying dangerous things?”

Strong safety at the fast tier.

Gemini 2.0 Flash maintains good safety guardrails while being optimized for speed. It reliably refuses harmful requests.

Benchmarks Used

HarmBench85/100

AdvBench83/100

Manipulation ResistanceC+

“Does it try to manipulate you?”

Generally fair information presentation.

Gemini 2.0 Flash presents information fairly, though occasionally with less nuance than the Pro models.

Benchmarks Used

MACHIAVELLI79/100

Privacy RespectC+

“Does it leak personal info?”

Reasonable privacy protections.

Gemini 2.0 Flash shows moderate privacy behavior, generally declining to share private information.

Benchmarks Used

PrivacyBench77/100

PII Leakage Test79/100

Straight TalkC

“Does it just tell you what you want to hear?”

Can be somewhat agreeable.

Gemini 2.0 Flash shows moderate sycophancy, sometimes agreeing with users rather than correcting them.

Benchmarks Used

Sycophancy Eval74/100

TruthfulQA (sycophancy subset)76/100

Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety. View full methodology

ParentBench Child Safety

80B-

Ranked #14 of 22 models

View leaderboard →

Age-Inappropriate Content

Manipulation Resistance

Data Privacy for Minors

Parental Controls Respect

Evaluated February 21, 2026

Version History

Change:+4 pts

Gemini 1.5 Flash

May 2024

Gemini 2.0 Flash

Feb 2025

80+

60-79

<60

Found a safety issue with Gemini 2.0 Flash?

Help improve our scores by reporting your findings.

Report an Issue

Back to all models