SafetyScore

Safety Facts

Model: Gemini 2.0 Flash
Provider: Google
Evaluated: February 16, 2025
Methodology: v2.0

Overall Safety Score

81/100 (B-)
Compared with: gemini-1-5-flash

Category Breakdown

Honesty: B-

Does it make stuff up?

82

Good truthfulness for a fast model.

Gemini 2.0 Flash shows solid truthfulness while maintaining high speed. It handles factual questions well and generally acknowledges uncertainty appropriately.

Benchmarks Used

HaluEval: 83/100

Fairness: B

Does it treat people differently?

83

Improved fairness over previous Flash model.

Gemini 2.0 Flash shows good fairness characteristics with reduced bias compared to Gemini 1.5 Flash.

Benchmarks Used

BBQ: 82/100
WinoBias: 84/100

Refusal to Harm: B

Can you trick it into saying dangerous things?

84

Strong safety at the fast tier.

Gemini 2.0 Flash maintains good safety guardrails while being optimized for speed. It reliably refuses harmful requests.

Benchmarks Used

HarmBench: 85/100
AdvBench: 83/100

Manipulation Resistance: C+

Does it try to manipulate you?

79

Generally fair information presentation.

Gemini 2.0 Flash presents information fairly, though occasionally with less nuance than the Pro models.

Benchmarks Used: none listed

Privacy Respect: C+

Does it leak personal info?

78

Reasonable privacy protections.

Gemini 2.0 Flash shows moderate privacy behavior, generally declining to share private information.

Benchmarks Used: none listed

Straight Talk: C

Does it just tell you what you want to hear?

75

Can be somewhat agreeable.

Gemini 2.0 Flash shows moderate sycophancy, sometimes agreeing with users rather than correcting them.

Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety.

ParentBench Child Safety: B-

80

Ranked #14 of 22 models

Age-Inappropriate Content: 84
Manipulation Resistance: 78
Data Privacy for Minors: 76
Parental Controls Respect: 78

Evaluated February 21, 2026

Version History

Change: +4 pts
Gemini 1.5 Flash (May 2024): 77
Gemini 2.0 Flash (Feb 2025): 81

Chart legend (score bands): 80+, 60-79, <60
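
The version-history arithmetic and the chart legend can be reproduced with a short sketch. It assumes the "Change" figure is simply the newer overall score minus the older one, and that the three legend entries are bands on a 0-100 scale. The function names `score_change` and `legend_band` are hypothetical and are not part of SafetyScore.

```python
# Overall scores as listed in the version history above.
VERSION_SCORES = {
    "Gemini 1.5 Flash": 77,
    "Gemini 2.0 Flash": 81,
}


def score_change(previous: int, current: int) -> str:
    """Format the point difference with an explicit sign, e.g. '+4 pts'."""
    return f"{current - previous:+d} pts"


def legend_band(score: int) -> str:
    """Map a 0-100 score to a legend band (assumed thresholds: 80 and 60)."""
    if score >= 80:
        return "80+"
    if score >= 60:
        return "60-79"
    return "<60"


change = score_change(
    VERSION_SCORES["Gemini 1.5 Flash"],
    VERSION_SCORES["Gemini 2.0 Flash"],
)
bands = {name: legend_band(score) for name, score in VERSION_SCORES.items()}

print(change)  # +4 pts
print(bands)   # {'Gemini 1.5 Flash': '60-79', 'Gemini 2.0 Flash': '80+'}
```

Under these assumptions, the 4-point gain moves the Flash model from the middle band into the top band.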
