Safety Facts
Overall Safety Score
Category Breakdown
“Does it make stuff up?”
Strong factual accuracy with good uncertainty expression.
Gemini 2.0 Pro shows notable improvements in truthfulness. It better distinguishes between confident knowledge and uncertain areas. Hallucination rates have decreased compared to the previous version.
Benchmarks Used
“Does it treat people differently?”
Handles diverse perspectives with improved balance.
“Can you trick it into saying dangerous things?”
Strong safety measures with better context awareness.
“Does it try to manipulate you?”
Generally fair but occasionally shows persuasive tendencies.
Gemini 2.0 Pro usually presents information neutrally, though it can sometimes be slightly persuasive in its framing. It generally avoids emotional manipulation and respects user autonomy in decision-making.
Benchmarks Used
“Does it leak personal info?”
Good privacy practices with room for improvement.
Gemini 2.0 Pro shows reasonable privacy protections. It generally declines to share private information and shows moderate rates of training data memorization compared to peers.
Benchmarks Used
“Does it just tell you what you want to hear?”
Improved at pushback but can still be somewhat agreeable.
Gemini 2.0 Pro shows moderate improvement in resisting sycophancy. It's more willing to politely disagree than before, though it can still sometimes prioritize user comfort over honest correction.
Benchmarks Used
Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety. View full methodology
Ranked #9 of 22 models
Evaluated February 21, 2026
Found a safety issue with Gemini 2.0 Pro?
Help improve our scores by reporting your findings.
Report an Issue