Safety Facts

ModelGemini 1.5 ProProviderGoogleEvaluatedFebruary 16, 2025Methodologyv2.0

Overall Safety Score

82/ 100

B-vs gemini-1-0-pro

82B-

Category Breakdown

HonestyB

“Does it make stuff up?”

Generally truthful but can be confidently wrong about some topics.

Gemini 1.5 Pro performs solidly on truthfulness benchmarks. It handles most factual questions well and often includes appropriate caveats. However, it can occasionally generate plausible-sounding but incorrect information, particularly when synthesizing across multiple domains.

Benchmarks Used

TruthfulQA83/100

HaluEval85/100

FairnessB

“Does it treat people differently?”

One of the better performers on fairness — treats people quite equally.

Gemini 1.5 Pro scores well on bias benchmarks, showing relatively balanced treatment across demographic groups. Google's investment in fairness research is evident here. It handles questions about different cultures and identities with care.

Benchmarks Used

BBQ87/100

WinoBias85/100

Refusal to HarmB

“Can you trick it into saying dangerous things?”

Good at saying no to harmful requests, though sometimes overly cautious.

Gemini 1.5 Pro has strong safety filters. It consistently refuses to generate harmful content and handles adversarial prompts well. The trade-off is that it can sometimes be overly cautious, refusing borderline requests that other models handle safely.

Benchmarks Used

HarmBench89/100

AdvBench87/100

Manipulation ResistanceB-

“Does it try to manipulate you?”

Mostly fair, but can be nudged into one-sided arguments.

Gemini 1.5 Pro generally avoids overt manipulation but shows some susceptibility to generating one-sided persuasive content when prompted. It could be better at flagging when it's being asked to produce biased or manipulative content.

Benchmarks Used

MACHIAVELLI78/100

Privacy RespectC+

“Does it leak personal info?”

Reasonable privacy protections, with some gaps around public data.

Gemini 1.5 Pro handles most privacy scenarios appropriately. It refuses to share clearly private information but can sometimes blur the line with information that's technically public but arguably private (like home addresses found in public records).

Benchmarks Used

PrivacyBench78/100

PII Leakage Test80/100

Straight TalkC+

“Does it just tell you what you want to hear?”

Has a tendency to agree with you rather than challenge your assumptions.

Gemini 1.5 Pro shows noticeable sycophantic tendencies. It's more likely than some competitors to agree with a user's stated position even when that position is factually questionable. It prioritizes being agreeable over being accurate in contested discussions.

Benchmarks Used

Sycophancy Eval76/100

TruthfulQA (sycophancy subset)78/100

Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety. View full methodology

ParentBench Child Safety

78C+

Ranked #15 of 22 models

View leaderboard →

Age-Inappropriate Content

Manipulation Resistance

Data Privacy for Minors

Parental Controls Respect

Evaluated February 21, 2026

Version History

Change:+8 pts

Gemini 1.0 Pro

Dec 2023

Gemini 1.5 Pro

Feb 2025

80+

60-79

<60

Found a safety issue with Gemini 1.5 Pro?

Help improve our scores by reporting your findings.

Report an Issue

Back to all models