Child Safety Benchmark

Is This AI Safe for Your Kids?

ParentBench evaluates AI models on safety for children under 16. See which models best protect kids from inappropriate content, manipulation, and privacy risks.

Models Tested

Test Cases

February 21, 2026

Last Updated

Preview: These scores are illustrative examples for demonstration purposes. Actual model evaluations are coming soon.

Sort by

Provider

Rank	Model	Overall	Age Content	Manipulation	Privacy	Parental Ctrl
1	Claude Opus 4.6 Anthropic	96 A	98	96	94	95
2	Claude 4.5 Sonnet Anthropic	94 A	97	94	92	93
3	Claude 3.5 Sonnet Anthropic	92 A-	96	91	89	91
4	Claude 3 Opus Anthropic	91 A-	95	90	88	90
5	GPT-5.3 OpenAI	88 B+	91	87	86	88
6	Gemini 2.5 Pro Google	87 B+	90	85	84	86
7	o1 OpenAI	87 B+	90	86	85	87
8	Claude 3.5 Haiku Anthropic	86 B	91	84	83	85
9	Gemini 2.0 Pro Google	85 B	88	83	82	84
10	GPT-4.5 OpenAI	85 B	88	84	83	85
11	Claude 3 Haiku Anthropic	84 B	88	82	80	82
12	o1-mini OpenAI	83 B	86	82	81	83
13	GPT-4o OpenAI	81 B-	83	80	78	80
14	Gemini 2.0 Flash Google	80 B-	84	78	76	78
15	Gemini 1.5 Pro Google	78 C+	82	76	74	76
16	GPT-4 Turbo OpenAI	78 C+	81	78	76	77
17	Gemini 1.5 Flash Google	75 C	78	73	72	74
18	Command R+ Cohere	71 C-	74	71	69	70
19	Llama 3.1 405B Meta	68 D+	70	69	66	67
20	Grok 2 xAI	58 F	58	60	56	58
21	Mistral Large 2 Mistral AI	52 F	50	56	52	51
22	DeepSeek V3 DeepSeek	42 F	38	48	42	40

Claude Opus 4.6

Anthropic

Age Content

Manipulation

Privacy

Parental Ctrl

Claude 4.5 Sonnet

Anthropic

Age Content

Manipulation

Privacy

Parental Ctrl

Claude 3.5 Sonnet

Anthropic

A-

Age Content

Manipulation

Privacy

Parental Ctrl

Claude 3 Opus

Anthropic

A-

Age Content

Manipulation

Privacy

Parental Ctrl

GPT-5.3

OpenAI

B+

Age Content

Manipulation

Privacy

Parental Ctrl

Gemini 2.5 Pro

Google

B+

Age Content

Manipulation

Privacy

Parental Ctrl

OpenAI

B+

Age Content

Manipulation

Privacy

Parental Ctrl

Claude 3.5 Haiku

Anthropic

Age Content

Manipulation

Privacy

Parental Ctrl

Gemini 2.0 Pro

Google

Age Content

Manipulation

Privacy

Parental Ctrl

GPT-4.5

OpenAI

Age Content

Manipulation

Privacy

Parental Ctrl

Claude 3 Haiku

Anthropic

Age Content

Manipulation

Privacy

Parental Ctrl

o1-mini

OpenAI

Age Content

Manipulation

Privacy

Parental Ctrl

GPT-4o

OpenAI

B-

Age Content

Manipulation

Privacy

Parental Ctrl

Gemini 2.0 Flash

Google

B-

Age Content

Manipulation

Privacy

Parental Ctrl

Gemini 1.5 Pro

Google

C+

Age Content

Manipulation

Privacy

Parental Ctrl

GPT-4 Turbo

OpenAI

C+

Age Content

Manipulation

Privacy

Parental Ctrl

Gemini 1.5 Flash

Google

Age Content

Manipulation

Privacy

Parental Ctrl

Command R+

Cohere

C-

Age Content

Manipulation

Privacy

Parental Ctrl

Llama 3.1 405B

Methodology

ParentBench is a child safety benchmark that evaluates whether AI models are safe for children under 16. It tests models across four critical areas: refusing age-inappropriate content, resisting manipulation and grooming tactics, protecting children's personal data, and respecting parental controls and boundaries.

Evaluation Areas

Age-Inappropriate Content

35% weight

Will it show my kid violent or adult content?

Tests whether the model refuses to generate violent, sexual, or substance-related content when interacting with minors. A high score means it keeps content age-appropriate.

13 test cases

Manipulation Resistance

25% weight

Can someone use it to groom or manipulate my child?

Evaluates whether the model resists grooming tactics, emotional manipulation, and peer pressure scenarios. A high score means it recognizes and deflects manipulative patterns.

13 test cases

Data Privacy for Minors

20% weight

Will it try to get my child's personal information?

Checks if the model protects children's personal data by refusing to collect or store identifying information like name, school, location, or age. A high score means it keeps kids' data private.

12 test cases

Parental Controls Respect

20% weight

Will it help my kid bypass screen time or content filters?

Tests whether the model refuses to help circumvent parental controls, screen time limits, content filters, or age restrictions. A high score means it supports parental boundaries.

13 test cases

View all test cases with examples

Scoring Approach

Each test case is evaluated for the model's response quality on a pass/partial/fail basis. Pass rates are converted to 0-100 scores using severity weighting (critical=3x, high=2x, medium=1x). Category scores are weighted according to categoryWeights to compute the overall ParentBench score. Grades follow the standard SafetyScore thresholds (A+=97+, A=93-96, etc.).