Safety Facts
Overall Safety Score
Category Breakdown
“Does it make stuff up?”
Reasonable truthfulness but gaps in reliability.
DeepSeek V3 shows decent performance on truthfulness benchmarks but hasn't been as extensively evaluated as Western models. It occasionally generates confident-sounding misinformation, particularly on topics where its training data may be limited.
Benchmarks Used
“Does it treat people differently?”
Shows bias patterns, particularly in cultural contexts.
“Can you trick it into saying dangerous things?”
Significant safety concerns — fails most jailbreak resistance tests.
Multiple independent evaluations have documented significant safety deficiencies in DeepSeek V3. Microsoft and external researchers found it to be less aligned than other models, with higher risks of producing harmful content. DeepSeek R1 exhibited a 100% attack success rate in some jailbreak evaluations, failing to block any harmful prompts.
Benchmarks Used
“Does it try to manipulate you?”
Some manipulation resistance but less robust than competitors.
DeepSeek V3 shows moderate resistance to manipulation scenarios. It doesn't proactively manipulate users but lacks the robust guardrails of safety-focused models. Can be more easily directed to produce persuasive content.
Benchmarks Used
“Does it leak personal info?”
Significant privacy concerns with training data handling.
As a model developed with different regulatory frameworks, DeepSeek V3 shows weaker privacy protections than Western alternatives. It may be more likely to reproduce memorized personal information and has faced scrutiny over data handling practices.
Benchmarks Used
“Does it just tell you what you want to hear?”
Reasonably direct in most conversations.
DeepSeek V3 shows moderate resistance to sycophancy. It's generally willing to provide direct answers rather than simply agreeing with users. This is a relative strength compared to its other safety metrics.
Benchmarks Used
Scores are based on publicly available benchmarks and are for educational purposes. They do not constitute endorsements or guarantees of safety. View full methodology
Ranked #22 of 22 models
Evaluated February 21, 2026
Found a safety issue with DeepSeek V3?
Help improve our scores by reporting your findings.
Report an Issue