deepseek/deepseek-chat-v3-0324
DeepSeek V3 (2025-03-24) via OpenRouter
Release Date
Mar 24th, 2025Parameters
–Context Size
163.8kBenchmark Cost
$0.31Speed
33.8 tok/sCreative writing
37.44%Rule following
73.06%Utility
83.77%Mathematics
80.00%Tooling
78.03%Language
80.76%Logic
86.28%Codex Violation Detection
Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| matrix | |||||||||||
| 90% | 88% | 79% | 78% | 77% | 74% | 72% | 67% | 67% | 3% | 70% | |
| 96% | 96% | 87% | 87% | 85% | 83% | 83% | 78% | 72% | 3% | 77% | |
| 93% | 93% | 93% | 93% | 89% | 89% | 89% | 89% | 87% | 82% | 89% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 93% | 99% | |
| tiers | |||||||||||
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 93% | 93% | 99% | |
| 100% | 100% | 100% | 94% | 94% | 94% | 92% | 86% | 86% | 86% | 93% | |
| 86% | 86% | 82% | 82% | 77% | 77% | 73% | 73% | 73% | 51% | 76% | |
| 93% | 92% | 88% | 87% | 83% | 79% | 75% | 75% | 74% | 71% | 82% | |
| 85.69% | |||||||||||
Data extraction
Extract key details from a given block of text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 95% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | 50% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 50% | 95% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 50% | 50% | 50% | 50% | 80% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 70% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 90.83% | |||||||||||
Dialogue tags
Various tasks related to dialogue tags in text.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| dialogue-200 | |||||||||||
| 53% | 50% | 50% | 50% | 49% | 41% | 38% | 22% | 0% | 0% | 35% | |
| 76% | 66% | 64% | 58% | 51% | 50% | 49% | 47% | 22% | 10% | 49% | |
| 86% | 73% | 53% | 50% | 50% | 49% | 47% | 32% | 26% | 0% | 47% | |
| dialogue-500 | |||||||||||
| 49% | 12% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 6% | |
| 54% | 30% | 7% | 1% | 1% | 0% | 0% | 0% | 0% | 0% | 9% | |
| 49% | 49% | 38% | 6% | 5% | 5% | 2% | 1% | 1% | 0% | 16% | |
| Ungrouped | |||||||||||
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 37.44% | |||||||||||
Language Comprehension
Does the model understand more than just English?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | |
| 100.00% | ||||||
Language Writing
Can the model generate text in different languages?
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Total |
|---|---|---|---|---|---|---|
| 89% | 78% | 67% | 67% | 50% | 70% | |
| 100% | 100% | 91% | 83% | 77% | 90% | |
| 100% | 100% | 100% | 56% | 0% | 71% | |
| 90% | 78% | 63% | 0% | 0% | 46% | |
| 87% | 86% | 75% | 0% | 0% | 49% | |
| 65.38% | ||||||
N-Length Sentences
Write sentences with exactly N words
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 88% | 81% | 97% | |
| 97% | 97% | 92% | 92% | 84% | 79% | 77% | 75% | 75% | 74% | 84% | |
| 74% | 58% | 51% | 39% | 35% | 28% | 18% | 11% | 8% | 0% | 32% | |
| 71.13% | |||||||||||
Novel outline
Handle questions about the outline of a novel in various formats
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| outline-count | |||||||||||
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| pov-count | |||||||||||
| 100% | 50% | 50% | 50% | 0% | 0% | 0% | 0% | 0% | 0% | 25% | |
| 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 50% | |
| 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 50% | |
| 65.83% | |||||||||||
Tool usage within Novelcrafter
Output messages that are related to tool usage within Novelcrafter
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 90.00% | |||||||||||
Voice/dialogue sheets
Extract dialogue from given text as voice sheets.
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 100% | 100% | 100% | 100% | 100% | 0% | 0% | 0% | 0% | 0% | 50% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% | |
| 84.00% | |||||||||||
Write N of X
Write exactly N words/sentences/paragraphs...
| Scenario | Run 1 | Run 2 | Run 3 | Run 4 | Run 5 | Run 6 | Run 7 | Run 8 | Run 9 | Run 10 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| paragraphs | |||||||||||
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 0% | 80% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| sentences | |||||||||||
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 0% | 90% | |
| words | |||||||||||
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 92% | 99% | |
| 100% | 100% | 100% | 100% | 100% | 100% | 100% | 100% | 98% | 98% | 100% | |
| 100% | 92% | 92% | 77% | 77% | 54% | 27% | 9% | 0% | 0% | 53% | |
| 100% | 92% | 92% | 77% | 77% | 77% | 54% | 27% | 2% | 0% | 60% | |
| 100% | 98% | 27% | 0% | 0% | 0% | 0% | 0% | 0% | 0% | 23% | |
| 84.91% | |||||||||||