qwen/qwen3-32b

Qwen 3 32B

Release Date

Apr 28th, 2025

Context Size

41k

Reasoning

No

Benchmark Cost

$0.66

Speed

70.4 tok/s

Categories

20%40%60%80%100%Creative Writing81.3%Tooling97.4%Language84.6%Utility81.7%Reasoning86.3%Text Editing89.9%Rule Following46.8%Hallucination89.6%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
838378696876%
848382818082%
908787868186%
868483827782%
858382807481%
898382818083%
Detailed Writing Rules81.72%
genre
837873696874%
878379767680%
838181807881%
858483838083%
858479797681%
777775747375%
genre78.97%
Novelcrafter Default Prompt
868578777380%
848482797982%
848480787781%
898380787681%
929283817384%
828078777478%
Novelcrafter Default Prompt81.01%
80.57%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
939291908891%
959392909092%
939090908690%
949393918992%
91.12%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100929283756987%
1001001001001001008398%
9898959490847591%
9999999999111174%
100100100100100969699%
100100100100100100100100%
9191908887837987%
1001001001001001008898%
9696969389887090%
Generic Prompt91.45%
Specific Prompt
100100100100100978998%
100100100100100100100100%
10099999999997696%
1001001009999999899%
100100100100100100100100%
100100100100100100100100%
9896959594934388%
10010010010010010098100%
1001001009797969398%
Specific Prompt97.63%
94.54%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
1001001001001001001001001006797%
96.67%