qwen/qwen3-235b-a22b-2507

Qwen3 235B A22B Instruct 2507

Release Date

Jul 21st, 2025

Context Size

262.1k

Reasoning

No

Benchmark Cost

$0.41

Speed

40.3 tok/s

Categories

Creative Writing: 84.8%
Tooling: 99.2%
Language: 60.8%
Utility: 83.1%
Reasoning: 85.8%
Text Editing: 91.8%
Rule Following: 65.4%
Hallucination: 69.8%

Subcategories

[Chart: subcategory scores for AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
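
As a rough illustration of one check of this kind, the sketch below counts filter words per 100 words of prose. The word list, the function name `filter_word_rate`, and the per-100-words rate are all assumptions made for this example; the benchmark's actual rules and scoring are not described here.

```python
import re

# Hypothetical filter-word list, chosen only for illustration.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}


def filter_word_rate(text: str) -> float:
    """Return the number of filter words per 100 words (0.0 for empty text)."""
    # Lowercase the text and split it into simple word tokens.
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for word in words if word in FILTER_WORDS)
    return 100.0 * hits / len(words)


if __name__ == "__main__":
    print(filter_word_rate("She felt the cold and saw the door."))
```

A real detector would presumably combine many such checks (passive voice, dialogue tags, clichés) and weight them, which this sketch does not attempt.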

Scenario                       #1   #2   #3   #4   #5   Total
Detailed Writing Rules
                               90   90   82   81   77   84%
                               93   88   86   85   82   87%
                               92   91   90   89   85   89%
                               89   89   86   86   84   87%
                               90   89   86   85   85   87%
                               92   92   91   91   90   91%
Detailed Writing Rules                                  87.64%
genre
                               82   79   78   77   75   78%
                               88   83   80   77   75   81%
                               88   85   83   82   79   83%
                               85   84   81   79   78   81%
                               87   84   84   81   77   83%
                               92   87   85   83   81   86%
genre                                                   82.02%
Novelcrafter Default Prompt
                               82   81   80   80   78   80%
                               90   87   84   84   82   85%
                               91   89   87   84   81   86%
                               89   85   84   83   82   85%
                               87   86   84   79   79   83%
                               90   90   87   86   81   87%
Novelcrafter Default Prompt                             84.42%
Overall                                                 84.69%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
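
A minimal sketch of the well-formedness part of such a check, using Python's standard `xml.etree.ElementTree` parser. The `<codex>`/`<entry>`/`<name>` layout and the function name `parse_codex` are hypothetical; the schema the benchmark actually expects is not given here.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text: str):
    """Return a list of (type, name) pairs, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Unclosed tags, stray text after the root element, etc.
        return None
    entries = []
    # Hypothetical layout: <entry type="character"><name>...</name></entry>
    for entry in root.iter("entry"):
        entries.append((entry.get("type"), entry.findtext("name")))
    return entries


if __name__ == "__main__":
    good = '<codex><entry type="character"><name>Mira</name></entry></codex>'
    print(parse_codex(good))
    print(parse_codex("<codex><entry>"))
```

Parsing first and comparing entries second keeps a malformed response from being scored as a partially correct one.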

Scenario  #1   #2   #3   #4   #5   Total
          92   92   92   91   90   92%
          93   92   91   90   65   86%
          95   94   92   91   90   92%
          93   93   91   90   90   91%
Overall                            90.37%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
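
One way to picture "checking each expected change independently" is a score where every expected change is a separate pass/fail check and the result is the percentage that passed. The sketch below assumes simple substring checks and a hypothetical function name `score_replacement`; the benchmark's real checks may be stricter.

```python
from typing import Sequence


def score_replacement(
    output: str,
    must_contain: Sequence[str],
    must_not_contain: Sequence[str],
) -> float:
    """Return the percentage of independent checks that the output passes."""
    # Each expected new string must appear; each old string must be gone.
    checks = [s in output for s in must_contain]
    checks += [s not in output for s in must_not_contain]
    if not checks:
        return 100.0
    return 100.0 * sum(checks) / len(checks)


if __name__ == "__main__":
    # Renaming Alice -> Anna and Oldtown -> Riverton; only the first rename was applied.
    print(score_replacement(
        "Anna walked to Oldtown.",
        must_contain=["Anna", "Riverton"],
        must_not_contain=["Alice", "Oldtown"],
    ))
```

Under this scheme a response that applies some changes and misses others earns partial credit, which would be consistent with the non-round scores in the table.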

Scenario          #1    #2    #3    #4    #5    #6    #7    Total
Generic Prompt
                  100   100   97    97    97    97    97    98%
                  100   100   100   100   100   100   100   100%
                  94    89    86    84    84    82    81    86%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  91    91    91    91    91    91    91    91%
                  86    86    83    78    77    73    73    79%
                  99    99    99    79    78    76    74    86%
                  100   100   100   100   100   99    99    100%
Generic Prompt                                              93.33%
Specific Prompt
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  99    99    99    99    99    99    99    99%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  95    95    95    95    94    93    93    94%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
Specific Prompt                                             99.29%
Overall                                                     96.31%

Tool usage within Novelcrafter

Evaluates the model's output messages related to tool usage within Novelcrafter.

Scenario  #1    #2    #3    #4    #5    #6    #7    #8    #9    #10   Total
          100   100   100   100   100   100   100   100   100   100   100%
Overall                                                               100.00%