qwen/qwen3.5-9b

Qwen 3.5 9B

Release Date

Mar 10th, 2026

Context Size

262k

Reasoning

Yes

Benchmark Cost

$1.20

Speed

73.9 tok/s

Categories

Creative Writing: 84.4%
Tooling: 97.0%
Language: 88.2%
Utility: 94.0%
Reasoning: 92.9%
Text Editing: 85.4%
Rule Following: 61.0%
Hallucination: 85.6%

Subcategories

Chart axis labels (scores not captured): AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
85 82 79 79 79 81%
84 84 83 82 80 82%
87 84 83 83 79 83%
85 84 84 81 78 82%
83 82 82 79 77 81%
91 82 81 79 76 82%
Detailed Writing Rules: 81.83%
genre
81 81 79 79 75 79%
83 83 82 77 69 79%
82 81 80 79 77 80%
85 80 78 75 75 79%
79 78 77 75 73 76%
86 82 77 77 76 80%
genre: 78.76%
Novelcrafter Default Prompt
85 84 83 79 78 82%
89 86 85 78 77 83%
87 82 81 81 80 82%
84 84 84 83 80 83%
88 85 82 81 79 83%
87 86 83 82 82 84%
Novelcrafter Default Prompt: 82.82%
Overall: 81.14%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
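The well-formedness requirement can be illustrated with a minimal sketch. The `<codex>` and `<entry>` element names, the `type` and `name` attributes, and the scoring rule are all hypothetical, chosen only for illustration; the benchmark's actual schema and scoring are not given here.

```python
import xml.etree.ElementTree as ET

def parse_codex(xml_text):
    """Return (name, type) pairs, or None if the XML is not well-formed.

    Assumes a hypothetical layout: <codex><entry type="..." name="..."/></codex>.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    return [(e.get("name"), e.get("type")) for e in root.iter("entry")]

def extraction_score(xml_text, expected):
    """Percentage of expected (name, type) pairs found; 0.0 for malformed XML."""
    if not expected:
        raise ValueError("expected must not be empty")
    found = parse_codex(xml_text)
    if found is None:
        return 0.0
    hits = sum(1 for item in expected if item in found)
    return 100.0 * hits / len(expected)

# Example inputs (made up for illustration).
sample = (
    '<codex>'
    '<entry type="character" name="Mira"/>'
    '<entry type="location" name="Harbor"/>'
    '</codex>'
)
wanted = [("Mira", "character"), ("Harbor", "location"), ("Lantern", "object")]
print(round(extraction_score(sample, wanted), 2))
```

Treating malformed output as a zero is one possible design choice; a real grader might instead attempt partial recovery.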

Scenario #1 #2 #3 #4 #5 Total
96 95 94 94 76 91%
97 97 97 96 90 95%
93 92 90 88 88 90%
98 98 93 90 86 93%
Overall: 92.43%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
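Scoring each expected change independently could look roughly like the sketch below. The function name, the substring-based checks, and the example strings are assumptions for illustration, not the benchmark's actual implementation.

```python
def score_replacement(output, must_contain, must_not_contain):
    """Score a transformed text by checking every expected change on its own.

    Each required string and each forbidden string counts as one check, so a
    single missed change lowers the score without zeroing it.
    """
    checks = [s in output for s in must_contain]
    checks += [s not in output for s in must_not_contain]
    if not checks:
        raise ValueError("at least one expected change is required")
    return 100.0 * sum(checks) / len(checks)

# Hypothetical task: rename "Alice" to "Elena" and expand "didn't" to "did not".
result = "Elena did not know where Alice had gone."
print(score_replacement(result, ["Elena", "did not"], ["Alice", "didn't"]))  # 75.0
```

Here three of the four checks pass, because one occurrence of the old name survives in the output.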

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100 100 100 100 100 100 100 100%
100 100 100 100 100 100 100 100%
99 99 99 96 95 93 61 92%
100 100 99 99 99 99 24 89%
100 100 100 100 100 100 100 100%
100 100 100 99 99 99 74 96%
96 94 82 79 79 79 49 80%
100 100 100 100 100 100 100 100%
96 89 88 87 81 81 67 84%
Generic Prompt: 93.36%
Specific Prompt
100 100 100 100 100 100 100 100%
100 100 100 100 100 100 100 100%
100 99 99 99 99 99 99 99%
100 100 100 100 100 99 99 100%
100 100 100 100 100 100 100 100%
100 100 100 100 99 99 47 92%
99 96 96 95 52 52 39 76%
100 100 100 100 100 100 100 100%
100 100 100 100 100 100 98 100%
Specific Prompt: 96.33%
Overall: 94.85%

Tool usage within Novelcrafter

Evaluates the output messages a model produces for tool usage within Novelcrafter.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100 100 100 100 100 100 100 100 100 67 97%
Overall: 96.67%