qwen/qwen3.5-flash-02-23

Qwen 3.5 Flash

Release Date

Feb 25th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$2.51

Speed

153.9 tok/s

Categories

20%40%60%80%100%Creative Writing83.8%Tooling87.9%Language91.9%Utility96.1%Reasoning94.7%Text Editing92.8%Rule Following63.2%Hallucination80.6%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
868181797881%
868484808083%
858383797882%
838281797880%
898686847985%
858383828183%
Detailed Writing Rules82.22%
genre
817471706973%
848383787881%
828079797880%
888180797881%
828179797880%
828281777780%
genre79.11%
Novelcrafter Default Prompt
828282828082%
868483818083%
858282828082%
898783818084%
868584838284%
878481817882%
Novelcrafter Default Prompt82.87%
81.40%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
979695959496%
999797969096%
969392919092%
999898979798%
95.42%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
10010010010098988397%
100100100100100100100100%
10099918373737385%
9796969595947993%
100100100100100100100100%
9797979793898894%
Generic Prompt96.42%
Specific Prompt
100100100100100100100100%
10010010010010010097100%
9999999999968096%
1001001001001001009199%
100100100100100100100100%
100100100100100100100100%
9898989797942086%
100100100100100100100100%
100100100100100100100100%
Specific Prompt97.83%
97.13%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
1001001001001001001001000080%
80.00%