google/gemini-3.5-flash

Gemini 3.5 Flash (Reasoning)

Release Date: May 19th, 2026
Context Size: 1M tokens
Reasoning: Yes
Benchmark Cost: $36.30
Speed: 222.2 tok/s

Categories

Creative Writing: 79.9%
Tooling: 100.0%
Language: 94.4%
Utility: 98.9%
Reasoning: 98.5%
Text Editing: 97.8%
Rule Following: 92.0%
Hallucination: 91.2%

Subcategories

[Chart: per-subcategory scores on a 20% to 100% axis. Labels: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
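
As a rough illustration of how this kind of anti-pattern detection might work, the sketch below counts two of the habits named above. It is not the benchmark's detector: the word list, the regular expression, and the `count_habits` function are hypothetical stand-ins, and adverb-laden dialogue tags are used here as one simple proxy for weak dialogue tags.

```python
import re

# Hypothetical word list for illustration; the benchmark's real lists are not given.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}

# Assumed proxy for a weak dialogue tag: a plain speech verb followed by an -ly adverb.
ADVERB_TAG_PATTERN = re.compile(r"\b(?:said|asked|replied)\s+\w+ly\b", re.IGNORECASE)


def count_habits(text: str) -> dict[str, int]:
    """Count filter words and adverb-laden dialogue tags in a passage."""
    words = re.findall(r"[a-z']+", text.lower())
    return {
        "filter_words": sum(1 for w in words if w in FILTER_WORDS),
        "adverb_dialogue_tags": len(ADVERB_TAG_PATTERN.findall(text)),
    }


sample = 'She felt the cold and noticed the door. "Leave," he said quietly.'
print(count_habits(sample))  # {'filter_words': 2, 'adverb_dialogue_tags': 1}
```

A real detector would presumably normalize such counts by passage length before turning them into a percentage score; that step is omitted here.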

Scenario                      #1   #2   #3   #4   #5   Total
Detailed Writing Rules
                              79   78   77   77   76   77%
                              86   84   83   81   79   82%
                              87   80   80   78   77   80%
                              89   83   83   82   81   83%
                              80   80   79   76   75   78%
                              84   82   82   81   79   82%
Detailed Writing Rules                                 80.58%
genre
                              77   76   75   73   67   74%
                              85   80   78   71   71   77%
                              81   80   74   73   73   76%
                              79   76   74   74   71   75%
                              79   79   77   70   70   75%
                              85   83   83   79   76   81%
genre                                                  76.27%
Novelcrafter Default Prompt
                              82   81   79   77   73   78%
                              86   83   83   81   78   82%
                              82   82   81   80   76   80%
                              84   84   81   81   81   82%
                              79   78   74   72   72   75%
                              91   88   87   83   74   85%
Novelcrafter Default Prompt                            80.42%
Overall                                                79.09%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
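
To make "well-formed XML" concrete, here is a minimal sketch of how extracted entries might be validated and read back using Python's standard library. The `<codex>`/`<entry>` layout, the `type` attribute, and the function names are assumptions made for illustration; the schema the benchmark actually expects is not shown in the source.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text: str) -> bool:
    """Return True if the text parses as XML, False otherwise."""
    try:
        ET.fromstring(xml_text.strip())
        return True
    except ET.ParseError:
        return False


def parse_codex(xml_text: str) -> list[dict[str, str]]:
    """Read entries from a hypothetical <codex><entry>...</entry></codex> layout."""
    root = ET.fromstring(xml_text.strip())
    entries = []
    for entry in root.iter("entry"):
        entries.append({
            "type": entry.get("type", ""),
            "name": (entry.findtext("name") or "").strip(),
            "description": (entry.findtext("description") or "").strip(),
        })
    return entries


# Invented sample output, not taken from the benchmark.
sample = """
<codex>
  <entry type="character">
    <name>Mara</name>
    <description>A lighthouse keeper.</description>
  </entry>
  <entry type="location">
    <name>Greywater Point</name>
    <description>A rocky headland.</description>
  </entry>
</codex>
"""

names = [e["name"] for e in parse_codex(sample)]
print(names)  # ['Mara', 'Greywater Point']
print(is_well_formed("<codex><entry></codex>"))  # False (mismatched tags)
```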

Scenario   #1    #2    #3    #4    #5    Total
           99    99    99    99    98    99%
           99    98    97    97    96    98%
           99    99    99    98    98    99%
           100   99    99    99    95    98%
Overall                                  98.31%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
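
Checking each expected change independently can be pictured with the following sketch. It is an assumption about how such scoring might be structured, not the benchmark's scorer: the `ExpectedChange` type, the `score_replacement` function, and the simple substring checks are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ExpectedChange:
    """One expected edit (hypothetical structure, not from the benchmark)."""
    must_contain: str = ""      # text that should appear after the rewrite
    must_not_contain: str = ""  # text that should be gone after the rewrite


def score_replacement(output: str, changes: list[ExpectedChange]) -> float:
    """Return the percentage of expected changes satisfied, each checked on its own."""
    if not changes:
        return 100.0
    passed = 0
    for change in changes:
        ok = True
        if change.must_contain and change.must_contain not in output:
            ok = False
        if change.must_not_contain and change.must_not_contain in output:
            ok = False
        passed += ok
    return 100.0 * passed / len(changes)


# Invented example: rename "Anna" to "Mara" and expand "didn't" to "did not".
output = "Mara did not open the door. Anna waited."
changes = [
    ExpectedChange(must_contain="Mara"),
    ExpectedChange(must_not_contain="Anna"),  # fails: one "Anna" was missed
    ExpectedChange(must_contain="did not", must_not_contain="didn't"),
]
print(f"{score_replacement(output, changes):.0f}%")  # 67%
```

Because every change is scored separately, a single missed rename lowers the score without zeroing the whole run.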

Scenario          #1    #2    #3    #4    #5    #6    #7    Total
Generic Prompt
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  99    99    99    99    99    99    99    99%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   99    100%
                  98    98    98    97    97    96    95    97%
                  100   100   100   100   100   100   100   100%
                  99    99    99    99    99    99    99    99%
Generic Prompt                                              99.44%
Specific Prompt
                  100   100   100   100   100   100   67    95%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
                  100   100   98    98    97    97    97    98%
                  100   100   100   100   100   100   100   100%
                  100   100   100   100   100   100   100   100%
Specific Prompt                                             99.24%
Overall                                                     99.34%

Tool usage within Novelcrafter

Evaluates the output messages a model produces for tool usage within Novelcrafter.

Scenario   #1    #2    #3    #4    #5    #6    #7    #8    #9    #10   Total
           100   100   100   100   100   100   100   100   100   100   100%
Overall                                                                100.00%