google/gemini-3-flash-preview

Gemini 3 Flash (Preview)

Release Date

Dec 17th, 2025

Context Size

1M tokens

Reasoning

No

Benchmark Cost

$2.84

Speed

111.3 tok/s

Categories

Creative Writing: 75.0%
Tooling: 97.6%
Language: 95.0%
Utility: 86.4%
Reasoning: 94.8%
Text Editing: 97.5%
Rule Following: 65.1%
Hallucination: 71.2%

Subcategories

Subcategory chart (axis 20% to 100%; per-subcategory values not captured): AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
73 72 70 69 69 71%
78 76 74 74 74 75%
79 76 74 73 70 75%
78 75 74 71 69 73%
79 78 76 73 72 76%
79 75 73 71 71 74%
Detailed Writing Rules: 73.98%
genre
72 72 69 68 66 70%
77 76 73 73 71 74%
81 77 73 70 68 74%
74 72 72 71 68 72%
76 74 73 71 69 72%
75 72 72 68 68 71%
genre: 72.10%
Novelcrafter Default Prompt
83 79 73 73 71 76%
80 76 75 75 73 76%
81 80 78 78 77 79%
80 79 78 74 73 77%
77 76 74 74 71 74%
83 76 75 73 69 75%
Novelcrafter Default Prompt: 76.19%
Overall: 74.09%
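
The section total of 74.09% appears to be the unweighted mean of the three per-prompt averages (73.98%, 72.10%, 76.19%). The page does not state the aggregation rule, so treat this as an inference from the published numbers. A minimal check, where `unweighted_mean` is a hypothetical helper name:

```python
# Per-prompt averages as published for the Bad Writing Habits benchmark.
group_averages = {
    "Detailed Writing Rules": 73.98,
    "genre": 72.10,
    "Novelcrafter Default Prompt": 76.19,
}


def unweighted_mean(values):
    """Arithmetic mean with equal weight per group (assumed aggregation rule)."""
    values = list(values)
    return sum(values) / len(values)


overall = round(unweighted_mean(group_averages.values()), 2)
print(f"{overall:.2f}%")
```

The result matches the published 74.09%, which is consistent with equal weighting per prompt rather than per row.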

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
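
As an illustration of what "well-formed XML" means in practice, the sketch below parses a response and rejects it when the markup is broken. The `<codex>`, `<entry>`, and `<name>` element names and the `type` attribute are assumptions made for the example; the benchmark's actual schema is not shown on this page.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text):
    """Return a list of (type, name) pairs, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    # Hypothetical schema: <codex><entry type="..."><name>...</name></entry></codex>
    entries = []
    for entry in root.iter("entry"):
        name = entry.findtext("name", default="").strip()
        entries.append((entry.get("type"), name))
    return entries


good = '<codex><entry type="character"><name>Mira</name></entry></codex>'
bad = '<codex><entry type="location"><name>Harbor</entry></codex>'  # <name> never closed

print(parse_codex(good))
print(parse_codex(bad))
```

The second input fails because the closing tag does not match the open `<name>` element, so the parser raises `ParseError` and the helper returns `None`.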

Scenario #1 #2 #3 #4 #5 Total
99 98 98 97 97 98%
98 95 95 94 94 95%
99 99 98 98 94 98%
99 99 98 98 96 98%
Overall: 97.11%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
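
Scoring each expected change independently could look roughly like the sketch below: every required phrase and every forbidden phrase is its own pass/fail check, and the score is the fraction that pass. The function name, the substring-based checks, and the sample sentences are assumptions for illustration, not the benchmark's actual grader.

```python
def score_replacement(output, must_contain, must_not_contain):
    """Score a rewrite as the fraction of independent checks that pass."""
    checks = [phrase in output for phrase in must_contain]
    checks += [phrase not in output for phrase in must_not_contain]
    # With no checks defined there is nothing to fail, so treat it as a full score.
    return sum(checks) / len(checks) if checks else 1.0


# Hypothetical task: rename Anna to Elena, remove Marcus, expand contractions.
output = "Elena did not open the gate. Marcus waited."
score = score_replacement(
    output,
    must_contain=["Elena", "did not"],
    must_not_contain=["Anna", "didn't", "Marcus"],
)
print(f"{score:.0%}")
```

Here four of the five checks pass (the leftover "Marcus" fails), so one missed change lowers the score without zeroing it.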

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100 100 100 100 100 100 100 100%
100 100 100 100 100 100 100 100%
99 99 99 98 98 98 98 98%
100 100 100 100 100 100 100 100%
100 100 96 96 96 96 96 97%
100 100 100 100 100 100 100 100%
95 95 95 94 94 94 91 94%
100 100 100 100 100 100 100 100%
97 97 97 96 96 95 95 96%
Generic Prompt: 98.41%
Specific Prompt
100 100 100 100 100 100 100 100%
100 100 100 100 100 100 100 100%
98 98 98 98 98 98 98 98%
100 100 100 100 100 100 100 100%
100 100 100 100 100 100 100 100%
100 100 100 100 100 100 100 100%
99 99 99 98 98 98 98 99%
100 100 100 100 100 100 100 100%
100 100 100 99 99 99 97 99%
Specific Prompt: 99.59%
Overall: 99.00%

Tool usage within Novelcrafter

Scores the output messages a model produces for tool usage within Novelcrafter.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100 100 100 100 100 100 100 100 100 67 97%
Overall: 96.67%