google/gemini-3-flash-preview

Gemini 3 Flash (Preview, Reasoning)

Release Date

Dec 17th, 2025

Context Size

1M tokens

Reasoning

Yes

Benchmark Cost

$9.60

Speed

162.3 tok/s

Categories

Creative Writing: 75.9%
Tooling: 100.0%
Language: 94.9%
Utility: 97.2%
Reasoning: 98.1%
Text Editing: 98.1%
Rule Following: 74.5%
Hallucination: 85.3%

Subcategories

AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose-quality anti-patterns in AI-generated creative writing, including passive voice, overuse of the past progressive, weak dialogue tags, filter words, purple prose, clichés, and AI-ism words, adverbs, and names, among others.
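One of the listed anti-patterns, filter words, can be illustrated with a small counter. This is only a sketch of the general idea: the word list `FILTER_WORDS` and the function `filter_word_rate` are hypothetical and are not taken from the benchmark, whose actual detection rules are not shown on this page.

```python
import re

# Hypothetical word list for illustration; the benchmark's own list is not published here.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}


def filter_word_rate(text: str) -> float:
    """Return the fraction of words in `text` that appear in FILTER_WORDS."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for word in words if word in FILTER_WORDS)
    return hits / len(words)


sample = "She felt the cold and noticed the door was open."
print(f"{filter_word_rate(sample):.2f}")
```

A real detector would likely need lemmatization and context (for example, "felt" as a noun), which this sketch ignores.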

Scenario                         #1   #2   #3   #4   #5    Total
Detailed Writing Rules
                                 83   82   77   76   69      78%
                                 78   77   77   75   67      75%
                                 80   79   77   76   76      78%
                                 80   78   75   75   72      76%
                                 85   82   79   79   76      80%
                                 83   78   76   73   69      76%
Detailed Writing Rules                                    76.99%
genre
                                 71   71   70   69   68      70%
                                 74   73   70   69   68      71%
                                 76   75   74   71   71      73%
                                 78   73   72   68   66      72%
                                 74   73   72   71   65      71%
                                 73   72   71   71   70      71%
genre                                                     71.35%
Novelcrafter Default Prompt
                                 80   79   77   76   73      77%
                                 81   80   76   76   76      78%
                                 82   82   82   77   75      79%
                                 80   75   71   70   69      73%
                                 82   80   79   76   75      78%
                                 78   77   76   75   67      75%
Novelcrafter Default Prompt                               76.68%
Overall                                                   75.01%
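The final 75.01% figure is consistent with the unweighted mean of the three prompt-group scores (76.99%, 71.35%, 76.68%). That reading is an inference from the numbers, not something the page states; the variable names below are illustrative.

```python
from statistics import mean

# Group scores as reported on this page; the dictionary keys are just labels.
group_scores = {
    "Detailed Writing Rules": 76.99,
    "genre": 71.35,
    "Novelcrafter Default Prompt": 76.68,
}

# Assumption: the overall score is the plain average of the group scores.
overall = round(mean(group_scores.values()), 2)
print(overall)
```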

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
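A well-formedness check of this kind can be sketched with Python's standard XML parser. The element and attribute names used here (`codex`, `entry`, `type`, `name`) are assumptions made for illustration; the benchmark's real schema is not given on this page.

```python
import xml.etree.ElementTree as ET

# Entry kinds named in the description above; the XML vocabulary is hypothetical.
ALLOWED_TYPES = {"character", "location", "object", "lore"}


def parse_codex(xml_text: str) -> list[dict[str, str]]:
    """Parse model output, raising ValueError if it is malformed or has an unknown type."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        raise ValueError(f"not well-formed XML: {exc}") from exc

    entries = []
    for entry in root.iter("entry"):
        entry_type = entry.get("type", "")
        if entry_type not in ALLOWED_TYPES:
            raise ValueError(f"unknown entry type: {entry_type!r}")
        entries.append({"type": entry_type, "name": entry.findtext("name", default="")})
    return entries


sample = """<codex>
  <entry type="character"><name>Mara</name></entry>
  <entry type="location"><name>Harbor Gate</name></entry>
</codex>"""
print(parse_codex(sample))
```

Rejecting malformed output outright, rather than attempting repair, keeps the check deterministic; whether the benchmark does the same is not stated.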

Scenario     #1   #2   #3   #4   #5    Total
             99   99   99   98   98      99%
             99   98   98   98   97      98%
             99   98   98   96   95      97%
             99   99   99   96   95      98%
Overall                               97.89%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
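Scoring each expected change independently can be pictured as a list of small pass/fail checks whose pass rate becomes the score. The sketch below assumes simple substring checks; `ExpectedChange` and `score_replacement` are hypothetical names, and the benchmark's actual checks may be more elaborate.

```python
from dataclasses import dataclass


@dataclass
class ExpectedChange:
    """One hypothetical check: a string that must be gone and one that must appear."""
    must_not_contain: str
    must_contain: str


def score_replacement(output: str, changes: list[ExpectedChange]) -> float:
    """Return the percentage of expected changes that the output satisfies."""
    if not changes:
        return 100.0
    passed = sum(
        1
        for change in changes
        if change.must_contain in output and change.must_not_contain not in output
    )
    return 100.0 * passed / len(changes)


# Two checks: a character rename and a contraction expansion.
changes = [
    ExpectedChange(must_not_contain="Alice", must_contain="Mara"),
    ExpectedChange(must_not_contain="didn't", must_contain="did not"),
]
output = "Mara didn't answer."
print(score_replacement(output, changes))  # rename passes, contraction check fails
```

Because every check is evaluated separately, a partly correct rewrite earns partial credit instead of failing as a whole.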

Scenario                         #1   #2   #3   #4   #5   #6   #7    Total
Generic Prompt
                                100  100  100  100  100  100  100     100%
                                100  100  100  100  100  100  100     100%
                                 99   99   99   99   99   99   98      99%
                                100  100  100  100  100  100  100     100%
                                100  100  100  100  100  100  100     100%
                                100  100  100  100  100  100  100     100%
                                 96   96   96   95   94   94   94      95%
                                100  100  100  100  100  100  100     100%
                                 99   99   99   99   99   96   92      98%
Generic Prompt                                                      99.07%
Specific Prompt
                                100  100  100  100  100  100  100     100%
                                100  100  100  100  100  100  100     100%
                                100  100   99   99   99   99   99     100%
                                100  100  100  100  100  100  100     100%
                                100  100  100  100  100  100  100     100%
                                100  100  100  100  100  100   98     100%
                                 98   97   97   97   97   97   96      97%
                                100  100  100  100  100  100  100     100%
                                100  100  100  100  100  100  100     100%
Specific Prompt                                                     99.57%
Overall                                                             99.32%

Tool usage within Novelcrafter

Evaluates output messages related to tool usage within Novelcrafter.

Scenario     #1   #2   #3   #4   #5   #6   #7   #8   #9  #10    Total
            100  100  100  100  100  100  100  100  100  100     100%
Overall                                                       100.00%