google/gemini-2.5-flash

Gemini 2.5 Flash

Release Date

Jun 17th, 2025

Context Size

1M tokens

Reasoning

No

Benchmark Cost

$1.61

Speed

150.0 tok/s

Categories

Creative Writing: 77.6%
Tooling: 100.0%
Language: 86.2%
Utility: 61.5%
Reasoning: 92.6%
Text Editing: 97.8%
Rule Following: 57.5%
Hallucination: 71.7%

Subcategories

Chart of subcategory scores (axis 20% to 100%): AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose anti-patterns in AI-generated creative writing: passive voice, overuse of the past progressive, weak dialogue tags, filter words, purple prose, clichés, and AI-ism words, adverbs, and names, among others.
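
As a rough illustration of one such check, the sketch below counts filter words in a passage. The word list and the `filter_word_hits` helper are assumptions made for this example; the benchmark's actual detectors, word lists, and scoring formula are not given on this page.

```python
import re
from collections import Counter

# Hypothetical filter-word list, chosen only for illustration.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}


def filter_word_hits(text: str) -> Counter:
    """Count occurrences of each filter word in the text (case-insensitive)."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(w for w in words if w in FILTER_WORDS)


sample = "She felt the cold and noticed the door. She felt watched."
hits = filter_word_hits(sample)
print(hits["felt"], hits["noticed"])  # 2 1
```

A real detector would likely need more than word matching (for example, passive voice requires some grammatical analysis), so this only shows the general shape of a lexical check.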

Scenario  #1  #2  #3  #4  #5  Total

Detailed Writing Rules
          82  78  78  74  72  77%
          80  77  77  76  72  76%
          82  82  76  75  75  78%
          83  81  78  78  75  79%
          85  85  81  77  75  81%
          82  81  81  81  79  81%
Detailed Writing Rules: 78.65%

genre
          79  73  71  70  70  73%
          89  84  78  77  76  81%
          78  77  76  74  69  75%
          81  78  76  76  74  77%
          80  80  73  72  69  75%
          88  86  85  79  68  81%
genre: 76.85%

Novelcrafter Default Prompt
          75  74  74  71  70  73%
          84  82  74  73  72  77%
          78  75  74  74  68  74%
          82  81  81  77  73  79%
          78  77  76  75  71  75%
          86  83  79  79  68  79%
Novelcrafter Default Prompt: 76.20%

Overall: 77.23%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
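
To make "well-formed XML" concrete, here is a minimal sketch that parses a model response with Python's standard `xml.etree.ElementTree` and rejects output that does not parse. The `<codex>`, `<entry>`, and `<name>` element names and the `parse_codex` helper are hypothetical; the schema the benchmark actually expects is not shown on this page.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text: str):
    """Return (type, name) pairs, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    # Element and attribute names below are assumptions for this example.
    return [(e.get("type"), e.findtext("name")) for e in root.iter("entry")]


good = """<codex>
  <entry type="character"><name>Mara</name></entry>
  <entry type="location"><name>Saltmarsh</name></entry>
</codex>"""

# The <name> element is closed with the wrong tag, so parsing fails.
bad = "<codex><entry type='character'><name>Mara</entry></codex>"

print(parse_codex(good))  # [('character', 'Mara'), ('location', 'Saltmarsh')]
print(parse_codex(bad))   # None
```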

Scenario  #1  #2  #3  #4  #5  Total
          97  96  96  95  82  93%
          96  96  95  94  93  95%
          98  97  93  93  92  95%
          95  95  95  94  94  94%

Overall: 94.24%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
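
The per-change scoring described above could be sketched as follows, assuming each expected change is expressed as text that must appear and/or text that must no longer appear in the output. `ExpectedChange`, `change_applied`, and `score_replacement` are hypothetical names for illustration, not the benchmark's implementation.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ExpectedChange:
    """One independently checked change (hypothetical structure)."""
    must_contain: Optional[str] = None      # text that should appear after the edit
    must_not_contain: Optional[str] = None  # text that should be gone after the edit


def change_applied(output: str, change: ExpectedChange) -> bool:
    """Check a single expected change against the model output."""
    if change.must_contain is not None and change.must_contain not in output:
        return False
    if change.must_not_contain is not None and change.must_not_contain in output:
        return False
    return True


def score_replacement(output: str, changes: List[ExpectedChange]) -> float:
    """Return the percentage of expected changes found in the output."""
    if not changes:
        return 100.0
    passed = sum(change_applied(output, c) for c in changes)
    return 100.0 * passed / len(changes)


output = "Mara did not see the harbor. Mara walked home."
changes = [
    ExpectedChange(must_contain="Mara", must_not_contain="Anna"),       # rename applied
    ExpectedChange(must_contain="did not", must_not_contain="didn't"),  # contraction expanded
    ExpectedChange(must_not_contain="harbor"),                          # avoided word still present
]
print(round(score_replacement(output, changes), 1))  # 66.7
```

Checking each change on its own means one missed substitution lowers the score proportionally instead of failing the whole scenario.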

Scenario  #1   #2   #3   #4   #5   #6   #7   Total

Generic Prompt
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          99   99   99   99   99   98   60   94%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          97   96   95   95   94   94   93   95%
          100  100  100  100  100  100  100  100%
          96   96   93   92   92   92   85   92%
Generic Prompt: 97.87%

Specific Prompt
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
          98   98   97   96   96   96   96   97%
          100  100  100  100  100  100  100  100%
          100  100  100  100  100  100  100  100%
Specific Prompt: 99.65%

Overall: 98.76%

Tool usage within Novelcrafter

Tests output messages related to tool usage within Novelcrafter.

Scenario  #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
          100  100  100  100  100  100  100  100  100  100  100%

Overall: 100.00%