google/gemini-2.5-pro

Gemini 2.5 Pro

Release Date

Jun 17th, 2025

Context Size

1m

Reasoning

Yes

Benchmark Cost

$28.34

Speed

114.2 tok/s

Categories

20%40%60%80%100%Creative Writing81.0%Tooling100.0%Language92.6%Utility92.2%Reasoning96.9%Text Editing98.6%Rule Following60.9%Hallucination86.1%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
828079766576%
888885838285%
838281817180%
848280757579%
808079787779%
878583797482%
Detailed Writing Rules80.13%
genre
807776767376%
828180777579%
818079777378%
848278777479%
827978767378%
898382817983%
genre78.71%
Novelcrafter Default Prompt
818075737276%
868383787882%
797878777678%
868481817882%
858180797480%
858383797882%
Novelcrafter Default Prompt79.86%
79.57%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
989797969597%
989797969697%
989898989397%
1009996969698%
97.07%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9897969594949395%
100100100100100100100100%
9996969696969596%
Generic Prompt98.99%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
1001001001001009999100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
1001001009999989899%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.89%
99.44%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%