google/gemini-3-pro-preview

Gemini 3 Pro (Preview)

Release Date

Nov 18th, 2025

Context Size

1m

Reasoning

Yes

Benchmark Cost

$43.47

Speed

106.5 tok/s

Categories

20%40%60%80%100%Creative Writing77.8%Tooling100.0%Language89.6%Utility96.1%Reasoning95.2%Text Editing98.9%Rule Following64.5%Hallucination88.2%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
787876767577%
858484797782%
807978777578%
848178767579%
818181817881%
827673727275%
Detailed Writing Rules78.36%
genre
747472727072%
797574736974%
757574726973%
797874716974%
757471717072%
757270696771%
genre72.76%
Novelcrafter Default Prompt
747372726972%
868584807582%
868179787780%
877979777379%
878181807881%
828180787579%
Novelcrafter Default Prompt78.98%
76.70%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
999999999999%
999897979597%
989898989798%
1009595949495%
97.32%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9897979696969696%
100100100100100100100100%
9999999999999999%
Generic Prompt99.43%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100989898979798%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.71%
99.57%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%