google/gemini-3.1-pro-preview

Gemini 3.1 Pro (Preview)

Release Date

Feb 19th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$51.24

Speed

84.9 tok/s

Categories

20%40%60%80%100%Creative Writing85.4%Tooling99.9%Language94.9%Utility99.9%Reasoning96.0%Text Editing98.5%Rule Following91.2%Hallucination89.1%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
888584807983%
888787868486%
848483828083%
868482828283%
878484838184%
868484848184%
Detailed Writing Rules83.89%
genre
878178787680%
908683828285%
898280767480%
918880787582%
858477747379%
928988857886%
genre82.00%
Novelcrafter Default Prompt
858483838283%
888787878687%
848483838383%
848381818182%
858483828083%
888787868687%
Novelcrafter Default Prompt84.26%
83.38%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
989797967492%
999998989698%
999999989698%
959594949194%
95.49%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9797969696959596%
100100100100100100100100%
9999999999999999%
Generic Prompt99.28%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
1001001001001009999100%
100100100100100100100100%
1001001001001001006094%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.25%
99.27%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%