google/gemini-3.1-pro-preview

Gemini 3.1 Pro (Preview)

Release Date: Feb 19th, 2026
Context Size: 1M tokens
Benchmark Cost: $33.13
Speed: 82.0 tok/s

Creative writing: 87.93%
Rule following: 93.71%
Utility: 93.24%
Mathematics: 100.00%
Tooling: 99.58%
Language: 94.89%
Logic: 94.61%

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
0-shot (Creative writing, Rule following) 88 85 84 80 79 83%
0-shot (Creative writing, Rule following) 88 87 87 86 84 86%
0-shot (Creative writing, Rule following) 85 84 83 82 80 83%
0-shot (Creative writing, Rule following) 86 84 82 82 82 83%
0-shot (Creative writing, Rule following) 87 84 84 83 81 84%
0-shot (Creative writing, Rule following) 86 84 84 84 81 84%
Detailed Writing Rules: 83.91%
genre
0-shot (Creative writing, Rule following) 87 81 79 79 77 80%
0-shot (Creative writing, Rule following) 91 86 84 83 82 85%
0-shot (Creative writing, Rule following) 89 82 80 76 74 80%
0-shot (Creative writing, Rule following) 91 88 82 79 75 83%
0-shot (Creative writing, Rule following) 85 84 77 76 73 79%
0-shot (Creative writing, Rule following) 92 89 88 85 78 87%
genre: 82.37%
Novelcrafter Default Prompt
0-shot (Creative writing, Rule following) 85 84 83 83 82 83%
0-shot (Creative writing, Rule following) 88 87 87 87 86 87%
0-shot (Creative writing, Rule following) 84 84 83 83 83 83%
0-shot (Creative writing, Rule following) 84 83 81 81 81 82%
0-shot (Creative writing, Rule following) 85 84 83 82 80 83%
0-shot (Creative writing, Rule following) 88 87 87 86 86 87%
Novelcrafter Default Prompt: 84.27%
Overall: 83.52%

Codex Violation Detection

Detects factual inconsistencies between a story bible and prose passages. The model must output structured XML identifying each violation with paragraph number and substring.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
matrix
0-shot (Tooling, Utility, Logic, Rule following) 98 98 97 97 97 96 96 96 95 94 96%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 97 97 97 97 99%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 100 100 97 95 99%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
matrix: 98.66%
tiers
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 100 100 96 96 99%
0-shot (Tooling, Utility, Logic, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
tiers: 99.79%
Overall: 99.22%
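
The description above says the model must emit structured XML naming each violation by paragraph number and substring. The page does not show the actual schema, so the sketch below assumes a hypothetical layout (a `violations` root with `violation` elements carrying a `paragraph` attribute and the offending substring as text). The helper names are illustrative only and are not part of the benchmark.

```python
import xml.etree.ElementTree as ET


def parse_violations(xml_text):
    """Return (paragraph_number, substring) pairs.

    Assumes a hypothetical schema:
    <violations><violation paragraph="2">text</violation></violations>
    """
    root = ET.fromstring(xml_text)
    results = []
    for node in root.iter("violation"):
        paragraph = int(node.get("paragraph"))
        substring = (node.text or "").strip()
        results.append((paragraph, substring))
    return results


def substrings_present(violations, paragraphs):
    """Check that every reported substring occurs in the paragraph it cites (1-indexed)."""
    return all(
        1 <= number <= len(paragraphs) and text in paragraphs[number - 1]
        for number, text in violations
    )
```

A grader built along these lines could reject any answer whose substring does not appear verbatim in the cited paragraph before comparing it against the expected violations.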

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
dialogue-200
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 89 99%
dialogue-200: 99.64%
dialogue-500
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 60 96%
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
dialogue-500: 98.66%
Ungrouped
0-shot (Creative writing, Rule following) 100 100 100 100 100 100 100 100 100 100 100%
Overall: 99.27%

Language Comprehension

Does the model understand more than just English?

Scenario #1 #2 #3 #4 #5 Total
0-shot (Language) 100 100 100 100 100 100%
0-shot (Language) 100 100 100 100 100 100%
0-shot (Language) 100 100 100 100 0 80%
0-shot (Language) 100 100 100 100 100 100%
Overall: 95.00%

Language Writing

Can the model generate text in different languages?

Scenario #1 #2 #3 #4 #5 Total
0-shot (Language) 100 100 100 100 100 100%
0-shot (Language) 100 100 100 100 100 100%
0-shot (Language) 100 100 67 53 50 74%
0-shot (Language) 100 100 100 100 100 100%
0-shot (Language) 100 100 100 100 100 100%
Overall: 94.80%
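
The per-row totals and the section figure above are consistent with plain arithmetic means of the displayed scores. The sketch below recomputes them for this section. That the site aggregates this way is an assumption, and in sections where the displayed scenario scores are rounded, recomputed figures may differ slightly from the published ones.

```python
from statistics import mean

# Scenario scores for the five rows of this section, as displayed above.
rows = [
    [100, 100, 100, 100, 100],
    [100, 100, 100, 100, 100],
    [100, 100, 67, 53, 50],
    [100, 100, 100, 100, 100],
    [100, 100, 100, 100, 100],
]

# Assumed aggregation: each row total is the mean of its scenarios,
# and the section score is the mean of the row totals.
row_totals = [mean(scores) for scores in rows]
section_score = mean(row_totals)

print(row_totals)
print(f"{section_score:.2f}%")
```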

N-Length Sentences

Write sentences with exactly N words.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
Overall: 100.00%
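
A rule such as "exactly N words" can be verified mechanically. The sketch below is one possible checker, not the benchmark's actual grader: it assumes a word is any whitespace-separated token, so attached punctuation and hyphenated compounds each count as part of a single word. The function names are hypothetical.

```python
def word_count(sentence):
    """Count whitespace-separated tokens (assumed definition of a word)."""
    return len(sentence.split())


def has_exactly_n_words(sentence, n):
    """Return True when the sentence contains exactly n words."""
    return word_count(sentence) == n
```

For instance, `has_exactly_n_words("The cat sat on the mat.", 6)` evaluates to `True` under this definition.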

Novel outline

Handle questions about the outline of a novel in various formats.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
outline-count
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
outline-count: 100.00%
pov-count
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
pov-count: 100.00%
Overall: 100.00%

Tool usage within Novelcrafter

Output messages related to tool usage within Novelcrafter.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot (Tooling, Utility) 100 100 100 100 100 100 100 100 100 100 100%
Overall: 100.00%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
0-shot (Utility) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Utility) 100 100 100 100 100 0 0 0 0 0 50%
1-shot (Utility) 100 100 100 100 100 0 0 0 0 0 50%
Few-shot (Utility) 100 100 100 100 100 100 100 0 0 0 70%
0-shot (Utility) 100 100 100 100 100 100 100 100 100 100 100%
Overall: 74.00%

Write N of X

Write exactly N words/sentences/paragraphs...

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
paragraphs
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
paragraphs: 100.00%
sentences
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
sentences: 100.00%
words
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
0-shot (Rule following) 100 100 100 100 100 100 100 100 100 100 100%
words: 100.00%
Overall: 100.00%
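
Counting paragraphs and sentences in a response can be sketched in a few lines. The rules used here are assumptions rather than the benchmark's own definitions: paragraphs are blocks separated by one or more blank lines, and sentences end at `.`, `!`, or `?` followed by whitespace, which will miscount abbreviations such as "e.g. this".

```python
import re


def count_paragraphs(text):
    """Count non-empty blocks separated by blank lines (assumed paragraph rule)."""
    blocks = re.split(r"\n\s*\n", text.strip())
    return sum(1 for block in blocks if block.strip())


def count_sentences(text):
    """Count chunks ending in '.', '!' or '?' followed by whitespace (naive rule)."""
    chunks = re.split(r"(?<=[.!?])\s+", text.strip())
    return sum(1 for chunk in chunks if chunk)
```

A check for "exactly N paragraphs" then reduces to comparing `count_paragraphs(response)` with N, and likewise for sentences.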