google/gemma-4-31b-it

Gemma 4 31B (Reasoning)

Release Date

Apr 3rd, 2026

Context Size

256k

Reasoning

Yes

Benchmark Cost

$1.10

Speed

29.5 tok/s

Categories

20%40%60%80%100%Creative Writing78.1%Tooling100.0%Language83.8%Utility96.3%Reasoning97.2%Text Editing98.8%Rule Following85.0%Hallucination94.4%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
818079777378%
827877767578%
838280797680%
808079787779%
868281818082%
797978787277%
Detailed Writing Rules78.98%
genre
787167646469%
767575757074%
757469656569%
747470707072%
737170686770%
827979767578%
genre72.05%
Novelcrafter Default Prompt
887676756776%
848080777679%
848382817781%
838178787579%
817876767477%
868382817882%
Novelcrafter Default Prompt79.18%
76.74%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
969696959495%
999796969697%
989797979797%
10010099989498%
96.86%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
10099999999999899%
100100100100100100100100%
10010010099999999100%
9898979796969597%
100100100100100100100100%
9999999999999999%
Generic Prompt99.35%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
1001001001001009999100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
10099999999989799%
100100100100100100100100%
10010010010010010099100%
Specific Prompt99.85%
99.60%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%