google/gemma-4-31b-it

Gemma 4 31B

Release Date

Apr 3rd, 2026

Context Size

256k

Reasoning

No

Benchmark Cost

$0.37

Speed

24.1 tok/s

Categories

20%40%60%80%100%Creative Writing75.6%Tooling100.0%Language75.0%Utility86.7%Reasoning96.5%Text Editing98.6%Rule Following72.7%Hallucination90.2%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
757270686870%
817877767377%
828181797379%
807878727276%
847979777479%
837574737275%
Detailed Writing Rules76.16%
genre
706967676668%
797674706974%
747269686770%
747371696871%
747373736772%
797775716674%
genre71.31%
Novelcrafter Default Prompt
797675737375%
797777757376%
827978767578%
807978767377%
807877727176%
847979787880%
Novelcrafter Default Prompt76.95%
74.81%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
969696969696%
989897969697%
989898989898%
959594949494%
96.32%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9998989898969698%
100100100100100100100100%
9999999996969698%
Generic Prompt99.46%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9999989898989899%
100100100100100100100100%
100100100100100100100100%
Specific Prompt99.85%
99.66%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%