google/gemma-3-12b-it

Gemma 3 12B

Release Date

Mar 12th, 2025

Context Size

128k

Reasoning

No

Benchmark Cost

$0.16

Speed

39.2 tok/s

Categories

Creative Writing: 75.4%
Tooling: 97.7%
Language: 80.1%
Utility: 79.3%
Reasoning: 79.4%
Text Editing: 85.2%
Rule Following: 61.0%
Hallucination: 69.2%

Subcategories

Subcategory chart (axis 20% to 100%): AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, clichés, AI-ism words/adverbs/names, and more.
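
As a rough illustration of how one such anti-pattern check might work, the sketch below measures the share of filter words in a passage. The word list and the function name `filter_word_rate` are assumptions made for this example; the benchmark's actual word lists and scoring rules are not given here.

```python
import re

# Hypothetical filter-word list, chosen only for illustration.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}

def filter_word_rate(text: str) -> float:
    """Return the fraction of words in `text` that appear in FILTER_WORDS."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for word in words if word in FILTER_WORDS)
    return hits / len(words)

sample = "She felt the cold. She noticed a light and realized it was dawn."
rate = filter_word_rate(sample)
print(f"filter-word rate: {rate:.2%}")
```

A real detector would likely combine several checks of this kind (passive voice, dialogue tags, clichés) and weight them; this sketch covers a single one.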

Scenario                       #1   #2   #3   #4   #5   Total
Detailed Writing Rules
                               77   76   76   75   70     75%
                               80   80   79   79   71     78%
                               81   80   77   76   74     78%
                               85   82   81   72   71     78%
                               81   79   77   75   74     77%
                               80   78   77   70   70     75%
Detailed Writing Rules                                  76.79%
genre
                               79   73   73   72   68     73%
                               81   81   80   80   69     78%
                               78   78   74   74   64     74%
                               81   81   80   76   75     79%
                               78   76   75   74   66     74%
                               80   78   75   75   75     77%
genre                                                   75.66%
Novelcrafter Default Prompt
                               79   75   74   73   67     74%
                               80   78   76   76   76     77%
                               78   75   72   72   72     74%
                               82   78   74   72   72     76%
                               76   75   75   74   67     74%
                               82   81   79   77   75     79%
Novelcrafter Default Prompt                             75.50%
                                                        75.98%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
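
A minimal sketch of the well-formedness part of such a check, using Python's standard `xml.etree.ElementTree`. The tag names (`codex`, `entry`, `name`) and the `type` attribute are hypothetical; the benchmark's real schema is not shown here.

```python
import xml.etree.ElementTree as ET

def parse_codex(xml_text: str):
    """Return (type, name) pairs, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    # Assumed layout: <entry type="..."><name>...</name></entry>
    return [(entry.get("type"), entry.findtext("name")) for entry in root.iter("entry")]

good = '<codex><entry type="character"><name>Mara</name></entry></codex>'
bad = '<codex><entry type="character"><name>Mara</entry></codex>'  # <name> never closed

print(parse_codex(good))
print(parse_codex(bad))
```

Malformed output fails outright here, which is one plausible reason a single run can score far below the others in the same scenario.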

Scenario   #1   #2   #3   #4   #5   Total
           90   89   89   89   81     88%
           88   87   86   85    3     70%
           85   84   83   83   82     84%
           77   71   70   69   63     70%
                                    77.70%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
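
Checking each expected change independently could look roughly like the sketch below, where every required or forbidden string counts as one check and the score is the fraction passed. The function name `score_replacements` and the plain substring matching are assumptions for illustration, not the benchmark's actual scorer.

```python
from typing import Sequence

def score_replacements(
    output: str,
    must_contain: Sequence[str],
    must_not_contain: Sequence[str],
) -> float:
    """Score each expected change on its own and return the fraction that passed."""
    checks = [expected in output for expected in must_contain]
    checks += [forbidden not in output for forbidden in must_not_contain]
    # With nothing to check, treat the output as fully correct.
    return sum(checks) / len(checks) if checks else 1.0

# Task: rename John to Elena and expand contractions.
output = "Elena did not go to Northport. John stayed home."
score = score_replacements(
    output,
    must_contain=["Elena", "did not"],
    must_not_contain=["John", "didn't"],
)
print(score)  # 0.75: one leftover "John" fails a single check
```

Scoring this way gives partial credit: a model that misses one rename out of many still earns most of the points for that run.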

Scenario           #1   #2   #3   #4   #5   #6   #7   Total
Generic Prompt
                  100  100  100  100  100  100  100    100%
                  100  100  100  100  100  100  100    100%
                   99   99   98   98   98   98   70     94%
                   99   98   98   98   98   98   98     98%
                  100  100  100  100  100  100  100    100%
                   74   74   74   74   74   74   74     74%
                   76   76   76   76   76   76   74     75%
                  100  100  100  100  100  100  100    100%
                   92   92   92   92   89   81   81     89%
Generic Prompt                                        92.19%
Specific Prompt
                  100  100  100  100  100  100   89     98%
                  100  100  100  100  100  100  100    100%
                   99   98   98   98   98   98   98     98%
                  100  100  100  100  100  100   99    100%
                  100  100  100  100  100  100  100    100%
                  100   89   89   89   89   86   86     90%
                   95   94   94   94   94   93   91     94%
                  100  100  100  100  100  100  100    100%
                  100  100  100  100  100  100  100    100%
Specific Prompt                                       97.76%
                                                      94.98%

Tool usage within Novelcrafter

Checks output messages related to tool usage within Novelcrafter.

Scenario   #1   #2   #3   #4   #5   #6   #7   #8   #9  #10   Total
          100  100  100  100  100  100  100  100  100  100    100%
                                                            100.00%