openai/gpt-5.2

GPT-5.2

Release Date

Dec 10th, 2025

Context Size

400k

Reasoning

No

Benchmark Cost

$13.99

Speed

53.4 tok/s

Categories

20%40%60%80%100%Creative Writing80.4%Tooling100.0%Language91.2%Utility96.2%Reasoning94.5%Text Editing97.5%Rule Following67.1%Hallucination95.1%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
818078787478%
878684787883%
858584838384%
878584817983%
838078787779%
858483817982%
Detailed Writing Rules81.59%
genre
797877767477%
807978777778%
807979757578%
848079787880%
797877767577%
868180797380%
genre78.12%
Novelcrafter Default Prompt
797873727175%
838281787880%
848379777580%
838281818182%
838281817781%
848079777779%
Novelcrafter Default Prompt79.43%
79.71%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
979696959395%
979695959395%
989795959496%
949388888590%
94.13%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999693909095%
100100100100100100100100%
100100100100100100100100%
10010010010099997496%
9595959493939194%
100100100100100100100100%
8989898989898989%
Generic Prompt97.08%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999999%
100100100100100100100100%
100100100100100100100100%
1001001009797979598%
10099989797959597%
100100100100100100100100%
1001001001001009999100%
Specific Prompt99.38%
98.23%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%