openai/gpt-5.5

GPT-5.5 (Reasoning)

Release Date: Apr 24th, 2026
Context Size: 1m
Reasoning: Yes
Benchmark Cost: $36.71
Speed: 52.1 tok/s

Categories

Creative Writing: 90.3%
Tooling: 100.0%
Language: 99.7%
Utility: 96.6%
Reasoning: 94.9%
Text Editing: 98.8%
Rule Following: 79.4%
Hallucination: 84.2%

Subcategories

(Chart; per-subcategory values not preserved.) AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption

Bad Writing Habits

Detects common prose-quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, clichés, AI-ism words, adverbs, and names, and more.

Scenario                       #1   #2   #3   #4   #5   Total
Detailed Writing Rules
                               89   89   88   87   85   88%
                               94   93   92   91   89   92%
                               91   88   87   86   85   87%
                               90   90   89   88   87   89%
                               90   89   89   88   87   89%
                               91   91   90   90   89   90%
Detailed Writing Rules                                  89.06%
genre
                               91   91   90   88   85   89%
                               91   89   88   85   85   88%
                               90   90   89   86   86   88%
                               91   91   89   89   88   90%
                               91   90   89   88   88   89%
                               91   90   90   89   88   90%
genre                                                   88.95%
Novelcrafter Default Prompt
                               89   88   88   88   88   88%
                               93   92   91   91   89   91%
                               89   88   87   87   86   88%
                               89   88   87   87   87   88%
                               90   89   89   88   86   88%
                               91   90   89   89   89   90%
Novelcrafter Default Prompt                             88.82%
                                                        88.94%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
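
As a rough illustration of what "well-formed XML" means in practice, the sketch below parses a model response and lists the entries it contains. The `<codex>`/`<entry>`/`<name>` schema, the `type` attribute, and the function name `parse_codex` are assumptions made for this example; the benchmark's actual output format and scoring are not described here.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text):
    """Return (type, name) pairs, or None if the XML is not well-formed.

    The <codex>/<entry>/<name> layout is a hypothetical schema for illustration.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Unclosed or mismatched tags make the whole response unusable.
        return None
    return [(entry.get("type"), entry.findtext("name")) for entry in root.iter("entry")]


sample = (
    "<codex>"
    '<entry type="character"><name>Mira</name></entry>'
    '<entry type="location"><name>Ashfall Keep</name></entry>'
    "</codex>"
)
entries = parse_codex(sample)
broken = parse_codex("<codex><entry>")  # missing closing tags
```

Here `entries` holds the two extracted pairs and `broken` is `None`, so a grader could reject malformed responses before comparing their content.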

Scenario   #1   #2   #3   #4   #5   Total
           97   97   97   97   96   97%
           98   95   95   94   93   95%
           98   96   96   95   95   96%
           95   94   94   94   91   94%
                                    95.38%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
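
One way to picture "checking each expected change independently" is the minimal sketch below: every expected edit is tested on its own, and the score is the share of edits that pass. The `ExpectedChange` class, the substring-based check, and the sample data are hypothetical; the benchmark's real checker may work differently.

```python
from dataclasses import dataclass


@dataclass
class ExpectedChange:
    """One expected edit (hypothetical schema): `old` must be gone, `new` must appear."""
    old: str
    new: str


def score_replacement(output, changes):
    """Return the percentage of expected changes satisfied, each checked on its own."""
    if not changes:
        return 100.0
    passed = sum(1 for c in changes if c.new in output and c.old not in output)
    return 100.0 * passed / len(changes)


changes = [
    ExpectedChange(old="Alice", new="Elena"),      # character rename
    ExpectedChange(old="didn't", new="did not"),   # contraction expansion
    ExpectedChange(old="Paris", new="Lyon"),       # location rename (missed below)
    ExpectedChange(old="can't", new="cannot"),     # contraction expansion
]
output = "Elena did not want to leave Paris, but she cannot stay."
score = score_replacement(output, changes)
print(score)  # 75.0
```

Because each change is scored separately, a single missed rename lowers the score without zeroing out the changes that were applied correctly.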

Scenario           #1    #2    #3    #4    #5    #6    #7    Total
Generic Prompt
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   99    99    99    97    96    96    98%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   96    99%
                   100   100   100   100   100   100   99    100%
                   98    98    97    97    97    96    96    97%
                   100   100   100   100   100   100   100   100%
                   93    93    93    93    93    89    81    91%
Generic Prompt                                               98.31%
Specific Prompt
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   99    99    99    99    100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100   100   100   100%
                   100   99    99    99    99    99    98    99%
                   100   100   100   100   100   100   100   100%
                   100   100   99    99    99    99    99    100%
Specific Prompt                                              99.83%
                                                             99.07%

Tool usage within Novelcrafter

Evaluates output messages related to tool usage within Novelcrafter.

Scenario   #1    #2    #3    #4    #5    #6    #7    #8    #9    #10   Total
           100   100   100   100   100   100   100   100   100   100   100%
                                                                       100.00%