openai/gpt-5.5

GPT-5.5

Release Date: Apr 24th, 2026
Context Size: 1M
Reasoning: No
Benchmark Cost: $24.02
Speed: 50.4 tok/s

Categories

Creative Writing: 90.4%
Tooling: 100.0%
Language: 94.1%
Utility: 81.9%
Reasoning: 95.2%
Text Editing: 98.2%
Rule Following: 72.3%
Hallucination: 80.6%

Subcategories

[Chart: per-subcategory scores on a 20% to 100% axis. Subcategories shown: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption.]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
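
As an illustration of one of the checks named above, the following is a minimal sketch of a filter-word detector. The word list, the function name `filter_word_rate`, and the per-100-words rate are assumptions made for this example; the benchmark's actual detectors and the way they feed into the scores are not described on this page.

```python
import re

# Illustrative word list only; the benchmark's real list is not published here.
FILTER_WORDS = {"saw", "heard", "felt", "noticed", "realized", "seemed"}


def filter_word_rate(text: str) -> float:
    """Return filter-word hits per 100 words (hypothetical metric)."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for word in words if word in FILTER_WORDS)
    return 100.0 * hits / len(words)


sample = "She felt the cold and noticed the door was open."
print(filter_word_rate(sample))
```

A rate like this could be compared against a threshold per passage, but that step is likewise an assumption rather than something the page states.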

Scenario                        #1    #2    #3    #4    #5   Total
Detailed Writing Rules
                                91    90    90    89    87     90%
                                94    92    91    90    89     91%
                                89    88    88    88    87     88%
                                93    91    89    89    86     90%
                                92    90    89    88    88     89%
                                92    91    91    90    86     90%
Detailed Writing Rules                                      89.61%
genre
                                89    88    86    86    86     87%
                                90    87    87    86    82     86%
                                92    91    89    88    88     90%
                                95    90    90    90    88     91%
                                91    91    89    89    89     90%
                                94    91    90    90    89     91%
genre                                                       89.00%
Novelcrafter Default Prompt
                                91    89    87    87    87     88%
                                93    91    90    89    89     90%
                                89    89    86    85    85     87%
                                92    90    90    88    88     89%
                                91    90    90    89    88     90%
                                90    89    89    88    87     89%
Novelcrafter Default Prompt                                 88.87%
Overall                                                     89.16%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
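
A minimal sketch of what a well-formedness check on such output might look like, using Python's standard `xml.etree.ElementTree` parser. The element and attribute names (`codex`, `entry`, `type`, `name`) and the helper `parse_codex` are hypothetical; the schema the benchmark actually expects is not given on this page.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text: str):
    """Return (type, name) pairs for each entry, or None if the XML is not well-formed.

    The <codex>/<entry>/<name> layout is an assumed schema for illustration.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    return [(entry.get("type"), entry.findtext("name")) for entry in root.iter("entry")]


sample = """<codex>
  <entry type="character"><name>Mara</name></entry>
  <entry type="location"><name>Greywater</name></entry>
</codex>"""

print(parse_codex(sample))
print(parse_codex("<codex><entry></codex>"))  # mismatched tags
```

Well-formedness is only the first gate; a real scorer would presumably also compare the extracted entries against a reference set.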

Scenario                        #1    #2    #3    #4    #5   Total
                                97    97    96    96    96     97%
                                98    97    97    96    91     96%
                                99    99    98    98    96     98%
                                95    95    94    94    91     94%
Overall                                                     95.93%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
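
Scoring each expected change independently can be sketched as follows. This is an assumption-laden illustration, not the benchmark's scorer: the `ExpectedChange` type, the substring-based check, and the sample data are all invented for the example.

```python
from dataclasses import dataclass


@dataclass
class ExpectedChange:
    """One expected edit: `old` should be gone and `new` should be present."""
    old: str
    new: str


def score_replacements(output: str, changes: list[ExpectedChange]) -> float:
    """Return the percentage of expected changes that `output` satisfies.

    Each change is checked on its own, so one missed rename does not
    affect the credit given for the others.
    """
    if not changes:
        return 100.0
    passed = sum(1 for c in changes if c.new in output and c.old not in output)
    return 100.0 * passed / len(changes)


changes = [
    ExpectedChange(old="Alice", new="Elena"),   # character rename
    ExpectedChange(old="don't", new="do not"),  # contraction expansion
    ExpectedChange(old="Paris", new="Lyon"),    # location rename (missed below)
]
output = 'Elena said, "I do not want to leave Paris."'
print(score_replacements(output, changes))
```

In this sample two of the three expected changes are satisfied, so the output earns partial credit rather than failing outright.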

Scenario                        #1    #2    #3    #4    #5    #6    #7   Total
Generic Prompt
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100    99    99    99    100%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100   100    100%
                                97    97    96    96    96    96    94     96%
                               100   100   100   100   100   100   100    100%
                                97    97    97    97    96    96    96     96%
Generic Prompt                                                          99.12%
Specific Prompt
                               100   100   100   100   100   100    89     98%
                               100   100   100   100   100   100   100    100%
                                99    99    99    99    99    99    99     99%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100   100   100    100%
                                99    99    98    98    98    98    98     98%
                               100   100   100   100   100   100   100    100%
                               100   100   100   100   100    99    99    100%
Specific Prompt                                                         99.57%
Overall                                                                 99.35%

Tool usage within Novelcrafter

Evaluates the output messages related to tool usage within Novelcrafter.

Scenario                        #1    #2    #3    #4    #5    #6    #7    #8    #9   #10   Total
                               100   100   100   100   100   100   100   100   100   100    100%
Overall                                                                                  100.00%