openai/gpt-5.5

GPT-5.5 (Reasoning, Low)

Release Date

Apr 24th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$28.51

Speed

51.2 tok/s

Categories

20%40%60%80%100%Creative Writing90.2%Tooling100.0%Language99.2%Utility96.4%Reasoning95.0%Text Editing98.6%Rule Following76.9%Hallucination84.4%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
898989878788%
949291898891%
908988888688%
949490858389%
898989898688%
929289898890%
Detailed Writing Rules89.13%
genre
939292888790%
919189878689%
908989898889%
939290898890%
919090908990%
929089898990%
genre89.65%
Novelcrafter Default Prompt
938989888689%
939191908991%
898787868587%
898888868387%
949089878589%
919190898790%
Novelcrafter Default Prompt88.65%
89.14%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
979796969696%
989797979597%
989898979597%
949494929193%
95.84%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9999999999999799%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9898979797969697%
100100100100100100100100%
9797969693938994%
Generic Prompt98.90%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
1001001001001009999100%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9999999898989899%
100100100100100100100100%
1001001001001009999100%
Specific Prompt99.80%
99.35%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%