openai/gpt-5.4-nano

GPT-5.4 Nano (Reasoning, Low)

Release Date

Mar 17th, 2026

Context Size

400k

Reasoning

Yes

Benchmark Cost

$1.14

Speed

130.4 tok/s

Categories

20%40%60%80%100%Creative Writing80.9%Tooling89.8%Language81.9%Utility91.4%Reasoning78.9%Text Editing82.2%Rule Following31.6%Hallucination99.0%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
818177767277%
898887817885%
878684828184%
878483837382%
878682817883%
858480807981%
Detailed Writing Rules82.07%
genre
777574737274%
858377767379%
807979767678%
848181797781%
817979787478%
828281777780%
genre78.35%
Novelcrafter Default Prompt
807777777477%
858483807882%
868482818183%
878583818184%
838281808081%
838181818081%
Novelcrafter Default Prompt81.33%
80.58%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
918888878387%
908785838286%
838177776677%
928684848185%
83.77%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100787867676779%
100100100100100100100100%
9999999595959597%
9998989898989798%
1001001009696969698%
10099999999867494%
8381747371716975%
10010010010010010099100%
9695959191898992%
Generic Prompt92.51%
Specific Prompt
10089897867676779%
100100100100100100100100%
9898969695948695%
9594929191919192%
100100100100100100100100%
100100999999959598%
9493917878777784%
1001001009999999999%
10097979595929295%
Specific Prompt93.74%
93.12%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%