openai/gpt-5-nano

GPT-5 Nano

Release Date

Aug 7th, 2025

Context Size

400k

Reasoning

Yes

Benchmark Cost

$2.06

Speed

98.7 tok/s

Categories

20%40%60%80%100%Creative Writing67.0%Tooling99.2%Language77.2%Utility93.9%Reasoning89.6%Text Editing82.7%Rule Following57.6%Hallucination93.5%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
696565646365%
726766656266%
777170696771%
686666666566%
787371696772%
707070686769%
Detailed Writing Rules68.31%
genre
716563635964%
726666656066%
706766656366%
716766656467%
726866656467%
686868666166%
genre66.03%
Novelcrafter Default Prompt
646464625962%
646262616062%
838179716676%
747069676469%
787269686871%
777471706772%
Novelcrafter Default Prompt68.67%
67.67%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
959493878590%
969695949495%
939291878790%
959289878489%
91.23%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
9695939090908992%
100100100100999999100%
100100100100100100100100%
100100999775757389%
7776757166636270%
100100100100100999699%
9389898989858288%
Generic Prompt93.04%
Specific Prompt
100100100100100100100100%
100100100100100100100100%
10099999998979698%
100100100100999998100%
100100100100100100100100%
10099999595959597%
8989838280726780%
100100100100100989899%
10010010010096969698%
Specific Prompt97.01%
95.03%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100100100%
100.00%