openai/gpt-4.1-nano

GPT-4.1 Nano

Release Date

Apr 14th, 2025

Context Size

1m

Reasoning

No

Benchmark Cost

$0.25

Speed

202.2 tok/s

Categories

20%40%60%80%100%Creative Writing71.8%Tooling81.4%Language79.0%Utility68.4%Reasoning70.2%Text Editing76.1%Rule Following40.9%Hallucination87.7%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
756868676368%
797373696772%
777371706972%
757474716672%
747070676569%
828178777378%
Detailed Writing Rules72.02%
genre
706766656166%
757271686771%
717067676468%
757474716872%
706966666467%
878176706876%
genre69.96%
Novelcrafter Default Prompt
767474726171%
827474746774%
827473726974%
817776757477%
767674736873%
828180787479%
Novelcrafter Default Prompt74.79%
72.26%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
797676757276%
838079797980%
747471704968%
908177726978%
75.33%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
9794928381815884%
100100100100100100100100%
9998989898979698%
9898989797979797%
100100100100100968898%
6351515151515052%
8481787366646072%
10010010010097969598%
9796969391898693%
Generic Prompt87.96%
Specific Prompt
8978786767676773%
100100100100100100100100%
9693868685858588%
9999999998989899%
9696969696969696%
10010010010095957495%
8480767675727276%
10010010010010010097100%
9393898989868389%
Specific Prompt90.62%
89.29%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100673390%
90.00%