openai/gpt-4.1-nano

GPT-4.1 Nano

Release Date

Apr 14th, 2025

Parameters

Context Size

1m

Creative writing

30.15%

Rule following

64.20%

Utility

60.50%

Mathematics

100.00%

Tooling

47.69%

Language

77.34%

Logic

86.88%

Dialogue tags

Various tasks related to dialogue tags in text.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Creative writingRule following
100%100%100%100%100%61%14%14%14%1%60%
0-shot Creative writingRule following
50%50%49%49%48%45%43%34%26%22%42%
0-shot Creative writingRule following
50%50%50%50%49%49%47%45%30%1%42%
0-shot Creative writingRule following
100%99%96%92%66%63%41%18%14%2%59%
0-shot Creative writingRule following
0%0%0%0%0%0%0%0%0%0%0%
0-shot Creative writingRule following
11%0%0%0%0%0%0%0%0%0%1%
0-shot Creative writingRule following
24%15%14%13%1%1%0%0%0%0%7%
30.15%

Language Comprehension

Does the model understand more than just English?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%100%100%100%100%100%
0-shot Language
100%0%0%0%0%20%
0-shot Language
100%100%0%0%0%40%
65.00%

Language Writing

Can the model generate text in different languages?

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Total
0-shot Language
100%100%100%100%86%97%
0-shot Language
100%100%90%88%64%88%
0-shot Language
100%100%91%67%50%82%
0-shot Language
100%89%86%71%50%79%
0-shot Language
100%100%100%100%50%90%
87.22%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot ToolingUtility
100%100%100%100%100%100%100%100%67%33%90%
90.00%

N-Length Sentences

Write sentences with exactly N words

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Rule following
100%100%100%100%100%98%97%96%91%89%97%
0-shot Rule following
83%83%82%80%79%77%72%68%66%55%74%
0-shot Rule following
76%76%76%75%64%59%59%57%30%26%60%
77.14%

Voice/dialogue sheets

Extract dialogue from given text as voice sheets.

Scenario Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Run 8 Run 9 Run 10 Total
0-shot Utility
0%0%0%0%0%0%0%0%0%0%0%
1-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
Few-shot Utility
100%100%100%100%100%100%100%100%100%100%100%
0-shot Utility
0%0%0%0%0%0%0%0%0%0%0%
0-shot Utility
0%0%0%0%0%0%0%0%0%0%0%
40.00%