NC Bench
Overview
About
Tests
Models
Model
GPT-4.1 Nano
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2025-04-14-01JRTST5AK30HQZFP5S88CRWBP
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly grinned, twirling a pencil in her fingers. “Bill, this is perfect. A kids' party, all those shiny presents — they won’t know what hit ’em.” Bill squinted at the map drawn on a napkin. “You sure about this, Molly? It’s risky. Kids are loud, slippery floors, and their parents watch like hawks.” She chuckled. “That’s what makes it fun. Besides, look at the setup — outside the backyard, slide plus bounce house. We slip in, grab the loot, slip out.” He crossed his arms skeptically. “And all the presents lined up inside? No alarms?” Molly waved her hand dismissively. “Please. It’s a kid’s party. Who’s gonna suspect two sneaky adults? And I’ve got a fake invitation. ‘Kid’s’ name, of course.” Bill looked nervous. “But what if we get caught? Maybe just a little peek — see what’s inside?” Molly rolled her eyes. “No time for peeks, Bill. We’re taking everything tonight. These kids can’t be happy with just cake and balloons — they need our gifts!” Bill nodded reluctantly. “Alright, let’s do it. But if I get caught—” “You’ll blame me,” Molly interrupted, grinning. “And I’ll tell them it was all your idea.”
id
"gen-1744656209-wUQKaGuBvHwSoEQn4bXz"
upstream_id
"chatcmpl-BMJ4DsAkTSdueZzWxXF4oLHKDeaRF"
total_cost
0.0001203
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-14T18:43:31.595161+00:00"
Monday, April 14th, 2025 at 6:43:31 PM GMT+00:00
model
"openai/gpt-4.1-nano-2025-04-14"
app_id
182717
streamed
true
cancelled
false
latency
248
moderation_latency
143
generation_time
718
tokens_prompt
68
tokens_completion
283
native_tokens_prompt
67
native_tokens_completion
284
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001203
Evaluation details
Result
Evaluator
Details
Meta Data
90.4837%
Matches word count
n/a
neededClean
false
words
190
0.0000%
Dialogue to Total Word Ratio
Ratio: 77.89%, Deviation: 27.89%
neededClean
false
wordsTotal
190
wordsDialogue
148
45.2419%