NC Bench
Overview
About
Tests
Models
Model
GPT-4.1 Nano
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-04-14-01JRTSWVJ0KRBRZ0VM0HRQEDZP
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly: "Alright, Bill, you sure about this? Stealing from a bunch of kids’ birthday presents? Seems kinda...mean." Bill: "Come on, Molly! Think about the loot. Bicycles, dolls, game consoles—this party's got it all. It’s like a treasure chest." Molly: "Yeah, but what if we get caught? Mrs. Baker’s sharp, I’ve seen her catch kids for less." Bill: "That’s why I’ve got the perfect plan. You distract her at the door with some lame balloon animal, I sneak in through the backyard." Molly: "Balloon animal? You want me to whip up a clown decoration? I’m terrible at that." Bill: "Just fake it. 'Hey, look at this ridiculous dog I made!'" Molly: "And then I throw her off while you bag all the presents?" Bill: "Exactly. You’re the distraction, I’m the quarterback—the sneaky one." Molly: "What about the security cameras? Won’t they catch us in action?" Bill: "I’ve got a shiny mirror we can angle to reflect the camera’s view. Works every time." Molly: "And what do we do with all these presents afterward? We can't just walk out with a mountain of toys." Bill: "Dump them in your car, and then—bam! Instant fortune." Molly: "Alright, alright. Let’s do this. But if we get caught, I’m blaming you." Bill: "Deal. Now, let’s make this the greatest heist ever."
id
"gen-1744656298-6iDzAKOPZKYht7Jy4CrA"
upstream_id
"chatcmpl-BMJ5et1GuVTtBo4lyolosBPZhagKP"
total_cost
0.0001387
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-14T18:45:03.059095+00:00"
Monday, April 14th, 2025 at 6:45:03 PM GMT+00:00
model
"openai/gpt-4.1-nano-2025-04-14"
app_id
182717
streamed
true
cancelled
false
latency
300
moderation_latency
149
generation_time
3828
tokens_prompt
68
tokens_completion
330
native_tokens_prompt
67
native_tokens_completion
330
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001387
Evaluation details
Result
Evaluator
Details
Meta Data
27.1658%
Matches word count
n/a
neededClean
false
words
219
98.3210%
Dialogue to Total Word Ratio
Ratio: 93.61%, Deviation: 3.61%
neededClean
false
wordsTotal
219
wordsDialogue
205
62.7434%