NC Bench
Overview
About
Tests
Models
Model
GPT-4.1 Mini
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-04-14-01JRTSV3AV4AFQZ96QW7N9E9QS
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Alright, Molly, let me get this straight—our grand heist is… a kids’ birthday party?” “Precisely, Bill. The birthday bash of the century. Sugar-high chaos, distracted parents, and mountain-loads of presents.” “You do realize the only thing I’ve ever stolen was Grandma’s fruitcake, right? And even that left me with a papercut.” “Relax. This isn’t about finesse. It’s about speed and blending in. Think about it—balloons, cake, noise, sticky fingers everywhere.” “Sticky fingers? That sounds like a warning, not an advantage.” “Bill, think bigger! While everyone’s busy singing ‘Happy Birthday,’ we’ll swipe every toy, game, and action figure faster than you can say ‘piñata.’” “But how do we get in? Last time I tried sneaking into a party, I tripped over my own foot and face-planted into Aunt Linda’s fruit punch.” “Simple. We dress as clowns. No one questions clowns at kids’ parties.” “Clowns? Molly, last time you were a clown, you scared the neighbor’s kid for three days.” “Details, Bill. Details. This time, I’m the sweet, balloon-twisting kind. You’re the silent, mysterious mime.” “A mime? I barely know how to talk, now you want me to not talk at all? That’s sabotage.” “Exactly! You’re perfect. Now, tie your shoes tighter. We’ve got presents to pilfer and a party to crash.”
id
"gen-1744656240-4Xa5WxCkkTER4sF9aXZ1"
upstream_id
"chatcmpl-BMJ4iURz7eFxGYLouCqmxE0kdClse"
total_cost
0.0005388
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-14T18:44:03.681935+00:00"
Monday, April 14th, 2025 at 6:44:03 PM GMT+00:00
model
"openai/gpt-4.1-mini-2025-04-14"
app_id
182717
streamed
true
cancelled
false
latency
421
moderation_latency
117
generation_time
1674
tokens_prompt
68
tokens_completion
319
native_tokens_prompt
67
native_tokens_completion
320
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0005388
Evaluation details
Result
Evaluator
Details
Meta Data
51.9255%
Matches word count
n/a
neededClean
false
words
216
36.7879%
Dialogue to Total Word Ratio
Ratio: 100.00%, Deviation: 10.00%
neededClean
false
wordsTotal
216
wordsDialogue
216
44.3567%