NC Bench
Model
o4 Mini
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-04-16-01JRZZ8FEJ5WTK2MB8N5P3665A
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly peers around the corner of the living room, Bill double-checks the blueprints drawn on a napkin. Molly: "Are you sure the clown distraction works?" Bill: "Absolutely. I hired Joey the Joker to juggle under-flavored cupcakes. Kids will flock." Molly: "And the sugar comas will clear the room?" Bill: "Precisely. Which gives us ten minutes to nab every wrapped surprise and slip out." Molly: "What about the one with unicorn wrapping?" Bill: "Highest priority. It's sparkling." Molly: "Alright. We split: I grab the central table, you cover the side hamper." Bill: "Got it. And the exit route?" Molly: "Through the doggy door. Tiny enough for crates." Bill: "Will Santa’s prop interfere?" Molly: "If he complains, tell him his gig’s up." Bill: "Smart. Pajama party upstairs?" Molly: "Perfect cover. We’ll blend in after looting." Bill: "I’ll need a mini forklift." Molly: "I brought duct tape and a squeaky rubber chicken." Bill: "Essential tools. Let’s stealth." Molly: "Sneak in. Steal gifts. Sneak out." Bill: "Mischief managed." Molly: "Don't forget the glitter bombs I stashed by the piñata." Bill: "Good call. A shimmer diversion at midnight." Molly: "And the confetti cannons?" Bill: "Armed. We go loud or go home." Molly: "Bartender is a fake Santa, right?" Bill: "Sure, he owes me twenty bucks." Molly: "Kids will think Santa owes them extra toys." Bill: "Perfect alibi for empty boxes."
id
"gen-1744829693-Re3atOfiBzbZ0laa4ues"
upstream_id
"chatcmpl-BN2CLfAxkM3z4WXSE1pHRSyYnVltO"
total_cost
0.0145618
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-16T18:55:17.432936+00:00"
Wednesday, April 16th, 2025 at 6:55:17 PM GMT+00:00
model
"openai/o4-mini-2025-04-16"
app_id
182717
streamed
true
cancelled
false
latency
20552
moderation_latency
112
generation_time
23324
tokens_prompt
68
tokens_completion
3276
native_tokens_prompt
66
native_tokens_completion
3293
native_tokens_reasoning
2880
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0145618
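The token counts and cost above can be cross-checked with a little arithmetic. The sketch below is an illustration only: the per-million-token prices are assumptions that do not appear in this record, and it also assumes that native_tokens_reasoning is a subset of native_tokens_completion. Under those assumptions the estimate agrees with the recorded total_cost.

```python
# Assumed USD prices per one million tokens. These values are NOT part of
# the run record; they are placeholders chosen for this sketch.
ASSUMED_INPUT_PRICE_PER_M = 1.10
ASSUMED_OUTPUT_PRICE_PER_M = 4.40


def estimate_cost(prompt_tokens, completion_tokens,
                  in_price=ASSUMED_INPUT_PRICE_PER_M,
                  out_price=ASSUMED_OUTPUT_PRICE_PER_M):
    """Estimate the cost of a run in USD from token counts and assumed prices."""
    return (prompt_tokens * in_price + completion_tokens * out_price) / 1_000_000


# Values copied from the metadata above.
native_prompt = 66
native_completion = 3293
native_reasoning = 2880

cost = estimate_cost(native_prompt, native_completion)

# Assumption: reasoning tokens are counted inside the completion total,
# so the remainder is the visible response.
visible_tokens = native_completion - native_reasoning
reasoning_share = native_reasoning / native_completion

print(f"estimated cost (USD): {cost:.7f}")
print(f"visible completion tokens: {visible_tokens}")
print(f"reasoning share of completion: {reasoning_share:.1%}")
```

If the subset assumption holds, most of the completion tokens were spent on reasoning rather than on the roughly 200-word scene itself.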
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.4920% | Matches word count | n/a | neededClean: false; words: 227 |
| 46.0611% | Dialogue to Total Word Ratio | Ratio: 80.62%, Deviation: 9.38% | neededClean: false; wordsTotal: 227; wordsDialogue: 183 |
23.2765%
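The ratio and deviation reported for the dialogue evaluator follow directly from the word counts in its metadata. The sketch below recomputes them from wordsDialogue and wordsTotal against the 90% target in the scenario. The function name is hypothetical, and this is not the benchmark's own code: how NC Bench counts dialogue words and how it converts the deviation into a percentage score are not shown in this record. The final figure on the page, 23.2765%, is unlabeled; it is consistent with the plain mean of the two evaluator results, which the last lines check.

```python
def ratio_and_deviation(words_dialogue, words_total, target_pct=90.0):
    """Return (dialogue ratio in %, absolute deviation from target in points).

    Hypothetical helper for illustration; not NC Bench's implementation.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = 100.0 * words_dialogue / words_total
    deviation = abs(target_pct - ratio)
    return round(ratio, 2), round(deviation, 2)


# Counts copied from the evaluator metadata above.
ratio, deviation = ratio_and_deviation(183, 227)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")

# Mean of the two evaluator results listed in the table.
evaluator_results = [0.4920, 46.0611]
mean_result = sum(evaluator_results) / len(evaluator_results)
print(f"Mean of evaluator results: {mean_result:.5f}%")
```

With 183 dialogue words out of 227, the response falls about nine points short of the requested 90% dialogue share, which matches the Details column.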