NC Bench
Overview
About
Tests
Models
Model
o4 Mini
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-04-16-01JRZZ4VQ3WBKR91EPGY8DQ7JZ
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
A narrow alley behind the birthday hall. Molly and Bill crouch in shadows, grinning. "Molly: Our target is the kiddie party at Sunnyvale Community Center." "Bill: Do they have security, or just sugar-fueled toddlers?" "Molly: Tune the rattle lasers, deploy the cake distraction timer." "Bill: Rattle lasers? You mean squeaky toys wired to confetti cannons?" "Molly: Exactly. When the piƱata explodes, our super-sack scoops gifts." "Bill: We also need stealth muffins to bribe cookie patrol." "Molly: Cookie patrol is equipped with high-pitched whistles. Mind that." "Bill: Understood. I've mastered toddler lingo. Goo goo ga ga?" "Molly: Perfect. In and out before 'Happy Birthday' hits second verse." "Bill: What about Maria's unicorn cake? That slice is prime intel." "Molly: That's our exit signal. Slice up, drop micro gifts, vanish." "Bill: I hate glitter. Do we have vacuum gloves?" "Molly: Vacuum gloves engage. No sparkle left in toddler trajectory." "Bill: Time check. Three minutes to party crash countdown." "Molly: Go, go! Remember: tiny footprints, maximum stealth, massive haul." "Bill: Mission ridiculous. Operation Present Storm is a go!" They melt into the night.
id
"gen-1744829575-SJoPnstfb7nhO3t08jxo"
upstream_id
"chatcmpl-BN2ARiXiOAts0A0Eh0VbLygmksot1"
total_cost
0.0256102
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-16T18:53:49.762665+00:00"
Wednesday, April 16th, 2025 at 6:53:49 PM GMT+00:00
model
"openai/o4-mini-2025-04-16"
app_id
182717
streamed
true
cancelled
false
latency
51798
moderation_latency
91
generation_time
54311
tokens_prompt
68
tokens_completion
5787
native_tokens_prompt
66
native_tokens_completion
5804
native_tokens_reasoning
5504
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0256102
Evaluation details
Result
Evaluator
Details
Meta Data
43.3783%
Matches word count
n/a
neededClean
false
words
183
99.9998%
Dialogue to Total Word Ratio
Ratio: 89.62%, Deviation: 0.38%
neededClean
false
wordsTotal
183
wordsDialogue
164
71.6891%