NC Bench
Run: 2025-04-16-01JRZZG4JM1DD8P07NC0M3SP3C
Run Details
Model: o4 Mini High
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 1m 30s
Cost: $0.032096
Provider: OpenAI
Evaluation summary
Score | Evaluator
100.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
50.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Night draped the suburban cul-de-sac in blue gloom as Molly and Bill crouched behind a thick hedge. A faint muffled giggle floated from a window where kids had gathered for sugar and laughter. Bill adjusted his black gloves, eyeing a pacifier and a tower of wrapped surprises stacked near the door. Molly smirked and tapped a nearby sprinkler. Tension cracked like frozen candy. Festive balloons bobbed above a picket fence barrier that seemed more festive than formidable. Their hearts pounded like pinatas ready to burst. A stray balloon bobbed gently beyond reach. Somewhere inside a pinwheel twirled against plastic streamers. Molly: You see that glowing cake? We grab it and run when confetti pops. Bill: Confetti? That’s a distraction we’ll use for cover. Timing is key. Molly: Hurry up, butterfingers, the clown is rigging the pinata. Bill: I’ve got grappling hook and tutu. Nothing screams stealth like flair. Molly: Presents glitter under the tablecloth. We need a bigger sack. Bill: Do kids scream or cry if their gifts disappear? Molly: We distract them with balloons filled with confetti and rubber spiders. Bill: Then we vanish into the night with armfuls of wrapping paper. Bill: Where do we hide pinata candy?
id: "gen-1744829944-CQg1z34CezpxNjNStISD"
upstream_id: "chatcmpl-BN2GOUINkb9ECW68nmJXW9ON8Oi97"
total_cost: 0.0320958
cache_discount: (null)
provider_name: "OpenAI"
created_at: "2025-04-16T19:00:34.822084+00:00" (4/16/2025, 7:00:34 PM)
model: "openai/o4-mini-high-2025-04-16"
app_id: 182717
streamed: true
cancelled: false
latency: 87085
moderation_latency: 110
generation_time: 89698
tokens_prompt: 68
tokens_completion: 7259
native_tokens_prompt: 66
native_tokens_completion: 7278
native_tokens_reasoning: 6976
num_media_prompt: (null)
num_media_completion: (null)
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0320958
Evaluation details
Result | Evaluator | Details | Meta Data
100.00% | Matches word count | n/a | neededClean: false, words: 200
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 50.00% | neededClean: false, wordsTotal: 202, wordsDialogue: 0
50.0000% | Overall
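The page does not say how the evaluator identifies dialogue. One plausible reading of the 0 dialogue words is that it counts only words inside quotation marks, while the response uses a script-style "Molly: ..." format with no quotes. The sketch below illustrates that reading. The function names, the quote-based rule, the 50% target, and the averaging of the two scores into the overall figure are all assumptions made for illustration, not NC Bench's actual implementation.

```python
import re

# Assumption: dialogue is any text between straight or curly double quotes.
QUOTED = re.compile(r'["\u201c]([^"\u201d]*)["\u201d]')


def count_words(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def dialogue_ratio(text: str) -> float:
    """Return the fraction of words that fall inside quoted spans."""
    total = count_words(text)
    if total == 0:
        return 0.0
    dialogue = sum(count_words(span) for span in QUOTED.findall(text))
    return dialogue / total


def deviation(ratio: float, target: float = 0.5) -> float:
    """Absolute distance from the target dialogue ratio."""
    return abs(ratio - target)


# Script-style lines (as in the response above) contain no quoted spans.
script_style = "Molly: Hurry up, butterfingers. Bill: Timing is key."
# The same lines written with quotation marks and dialogue tags.
quoted_style = '"Hurry up," Molly said. "Timing is key," Bill replied.'

print(dialogue_ratio(script_style))             # 0.0
print(deviation(dialogue_ratio(script_style)))  # 0.5
print(dialogue_ratio(quoted_style))             # 5 of 9 words are quoted

# Assumption: the overall score is the plain mean of the evaluator scores.
scores = [1.0, 0.0]
print(sum(scores) / len(scores))                # 0.5
```

With a quote-based rule like this, a script-style response scores a ratio of 0.00% and a deviation of 50.00%, matching the figures reported above, even though most of its second half reads as spoken lines.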