NC Bench
Run: 2025-04-16-01JRZZ01DY146RFM6XC6A39GJV
Run Details
Model:
o4 Mini High
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
42.8s
Cost:
$0.020164
Provider:
OpenAI
Evaluation summary
Score:
100.00%
Evaluator:
Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
AI
Chilly morning sun pressed against the windows of Daisy’s Delights, wreaths of steam drifting from every loaf. The hush was broken by a soft scuff of boots. “I swear I left them on the top shelf last night.” The scent of cinnamon rolls twisted in the air as another set of footsteps approached. “Top shelf where nobody ever looks.” Fingers brushed a chipped teacup. The bell above the door jingled again. “I needed that recipe for Friday’s fair.” A flicker of panic sharpened the glaze on her cheeks. Flour-dusted hands hovered by the mixing bowls. “Maybe someone borrowed it without asking?” She shook her head. “That recipe is my grandmother’s secret.” A sudden crash echoed from the back room. Both froze. “Probably just a stack of pans.” “A stack that belongs to me.” Dust motes danced in amber light. “If we don’t find it soon...” She offered a scone with trembling fingers. “Comfort carbs?” “Always.” He bit in, sweet crumbs at the corners of his mouth. His eyes were already scanning the corners of the bakery for clues. Her heart pounded beneath an apron spattered with flour and worry. Outside, a delivery van rattled by, faint music drifting through the door.
id
"gen-1744829417-6EfqysuJu9ogoDAhVdcA"
upstream_id
"chatcmpl-BN27txDHwcQzKyzcCXsvcuBIrsog4"
total_cost
0.0201641
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-16T18:51:00.045713+00:00"
model
"openai/o4-mini-high-2025-04-16"
app_id
182717
streamed
true
cancelled
false
latency
39787
moderation_latency
112
generation_time
42278
tokens_prompt
68
tokens_completion
4547
native_tokens_prompt
67
native_tokens_completion
4566
native_tokens_reasoning
4288
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0201641
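The reported total_cost can be reproduced from the native token counts above. This is a minimal sketch that assumes prices of $1.10 per million input tokens and $4.40 per million output tokens. Those prices are not stated on this page, and the sketch also assumes reasoning tokens are billed as part of native_tokens_completion. The function name estimate_cost is hypothetical.

```python
# Assumed prices in USD per million tokens; they do not appear in this run's metadata.
PRICE_IN_PER_M = 1.10
PRICE_OUT_PER_M = 4.40

# Values copied from the metadata fields of this run.
native_tokens_prompt = 67
native_tokens_completion = 4566
native_tokens_reasoning = 4288


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD under the assumed prices."""
    return (
        prompt_tokens * PRICE_IN_PER_M + completion_tokens * PRICE_OUT_PER_M
    ) / 1_000_000


cost = estimate_cost(native_tokens_prompt, native_tokens_completion)

# Completion tokens that are not reasoning tokens, i.e. the visible story text.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Fraction of the completion spent on reasoning.
reasoning_share = native_tokens_reasoning / native_tokens_completion

print(f"estimated cost: ${cost:.7f}")
print(f"visible completion tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
```

Under these assumptions the estimate agrees with the reported total_cost of 0.0201641. Roughly 94% of the completion tokens are reasoning tokens, which leaves 278 tokens for the scene of about 200 words.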
Evaluation details
Result:
100.00%
Evaluator:
Count dialogue tags
Details:
n/a
Meta Data:
neededClean
false
dialogueTags
(empty)
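The page does not show how the "Count dialogue tags" evaluator works. The following is only an illustrative sketch of one way to detect attributions such as "she said" next to quoted speech. The verb list, the regular expressions, and the function name find_dialogue_tags are all assumptions, not NC Bench's implementation. It handles curly quotes only, as used in the response above.

```python
import re

# Hypothetical, deliberately short list of speech verbs; a real evaluator
# would likely use a longer list or a different technique entirely.
SPEECH_VERBS = (
    "said", "asked", "replied", "whispered", "muttered",
    "shouted", "answered", "murmured", "called", "cried",
)
_VERBS = "|".join(SPEECH_VERBS)

# Tag right after a closing quote: ” she said / ” said Daisy / ” the baker asked
TAG_AFTER = re.compile(r"”\s*(?:\w+\s+){0,2}(?:" + _VERBS + r")\b")

# Tag right before an opening quote: He asked, “...
TAG_BEFORE = re.compile(r"\b(?:" + _VERBS + r")\b[^.!?“”]{0,20}[,:]\s*“")


def find_dialogue_tags(text: str) -> list[str]:
    """Return the matched attribution snippets; an empty list means none were found."""
    matches = list(TAG_AFTER.finditer(text)) + list(TAG_BEFORE.finditer(text))
    # Report matches in the order they occur in the text.
    matches.sort(key=lambda m: m.start())
    return [m.group(0) for m in matches]


# Excerpt from the response above: action beats, but no attribution.
untagged = "“Comfort carbs?” “Always.” He bit in, sweet crumbs at the corners of his mouth."

# Made-up counterexample with two attributions.
tagged = "“Top shelf,” she said. He asked, “Where nobody ever looks?”"

print(find_dialogue_tags(untagged))
print(len(find_dialogue_tags(tagged)))
```

An empty result would correspond to the empty dialogueTags field in the meta data. Action beats such as "He bit in" are not counted, because no speech verb follows the closing quote.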