NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8S2R4W5KEBSKZQ57YZXJZ
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
15.9s
Cost:
$0.000240
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
13.53%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimmed as the afternoon sun spilled honey over countertops. "Didn’t expect to see you here,"
someone said
, sliding into a corner booth. A tray of scones sat between them, still steaming. "Couldn’t stay away from your famous lemon tarts,"
another voice replied
, though their gaze lingered on the sheriff’s badge glinting on the first speaker’s belt. "Or your trouble." A laugh, soft as meringue. "You think I’d risk baking disasters after what happened at the fair?" "Three burnt pies and a fire alarm later, yeah. But here we are." The badge-wearer tapped a finger against the ceramic mug. "Still, someone’s got to keep the town entertained." The baker stirred a dollop of cream into their coffee, avoiding the stare. "You here about the missing strawberries?" "The ones deliveries keep vanishing? Could be a coyote." A pause. "Or someone with a sweet tooth." "Please. My coyotes have better manners. They’d nap by the compost pile, not steal from the shelves." A hand gestured to the empty display case. "Whoever’s doing this knows the schedule. Knows the back door." Another silence, broken by the clink of sugar packets. "Fingerprints?" "Barely any flour left to track." The baker’s smile was tight. "But there’s a note. Hidden in the pastry drawer. You want it, you’ll need to check the blueberry cobbler." The badge clinked as it was picked up. "You’re sure about this?" "Positive. It’s got your name on it." A shrug. "Literally." The wind rustled the lace curtains. Somewhere, a cake pan rattled in the oven.
created_at
"2026-03-25T10:31:23.657Z"
3/25/2026, 10:31:23 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
127
moderation_latency
(null)
generation_time
15838
tokens_prompt
98
tokens_completion
945
native_tokens_prompt
73
native_tokens_completion
836
native_tokens_completion_images
(null)
native_tokens_reasoning
536
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00023992
router
(null)
provider_responses
0
endpoint_id
"6b8c829d-3094-45e7-8139-0a67e09060c3"
id
"cmpl-5a776a1b86cc4fb2a566b540f9c8e8bb"
is_byok
false
latency
63
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434683-NwvHP5WgUwzdOCervUNC"
upstream_id
"cmpl-5a776a1b86cc4fb2a566b540f9c8e8bb"
total_cost
0.00023992
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
13.53%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"someone said"
1
"another voice replied"