NC Bench
Run: 2026-03-25-01KMJ81MJW521057NNFMXJZ516
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
8.5s
Cost:
$0.000363
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
60.65%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of cinnamon and espresso hung thick in the bakery’s golden light. “You sure you didn’t see her come in? Mrs. Peabody was supposed to pick up the blueberry scones an hour ago.” “She called last week, right? Said she’d left a deposit for the scones. Maybe she got stuck at the vet’s? Mr. Whiskers *is* a bit of a worrywart.” “I’m not talking about the cat. I’m talking about her granddaughter. Lila. Didn’t you hear? Her apartment’s been empty for days.” The register chimed. A man in a sheriff’s vest stepped inside, shaking rain from his shoulders. Both of them turned, the woman behind the counter adjusting her apron, the other—taller, in a wool coat—lifting a coffee cup to his lips as if it might steady him. “She’s not here,” the baker said, too quickly. The sheriff’s gaze lingered on the pie case. “Strained cranberry filling. She always made that for the fall festival. You’re using her recipe?” “She contributed it, yes. But I’ve got my own twist. Extra clove, maybe. A splash of lemon.” The taller man set the cup down. “That’s not what June Porter said. She said your batches taste… different. Less sweet.” The baker’s eyes flicked to the back door, where a smear of mud clung to the threshold. “June’s got a refined palate.Likes her desserts like her roads—sugared and straight.” A bell jingled as the sheriff reached for a croissant. “You’re not worried she might show up? With a claim that you—what was it?—'misappropriated’ her family’s signature recipe?” The woman slid a key across the counter, her smile tight as a dough twist. “If she does, point her to the journal. Third shelf, left side. We all need a little inspiration.” He caught the key, studied it. “Or maybe just point her to the security cameras. They’re pretty cozy around here… until someone knocks them offline.”
created_at
"2026-03-25T10:18:35.489Z"
3/25/2026, 10:18:35 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
389
moderation_latency
(null)
generation_time
8512
tokens_prompt
98
tokens_completion
1009
native_tokens_prompt
72
native_tokens_completion
854
native_tokens_completion_images
(null)
native_tokens_reasoning
407
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000362752
router
(null)
provider_responses
0
endpoint_id
"344ccdcc-6e59-464d-89ab-8e5b92fe2adc"
id
"chatcmpl-4e2e42de-3f31-9213-ac92-5c5fc9892b57"
is_byok
false
latency
389
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774433915-XsBpCuehmldmpJABHynr"
upstream_id
"chatcmpl-4e2e42de-3f31-9213-ac92-5c5fc9892b57"
total_cost
0.000362752
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
60.65%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"the baker said"
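The page does not show how the "Count dialogue tags" evaluator finds tags or how it turns the count into a score. As a rough illustration only, a detector of this kind could look for a speech verb directly after a closing quotation mark. The sketch below is an assumption, not NC Bench's implementation: the verb list, the pattern, and the function name `find_dialogue_tags` are all hypothetical.

```python
import re

# Hypothetical verb list; the real evaluator's vocabulary is not shown on this page.
SPEECH_VERBS = ("said", "asked", "replied", "whispered", "muttered", "shouted")

# A closing quote, optional whitespace, a short subject ("the baker", "she", "June"),
# then one of the speech verbs above.
TAG_AFTER_QUOTE = re.compile(
    r"[”\"]\s*((?:the\s+)?[A-Za-z]+\s+(?:" + "|".join(SPEECH_VERBS) + r"))\b",
    re.IGNORECASE,
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return every 'subject + speech verb' phrase that follows a closing quote."""
    return [match.group(1) for match in TAG_AFTER_QUOTE.finditer(text)]


flagged = find_dialogue_tags("“She’s not here,” the baker said, too quickly.")
unflagged = find_dialogue_tags("“Third shelf, left side.” He caught the key, studied it.")
print(flagged, unflagged)
```

On the two sample sentences taken from the response above, this flags "the baker said" and nothing in the second sentence. A pattern this simple misses tags that precede the quote and can misfire on straight quotes, where opening and closing marks are identical.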