NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBH67S12CKDQ19Q0P03QYDD
Run Details
Model:
Qwen 3.5 122B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
33.1s
Cost:
$0.013804
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell jingled, cutting through the scent of cinnamon and yeast. Flour dusted the counter. "You didn't sleep again, did you?" "Someone has to watch the ovens." "It's three in the morning. Even the rats are asleep." "Not the ones in the alley. I heard the dumpster lid slam." "Probably just the wind." "Wind doesn't wear boots, Martha." A pause. The rolling pin thumped rhythmically against dough. "You think it was him?" "I think the mayor's prize poodle didn't vanish into thin air." "Oh, poor Barnaby." "Barnaby is fine. It's the man who took him that worries me. He asked about the safe deposit boxes." "Here?" "At the bank. But he was looking at *this* shop." "Why?" "Because I hid the key inside the wedding cake tin." "You what?" "It's the only place no one looks." "That tin is full of sourdough starters." "Exactly." "You're mad." "I'm prepared." The bell jingled again. Heavy footsteps approached the counter. "Two scones, please. And keep the change." "Coming right up." "Don't forget the jam." "Strawberry or blackberry?" "The one with the black seeds. The dark one." "Right away."
created_at
"2026-02-25T23:12:18.728Z"
2/25/2026, 11:12:18 PM
model
"qwen/qwen3.5-122b-a10b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
423
moderation_latency
(null)
generation_time
33038
tokens_prompt
81
tokens_completion
3754
native_tokens_prompt
78
native_tokens_completion
4304
native_tokens_completion_images
(null)
native_tokens_reasoning
4002
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.013804
router
(null)
provider_responses
0
endpoint_id
"33d466d1-d2f7-45ed-8ff3-fda7d2e32351"
id
"chatcmpl-8412e46f-4388-9dca-8d65-a34e95807f86"
is_byok
false
latency
423
model_permaslug
"qwen/qwen3.5-122b-a10b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772061138-nN1rVZjseNq7OfztqcM9"
upstream_id
"chatcmpl-8412e46f-4388-9dca-8d65-a34e95807f86"
total_cost
0.013804
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)