NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ7Y7A5B877J6PAK77B03FE
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
36.7s
Cost:
$0.000514
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of cinnamon and toasted almonds hung heavy in the air as the bell above the door jingled. “You’ve used the last of the vanilla extract again, haven’t you?” A spatula tapped against a mixing bowl. “The last bottle vanished mid-morning. I checked under the counter, behind the sugar—” “It’s not like Mrs. Peabody to forget her own stock. Did you ask the new delivery guy about it?” “Harold? He smirked and said the route was ‘optimized.’ As if that explains why my favorite vanilla disappeared.” The other paused, stirring a latte foam until it formed peaks. “You think someone’s trying to throw off the competition for the fair?” “ Someone’s trying to sabotage me. The blue ribbon for Apple Stack Cake? They’ve got a grudge. Or a sweet tooth they can’t hide.” A tray of scones rattled on the counter. “You’re sure it’s not just misplaced? The other day I found a jar of pickled herring tucked behind the loaves of rye—” “Sue, I make *pastries*. Herring’s a different kind of mystery.” “True. But this? I saw the new family from the city picking over your cinnamon bins last week. The daughter had that look—like she was plotting something with a measuring cup.” The voice with the latte leaned closer. “You think they’re drugging the desserts?” “Or replacing them. Imagine: my famous recipe, ruined by outsiders who think ‘rustic’ means ‘disorganized.’” A faint clink of milk bottles outside. “Let’s not jump to conclusions. Maybe Harold just forgot to restock. He did drop off a mystery box at the back door yesterday.” “You’re not the only one with questions, are you?” The bell jingled again as footsteps retreated. “Keep an eye on the vanilla. And the door. Some secrets have a way of walking in on their own.”
created_at
"2026-03-25T10:16:43.594Z"
3/25/2026, 10:16:43 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5163
moderation_latency
(null)
generation_time
36678
tokens_prompt
98
tokens_completion
1081
native_tokens_prompt
72
native_tokens_completion
884
native_tokens_completion_images
(null)
native_tokens_reasoning
462
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00051396
router
(null)
provider_responses
0
endpoint_id
"6f9e09e5-e836-4de0-a27d-27c74b866001"
id
"019d247f23a0a6e3b20d11fb11bbb9f1"
is_byok
false
latency
5114
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"SiliconFlow"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774433803-yd2xHAfgiXzeumCiCgGa"
upstream_id
"019d247f23a0a6e3b20d11fb11bbb9f1"
total_cost
0.00051396
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)