NC Bench
Run: 2026-02-16-01KHK2YWW2ASAPTYSWDVE5CQJ6
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
3m 23s
Cost:
$0.026928
Provider:
Alibaba
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
Scent of burned sugar and vanilla hung heavy in the air, masking something much sharper beneath. Flour dusted the counter like snow. "Did you see him Tuesday night?" "Just another customer. Ordered a scone, paid cash, and left." "He didn't leave. Not really." The piping bag paused, cream freezing atop the delicate sponge. "What's that supposed to mean?" "You know. The police missed the back door." A metal tray clattered loudly onto the steel table. "This is a bakery, not a gossip mill. Pay or go." "Everyone knows you were closing up when the lights went out on Main Street." "Everyone talks too much." "Then why hide the ledger under the register?" Silence stretched between them, thick as rising dough. The oven timer dinged sharply. "That's not for sale." "Neither is the truth." Floured hands wiped slowly on an apron stained with dark berry juice. "Come back tomorrow morning. When the rolls are fresh." "I'll be here. Before you open." "Don't bother. The lock's changed." "Keys are easy to copy." "So are lies." The bell above the door jingled as the figure stepped into the rain, leaving the warmth behind for the cold.
created_at
"2026-02-16T11:25:15.743859+00:00"
2/16/2026, 11:25:15 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
847
moderation_latency
(null)
generation_time
203189
tokens_prompt
81
tokens_completion
5988
native_tokens_prompt
78
native_tokens_completion
7467
native_tokens_completion_images
(null)
native_tokens_reasoning
7188
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.026928
router
(null)
provider_responses
0
id
"chatcmpl-af2bfd65-b2cd-9013-838f-399c912bda75"
status
200
is_byok
false
latency
847
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771240911-kOoAu8ZoEvLrGwypJCtj"
upstream_id
"chatcmpl-af2bfd65-b2cd-9013-838f-399c912bda75"
total_cost
0.026928
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result:
100.0000%
Evaluator:
Count dialogue tags
Details:
n/a
Meta Data:
neededClean: false
dialogueTags: (empty)
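The page does not show how the "Count dialogue tags" evaluator works. As a rough illustration only, the sketch below finds attributions such as `she said` or `said Martha` that directly follow a closing quotation mark, and returns full marks when none are found. The verb list, the regular expression, and the all-or-nothing scoring are assumptions made for this example, not NC Bench's actual rules.

```python
import re

# Hypothetical verb list; the real evaluator's vocabulary is not shown on this page.
TAG_VERBS = ("said", "asked", "replied", "whispered", "shouted", "muttered", "answered")
_VERBS = "|".join(TAG_VERBS)

# A quotation mark followed by either "<one or two words> <verb>" (she said)
# or "<verb> <word>" (said Martha).
_TAG_AFTER_QUOTE = re.compile(
    r'["”]\s+(?:(?:[A-Za-z]+\s+){1,2}(?:' + _VERBS + r')|(?:' + _VERBS + r')\s+[A-Za-z]+)\b',
    re.IGNORECASE,
)


def find_dialogue_tags(text):
    """Return the matched attribution fragments, in order of appearance."""
    return [m.group(0) for m in _TAG_AFTER_QUOTE.finditer(text)]


def score(text):
    """Assumed scoring: 100.0 when no tags are found, otherwise 0.0."""
    return 100.0 if not find_dialogue_tags(text) else 0.0
```

Under these assumptions, narration that follows a line of dialogue (for example, a sentence describing the oven timer) is not counted, because no attribution verb appears right after the closing quote. A simple pattern like this will miss tags that precede the quote or use verbs outside the list.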