NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ7008Z0ENK97HGJR68GM3J6
Run Details
Model:
Qwen 3.5 Plus (2026-04-20)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
39.9s
Cost:
$0.006283
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of burned sugar clung to the air, warring with the sweet yeast rising from the ovens. Dust motes danced in the shaft of light cutting across the checkered floor tiles as the bell above the door chimed, though no one entered. A tray of cooling scones ticked softly behind the glass case. "You saw him leave the alley behind the post office?" "Back of his truck, covered by a tarp that looked suspiciously like Mrs. Gable's quilt. I swear I caught a glimpse of the pink polka dots." "That quilt hasn't moved from her sewing room since '98. If Harold has it, he wasn't hauling firewood." "No, he was hauling something all right. Heavy enough to make the suspension groan, and he was sweating through his wool sweater." "Wool? In July?" "Nervous sweat, darling. Or guilt. Either way, I'm not buying his honey-dew scones until we know where those polka dots ended up." "Check the shed behind the hardware store. The lock was cut this morning." "Right. Keep one eye on Harold when he comes in for his sourdough. If he drops a crumb, I'll know he's lying."
created_at
"2026-04-27T08:13:28.934Z"
4/27/2026, 8:13:28 AM
model
"qwen/qwen3.5-plus-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
742
moderation_latency
(null)
generation_time
39785
tokens_prompt
81
tokens_completion
2610
native_tokens_prompt
78
native_tokens_completion
2605
native_tokens_completion_images
(null)
native_tokens_reasoning
2338
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0062832
router
(null)
provider_responses
0
endpoint_id
"8b4a0fbb-6705-4f8d-a7f1-ad18ecc0edbc"
id
"chatcmpl-280ff9af-800f-9f47-98a4-3db1bd4154db"
is_byok
false
latency
742
model_permaslug
"qwen/qwen3.5-plus-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777277608-HOmAVkKqHIIKKS43qiXD"
session_id
(null)
api_type
"completions"
id
"gen-1777277608-YkLyt8dJWjwJBdswe3FR"
upstream_id
"chatcmpl-280ff9af-800f-9f47-98a4-3db1bd4154db"
total_cost
0.0062832
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)