NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ70DVSP3Z6WAZT5KGVE7JAA
Run Details
Model:
Qwen 3.5 Plus (2026-04-20)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
48.8s
Cost:
$0.007738
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
"Don't look at the door, but the mayor's wife walked out without paying for the sourdough." "She always pays. She's just distracted since the incident at the pier. Hand me the piping bag." "Incident? The police taped off a body, Martha. I saw the tarps, and I saw who stood near them when the sheriff arrived. Muddy boots matching the lighthouse path." "Keep your voice down. The flour dust catches secrets, and we need the Sunday reception contract. You can't go around accusing the postmaster." "I'm worried. Who sends oilcloth-wrapped packages to a bakery? This one smelled like bitter almonds. The wax seal was warm." "Cyanide? You're reading too many of those pamphlets. Probably just Mr. Henderson's extract." "Mr. Henderson died three years ago. The handwriting on the label matched the warning note taped to my oven this morning. Fresh ink. Someone is mixing a dangerous batch, and I think the first ingredient just walked in." "Glaze the crullers. Steady hands. If you shake, you'll ruin the batch. We have customers." "Right. But watch the bell. If I swallow instead of greeting, take the cash drawer and run out the back." "Fine. But cover the starter. I won't leave my life's work behind."
created_at
"2026-04-27T08:20:54.205Z"
4/27/2026, 8:20:54 AM
model
"qwen/qwen3.5-plus-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
947
moderation_latency
(null)
generation_time
48753
tokens_prompt
81
tokens_completion
3315
native_tokens_prompt
78
native_tokens_completion
3211
native_tokens_completion_images
(null)
native_tokens_reasoning
2921
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0077376
router
(null)
provider_responses
0
endpoint_id
"8b4a0fbb-6705-4f8d-a7f1-ad18ecc0edbc"
id
"chatcmpl-31968bd2-f4cf-95df-927a-98755a156291"
is_byok
false
latency
840
model_permaslug
"qwen/qwen3.5-plus-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777278054-EgY4adnAXRK8bEdPaqxs"
session_id
(null)
api_type
"completions"
id
"gen-1777278054-DnaXeOAyaUVKSisltEQN"
upstream_id
"chatcmpl-31968bd2-f4cf-95df-927a-98755a156291"
total_cost
0.0077376
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)