NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ79951P7MR28JHPPPS3YM17
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
34.1s
Cost:
$0.010466
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the bakery door chimed, cutting through the scent of yeast and cinnamon. Warmth radiated from the brick ovens, frosting the windows with condensation. "Sourdough needs more than flour, Martha. It needs secrets." "Don't start with that talk. The Association has eyes everywhere." "Eyes don't see inside the glaze. The Mayor visited this morning. He left his cane on the stool and looked like he'd seen a ghost. His napkin was torn and stained blue." "Pale from the commute. He's auditing the charity fund." "Finance or conscience? I found a silver locket in the mill dumpster. It matches your daughter's brooch from the night she vanished. You should hide the tongs." "Stop it. That locket belongs to the antique shop owner. Everyone knows it." "Everyone knows what they're told. And the Mayor's scone tasted like bitter almonds." "Almond extract. We're out of vanilla and the delivery isn't here yet. The baker swore it was safe, but the Mayor clutched his throat." "Vanilla doesn't leave a metallic tang, Mrs. Gable. Or stain fabric blue. The cloth slid across the scarred wood. I'm calling Detective Miller before the lunch rush ruins the evidence in the croissants."
created_at
"2026-04-27T10:55:37.022Z"
4/27/2026, 10:55:37 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
661
moderation_latency
(null)
generation_time
33941
tokens_prompt
81
tokens_completion
6678
native_tokens_prompt
78
native_tokens_completion
6964
native_tokens_completion_images
(null)
native_tokens_reasoning
6681
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0104655
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-5116beb4-d216-9bd1-b337-f5912bfb7983"
is_byok
false
latency
661
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777287337-x5anFfY4M4TxgzCXnB5S"
session_id
(null)
api_type
"completions"
id
"gen-1777287337-YfuLuHMFMOqxwVfx0ELg"
upstream_id
"chatcmpl-5116beb4-d216-9bd1-b337-f5912bfb7983"
total_cost
0.0104655
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)