NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK1WXEW3R81FG81RKY7BJDR
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
12.1s
Cost:
$0.000733
Provider:
Alibaba
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of warm cinnamon rolls hung heavy in the air, masking the tension between the two figures at the corner table. "You really think adding extra cardamom was a good idea? Mrs. Gable is particular about her morning bun." "She said she wanted something new. Besides, the spice covers the bitterness of the burnt batch from yesterday." "Burnt? I thought we tossed those." "We did. Most of them. But a few might have slipped into the display case before I noticed." A nervous laugh escaped, quickly stifled by a sip of black coffee. "If she finds out, she'll cancel the order for the town festival. We can't afford that." "Then we just don't tell her. Smile, nod, and hope she thinks it's a bold new recipe." "It's not just the recipe. Did you see who was lurking behind the shop this morning? That stranger with the notebook." "Probably just a food critic or a tourist. Small towns attract them like flies to honey." "He wasn't taking pictures of the pastries. He was watching the back door. Watching us." "Stop being paranoid. Finish your coffee. The lunch rush starts in ten minutes, and we still need to frost the scones." "If he comes back, what do we say?" "We say nothing. We bake. And we pray Mrs. Gable loves cardamom."
created_at
"2026-02-16T11:03:30.927829+00:00"
2/16/2026, 11:03:30 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
981
moderation_latency
(null)
generation_time
12016
tokens_prompt
81
tokens_completion
326
native_tokens_prompt
80
native_tokens_completion
292
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0007328
router
(null)
provider_responses
0
id
"chatcmpl-3571e1b1-354b-9263-9f56-6d30df8b02ab"
status
200
is_byok
false
latency
981
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771239798-CEe3obK4z7GRcO60dxG3"
upstream_id
"chatcmpl-3571e1b1-354b-9263-9f56-6d30df8b02ab"
total_cost
0.0007328
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)