NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBDD2TSJBH7MA3M0SCARBQE
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
39.5s
Cost:
$0.012087
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of cinnamon rolls hung heavy in the air, masking the dust motes dancing in the morning light. A scarred wooden counter separated the two, stained from decades of flour and spilled cream. A fine layer of white dust coated the table. "I heard a rumor about the supply shipment." "Rumors are cheap here, dear. Like the day-old scones." "Not cheap enough to disappear entirely. The flour bags were marked, but they came in full." "Business is good. Too good." "That's what concerns me. People don't just bake extra. They hide something." The knife stopped mid-slice on the cold marble slab. "Are you implying I'm cutting corners?" "I'm implying the delivery truck didn't turn off the main street. It took the back way." "Back gate has been locked for years. Rusty shut the mechanism himself." "Did he?" "Who else would know about the lock?" Silence stretched, thick as the icing on the wedding cakes stacked in the display case. The bell above the door chimed, announcing a new customer, but neither moved to greet them. "Maybe I should check the ledger." "Best not to look where the shadows are." "Shadows don't hide bodies, do they?" "Just secrets. Mostly." "Until they aren't."
created_at
"2026-02-25T22:06:09.032Z"
2/25/2026, 10:06:09 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
392
moderation_latency
(null)
generation_time
39110
tokens_prompt
81
tokens_completion
4480
native_tokens_prompt
78
native_tokens_completion
6034
native_tokens_completion_images
(null)
native_tokens_reasoning
5745
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0120875
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-47758b55-68b5-996f-814b-ba6ba67fcca0"
is_byok
false
latency
392
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772057169-zdMZprmW1GuJd5OK8GNm"
upstream_id
"chatcmpl-47758b55-68b5-996f-814b-ba6ba67fcca0"
total_cost
0.0120875
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)