NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBK97EF4SFDQ8XME71JTKW3
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
52.1s
Cost:
$0.017743
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
Warmth radiated from the ovens, mixing with the heavy scent of powdered sugar and damp wool. One figure wiped the counter, dusting away crumbs. The other leaned against the glass display. Outside, the street was silent. The bell chimed softly, but no one entered. "The blueberry scones were gone again. I checked the display case at six this morning." "They sell out quickly. It seems a popular morning." "Not that quickly. Not enough people in town to justify a full dozen vanishing overnight." "Don’t be absurd. I locked the pantry myself." "You always lock the pantry. Something tells me you missed a latch." "I don’t miss latches. I watch the front door. I watch the back." "Then why hasn’t anyone noticed?" "Maybe they noticed. Maybe they’re too polite to say anything." "I’ll start baking earlier. I’ll leave the door cracked open." "You can’t leave the door open with the smell of cinnamon. People know quietly." "Then I suggest you check the cellar." "I never leave the cellar unlocked." "That is exactly." "You never leave the cellar unlocked. But someone else might be hiding in the flour sacks." "If you find a missing person, don’t tell the sheriff." "I wouldn’t dare. I keep secrets just like I keep receipts, right now."
created_at
"2026-02-25T23:48:53.848Z"
2/25/2026, 11:48:53 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
447
moderation_latency
(null)
generation_time
52002
tokens_prompt
81
tokens_completion
8452
native_tokens_prompt
78
native_tokens_completion
8862
native_tokens_completion_images
(null)
native_tokens_reasoning
8571
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0177435
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-6e3e6735-dbae-9a8d-bc39-cc8691879437"
is_byok
false
latency
447
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772063333-p6JmmbwWkOP1fRysX23W"
upstream_id
"chatcmpl-6e3e6735-dbae-9a8d-bc39-cc8691879437"
total_cost
0.0177435
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)