NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBEW45SM0J48DB4Y1R5N9Q6
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
1m 27s
Cost:
$0.025968
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell jingled, cutting through the scent of yeast. Dust danced in the morning light. "You’re late. Sheriff Miller said he’d be here by nine." "Traffic on the bridge was a nightmare. And I had to drop something off at the post office. You know how he gets." Boots hit the floor hard. "Did you hear about the Miller’s old safe?" A pause followed, the hum of the oven filling the silence. "That thing hasn’t been touched since the fifties. It’s full of junk." "Is it?" A napkin was picked at idly. "Funny how some things keep showing up in the same spot. Old letters. Bank books. But never the key." "You’re looking for trouble again." Black coffee was poured. "That safe is better left buried." "Boring doesn’t keep people alive." A sip was taken, eyes narrowing at the clock. "Someone’s watching the shop. Just like they watched the Miller’s last." "Then I should lock the back door." Dough stirring stopped. "You should be more careful. Secrets smell sweeter than croissants if you know where to look." Silence stretched slowly, broken by the hum of the refrigerator. "Coffee’s on the house. Just don’t tell anyone what you heard." The bell jingled again.
created_at
"2026-02-25T22:31:50.21Z"
2/25/2026, 10:31:50 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
404
moderation_latency
(null)
generation_time
86317
tokens_prompt
81
tokens_completion
9235
native_tokens_prompt
78
native_tokens_completion
12974
native_tokens_completion_images
(null)
native_tokens_reasoning
12688
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0259675
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-c35a586e-1555-91af-a0d5-95c7f7c9806a"
is_byok
false
latency
404
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772058710-yuT0sMJTUzxOKaFlq4wZ"
upstream_id
"chatcmpl-c35a586e-1555-91af-a0d5-95c7f7c9806a"
total_cost
0.0259675
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)