NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBJ6NSPZE8KZQG7467E1RV6
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
42.4s
Cost:
$0.014637
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell chimed as the door swung shut, cutting off the rush of cold wind. Inside, the air was thick with cinnamon and yeast, a warm, sweet blanket against the chill. "Morning rush was quieter than usual today." "Too quiet." The woman behind the counter didn't turn from the kneading dough. "People are still spooked." "About the mayor?" "About everything." Flour dusted her apron like fresh snow. She wiped her hands on a faded blue towel. "You wouldn't happen to know why the police are asking about the delivery from yesterday." "Just a baguette shipment, isn't it?" "A baguette doesn't need a coroner’s note attached to the receipt." He leaned against the counter, eyes scanning the glass display case. "Maybe it was the cream puffs. Sweet, but hiding something rotten inside." "Nothing sweet about this town anymore." She picked up a metal tray, checking the crust. "Besides, the only thing going missing is your appetite for the truth." "I’m just here for coffee." "You came in after three a.m., Arthur." "I was checking the ovens." "Ovens simply don't run at three in the morning." "And they don't hide bodies in the dough." "Then maybe you better start checking the flour bags." "I am." "Good."
created_at
"2026-02-25T23:30:01.66Z"
2/25/2026, 11:30:01 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
448
moderation_latency
(null)
generation_time
42269
tokens_prompt
81
tokens_completion
5220
native_tokens_prompt
78
native_tokens_completion
7309
native_tokens_completion_images
(null)
native_tokens_reasoning
7012
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0146375
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-6203c659-a419-92cb-a29e-3011cc6020d9"
is_byok
false
latency
448
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772062201-o2JOj9GpCRr6MccV6tAw"
upstream_id
"chatcmpl-6203c659-a419-92cb-a29e-3011cc6020d9"
total_cost
0.0146375
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)