NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBJ1JQTPCX8NJJ4JG1VSZ4X
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
40.8s
Cost:
$0.011959
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell chimed. Cutting through the haze of yeast and cinnamon. She wiped her apron, staring at the stranger standing in the corner. "Still keeping the secrets, I see." "Only the good ones. The rest spoil in the heat." "There are too many bad ones in this town lately. Even for a place like this." "Then why come in?" "To taste the truth. Just like always." He picked up a croissant, inspecting the buttery layers in the light. "This one’s different. Bitter crust." "Maybe the ingredients spoke. Or the water." "Or maybe someone poisoned the flour." "That’s a new recipe, darling. No poison, just trouble." She slid a warm plate across the wood. "Eat up. You never know when you might need energy." "You sound sure." "I’ve been running this shop since the fire. People come, people go. Some just vanish." "Vanishing makes the mystery last longer." "Mystery is good for sales. Bodies are better." "You’re getting dark." "I’m getting honest." She poured cups of coffee, steam rising like a ghost. "Drink it fast. The mayor’s outside." "He know you’re feeding me poison?" "He knows nothing. Just like everyone else." The clock ticked to noon. The bell didn’t ring again.
created_at
"2026-02-25T23:27:14.687Z"
2/25/2026, 11:27:14 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
421
moderation_latency
(null)
generation_time
40666
tokens_prompt
81
tokens_completion
4852
native_tokens_prompt
78
native_tokens_completion
5970
native_tokens_completion_images
(null)
native_tokens_reasoning
5673
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0119595
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-77b396d6-d1ad-9560-9909-796ca2e8157a"
is_byok
false
latency
421
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772062034-oTItakG626HzRvflIlbJ"
upstream_id
"chatcmpl-77b396d6-d1ad-9560-9909-796ca2e8157a"
total_cost
0.0119595
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)