NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBHSD2NTA0727NGM7RDGCKG
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
1m 43s
Cost:
$0.015640
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of cinnamon and burnt sugar clung to the air, thick as wool against the winter chill. Flour dusted the counter like fresh snow, undisturbed by the morning rush. "The vanilla extract is gone again." "That's the third time this week. You're sure it wasn't used in the cupcakes?" "I checked the logs. Nothing was baked. Just disappeared from the shelf." "Could be a leak in the back room. Or someone with a key." "Only the owner and I have keys. And the owner is dead." Silence stretched, broken only by the refrigerator's hum. The bell chimed, announcing a new arrival, but neither looked up. "So, it's not a leak." "No. It's a message. Or sabotage." "Who benefits from your closure?" "Everyone who wants to buy the building cheap. But nobody wants the family recipes." "Except maybe the new owner down the street." "He bought the shop next door yesterday. Coincidence?" "Probably. But I'll keep an eye on the flour bins." "Do that. And hide the yeast." "Already done. In the safe." "Smart. Now, about that coffee?" "Black. No sugar." "Just like the truth."
created_at
"2026-02-25T23:22:46.747Z"
2/25/2026, 11:22:46 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
502
moderation_latency
(null)
generation_time
103176
tokens_prompt
81
tokens_completion
4754
native_tokens_prompt
78
native_tokens_completion
6507
native_tokens_completion_images
(null)
native_tokens_reasoning
6233
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0156402
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-092eb03f-82c6-900d-9a86-fdb7d75974b2"
is_byok
false
latency
502
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772061766-jbsJz9otNpnBH7AuDcUZ"
upstream_id
"chatcmpl-092eb03f-82c6-900d-9a86-fdb7d75974b2"
total_cost
0.0156402
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)