NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBHTQV9NFD65FVZT5M0KV3R
Run Details
Model:
Qwen 3.5 27B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
1m 55s
Cost:
$0.015971
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed, cutting through the heavy scent of cardamom and burnt sugar. Flour dusted the counter like fresh snow, undisturbed. "You didn't sell the last batch of sourdough to Mrs. Gable?" "No. I locked it in the safe after closing." "Then how did she have a fresh loaf at the church potluck this morning?" A hand wiped a smudge from the glass case. "She's a liar. Or someone slipped past the back door while I was in the walk-in." "The back door was bolted. I checked myself before leaving." "Did you check the window above the sink?" "I checked everything. The alarm didn't trip, and the camera was facing the wrong way." "Maybe it wasn't the bread she wanted." "What else is in the safe?" "The ledger. The one with the missing donations." Silence stretched between them, broken only by the hum of the proofing oven. Warmth radiated from the metal doors. The clock on the wall ticked loudly. "You think she stole it to hide something?" "I think someone needed to make sure the numbers didn't add up before the audit." "Then we better find the real baker." "Before the next bake sale." "Before the next bake sale."
created_at
"2026-02-25T23:23:30.546Z"
2/25/2026, 11:23:30 PM
model
"qwen/qwen3.5-27b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
553
moderation_latency
(null)
generation_time
114773
tokens_prompt
81
tokens_completion
5176
native_tokens_prompt
78
native_tokens_completion
6645
native_tokens_completion_images
(null)
native_tokens_reasoning
6360
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0159714
router
(null)
provider_responses
0
endpoint_id
"f3f88554-0c22-4d57-973b-0dec948808e8"
id
"chatcmpl-ad8690d9-e0e3-9d3e-9ad3-7cf881d2a647"
is_byok
false
latency
553
model_permaslug
"qwen/qwen3.5-27b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772061810-S0ybE7m1AKO4UrKXJP8V"
upstream_id
"chatcmpl-ad8690d9-e0e3-9d3e-9ad3-7cf881d2a647"
total_cost
0.0159714
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)