NC Bench
Run: 2026-02-25-01KJBDV0NG5WVGJC2NKS4RSS4Q
Run Details
Model:
Qwen 3.5 Flash
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
50.5s
Cost:
$0.003160
Provider:
Alibaba
Evaluation summary
Score:
100.00%
Evaluator:
Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The ovens hummed, a low vibration against the counter where flour still dusted the floor. "Did you find the note hidden in the dough?" "No note. Just a receipt for extra sugar." "Sugar doesn't explain the open front door." "I locked it up tight." "Then who left the key under the mat?" "The police asked that too." "I told them you're the baker, not the killer." "Maybe they have a secret recipe." "It keeps business very good." "Is the jam from the cellar still the same?" "Just blackberries, same as always." "It tasted bitter today." "Bitter's best. Needs a little edge." "Town is whispering again. Ledger missing." "Ledgers are boring. Bread is always warm." "Don't change the subject." "Someone wants the ledger, but nobody wants my oven." "That's exactly why they're here." "So, are you going to finish that scone?" "I'll finish it. Then I'll check the alley." "Don't sweep up too much dust." "Some things settle there. Some things stay. You know how to hide them." "Or eat them. Depends on the appetite." "Mine wants the truth wrapped in flour." "Then take a bite. The rest belongs to the morning."
created_at
"2026-02-25T22:13:45.27Z"
2/25/2026, 10:13:45 PM
model
"qwen/qwen3.5-flash-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
424
moderation_latency
(null)
generation_time
50371
tokens_prompt
81
tokens_completion
6502
native_tokens_prompt
78
native_tokens_completion
7880
native_tokens_completion_images
(null)
native_tokens_reasoning
7591
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0031598
router
(null)
provider_responses
0
endpoint_id
"5f6b71b5-8672-4b27-8b2d-f674429f9e63"
id
"chatcmpl-1bd90e2f-e211-9df9-8635-b2d8f171dbfb"
is_byok
false
latency
424
model_permaslug
"qwen/qwen3.5-flash-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772057625-rIhgO2jvPLiHqqdFz2ea"
upstream_id
"chatcmpl-1bd90e2f-e211-9df9-8635-b2d8f171dbfb"
total_cost
0.0031598
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
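The metadata above can be used to see why a scene of about 200 words consumed several thousand completion tokens. The sketch below derives a few figures from the reported values. It assumes that generation_time is in milliseconds and that native_tokens_reasoning is counted inside native_tokens_completion; neither is stated on this page, and the helper name tokens_per_second is hypothetical.

```python
# Values copied from the run metadata above.
generation_time_ms = 50371          # assumption: reported in milliseconds
native_tokens_completion = 7880
native_tokens_reasoning = 7591      # assumption: a subset of the completion tokens


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Average generation rate over the whole request."""
    return tokens / (elapsed_ms / 1000.0)


# Tokens left over for the visible story once reasoning is subtracted.
visible_tokens = native_tokens_completion - native_tokens_reasoning
# Fraction of the completion spent on reasoning rather than on the story.
reasoning_share = native_tokens_reasoning / native_tokens_completion
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)

print(f"visible tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {throughput:.1f} tokens/s")
```

Under these assumptions, most of the completion budget went to reasoning, with only a small remainder used for the returned scene.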
Evaluation details
Result:
100.00%
Evaluator:
Count dialogue tags
Details:
n/a
Meta Data:
neededClean: false
dialogueTags: (empty)
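The Count dialogue tags evaluator reports an empty dialogueTags list for this response, but its implementation is not shown on this page. As a rough illustration of how such a check might work, the sketch below looks for attributions such as "Mara said" or "whispered Tom" that sit directly next to a quotation mark. The verb list, the regular expression, and the function name find_dialogue_tags are all assumptions made for illustration, not the benchmark's actual code, and no mapping from the tag count to a score is implied.

```python
import re

# Hypothetical verb list; the real evaluator's vocabulary is not documented here.
SPEECH_VERBS = ("said", "asked", "replied", "whispered", "muttered", "answered")

_V = "|".join(SPEECH_VERBS)
_WHO = r"(?:[A-Z][a-z]+|he|she|they)"
# An attribution is "<speaker> <verb>" or "<verb> <speaker>".
_ATTRIB = rf"(?:{_WHO}\s+(?:{_V})|(?:{_V})\s+{_WHO})"
# It counts as a dialogue tag when it directly follows a closing quote
# or directly precedes an opening quote.
_TAG = re.compile(rf'(?<=["”] ){_ATTRIB}\b|\b{_ATTRIB}(?=,? ["“])')


def find_dialogue_tags(text: str) -> list[str]:
    """Return every attribution found next to a quotation mark."""
    return [m.group(0) for m in _TAG.finditer(text)]


# Excerpt from the model response above: no speaker is named.
untagged = ('"Did you find the note hidden in the dough?" '
            '"No note. Just a receipt for extra sugar." '
            '"The police asked that too."')
# A made-up counterexample containing two attributions.
tagged = '"Lock the door," Mara said. "It is late," whispered Tom.'

print(find_dialogue_tags(untagged))
print(find_dialogue_tags(tagged))
```

A pattern-based check like this is deliberately narrow: it misses action beats and unusual speech verbs, so it should be read as a sketch of the idea rather than a complete detector.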