NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBJS8K42Q6EEF0MHH7JMB8H
Run Details
Model:
Qwen 3.5 Flash
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
38.3s
Cost:
$0.003047
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of burnt sugar hung heavy in the air, masking something sharper beneath. You didn't touch the croissants this morning. I had my reasons. The ones with the blue ribbon were the same. The baker's knife gleamed on the counter, resting on the receipt for the flour delivery. That receipt's been open to the wrong page. A mistake happens to everyone. Except you haven't made a mistake in years. You know exactly how many pounds of sugar we use before the first train leaves the station. Stop looking at the window. Mrs. Gable is watching the delivery truck. Let her watch. She sees nothing but loaves. Not the ones hidden in the back. There aren't any hidden loaves. I counted them myself when you locked up. Not loaves. The package you dropped behind the mixing bowls yesterday. The flour bag moved on its own. Flour doesn't have pockets, but ink does. I shouldn't have asked you to verify the weight. Too late. Open the oven. Cold dough won't rise, but secrets always ferment eventually. Check the timer. It's been running since Tuesday. And what if it's not just dough? Then we've got a lot of bread to eat now.
created_at
"2026-02-25T23:40:10.735Z"
2/25/2026, 11:40:10 PM
model
"qwen/qwen3.5-flash-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
450
moderation_latency
(null)
generation_time
38163
tokens_prompt
81
tokens_completion
6139
native_tokens_prompt
78
native_tokens_completion
7598
native_tokens_completion_images
(null)
native_tokens_reasoning
7324
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.003047
router
(null)
provider_responses
0
endpoint_id
"5f6b71b5-8672-4b27-8b2d-f674429f9e63"
id
"chatcmpl-37869f00-89cd-9e7e-bdee-ff213c785609"
is_byok
false
latency
450
model_permaslug
"qwen/qwen3.5-flash-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772062810-eDTKnSQPKb3e8bEiMng9"
upstream_id
"chatcmpl-37869f00-89cd-9e7e-bdee-ff213c785609"
total_cost
0.003047
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Count dialogue tags
n/a
neededClean
false
noDialogue
true