NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK12NYNYAE8SRXZT5X6CMJ6
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
1m 36s
Cost:
$0.013345
Provider:
Alibaba
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of cinnamon hung heavy in the air, masking something sharper beneath the sugar. A hand slammed a ceramic mug onto the counter, coffee sloshing over the rim. "You heard nothing?" "Just the oven timer. And the sirens wailing past." "Not around here. Not at midnight." Flour dusted the apron tied tight around a waist that hadn't moved from behind the register. "Midnight's for sleeping. Or kneading dough." "Someone saw your van near the library steps." A metal spatula clattered against the stainless steel prep table, echoing off the tile. "My van stays in the garage. You know the lock sticks." "Then explain the mud on the tires. Red clay. Only found down by the creek where they found him." "I deliver to the campsite." "In November? With the snow coming?" Silence stretched, filled only by the low hum of the display case lights. A blueberry muffin was wrapped in wax paper and slid aggressively across the surface. "Eat. You're getting cranky." "Don't change the subject." "I'm not. Just saying, hungry people see things that aren't there." "And full people hide things that are." The baker turned away, checking the trays with a little too much focus, wiping an already clean spot. "Next customer."
created_at
"2026-02-16T10:50:35.150132+00:00"
2/16/2026, 10:50:35 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
832
moderation_latency
(null)
generation_time
93799
tokens_prompt
81
tokens_completion
3241
native_tokens_prompt
78
native_tokens_completion
3694
native_tokens_completion_images
(null)
native_tokens_reasoning
3404
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0133452
router
(null)
provider_responses
0
id
"chatcmpl-ed0463ad-9b2b-9abc-9e7b-be9a1d238f33"
status
200
is_byok
false
latency
832
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771238940-bFyTvx7bTQjIN38ZQrPE"
upstream_id
"chatcmpl-ed0463ad-9b2b-9abc-9e7b-be9a1d238f33"
total_cost
0.0133452
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
100.0000%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)