NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ777BTFSJMDF5YAXKEQSQ1B
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 1s
Cost:
$0.019509
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
9.61%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
4.8041%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The cardboard box smelled faintly of damp cereal and despair. Molly crouched behind the dumpster, adjusting her visor with one gloved hand. Bill fumbled with a zip-tie and a half-eaten granola bar. They were currently operating out of a commercial alleyway three blocks from the Miller residence. The target was a sixth birthday, specifically the fully unwrapped gift mountain awaiting its young master tomorrow morning. Rain began to patter against the pavement. Molly checked her chronometer. The alley was empty except for a stray cat and a discarded soda can. She adjusted her grip on the duffel bag. Bill wiped his palms on his jeans. The plan required absolute precision. One wrong step and the whole operation collapsed. They stood in silence for a moment, listening to the distant hum of traffic. Molly finally broke the quiet. She pointed toward the fence. Bill nodded slowly. They prepared themselves for the climb. The weather forecast promised clear skies by evening. They needed to move quickly. The neighborhood was notoriously well lit. Security cameras glared from every corner. But timing would compensate for poor lighting. Molly tightened her boots. Bill checked his pockets for tools. Everything was accounted for. They were professionals. This was merely a tedious exercise in patience. The presents would not steal themselves. They waited patiently for the exact moment. The clock ticked loudly. The air grew heavy with anticipation. Every second counted. Rain continued falling. Puddles formed near the curb. They checked their watches again. The wind picked up. Cold wind whistled past quietly. Streetlights flickered on. Look at that landscaping, Bill said. Those bushes are practically screaming today. They are not screaming, Molly replied. They are hydrangeas. Wind speed is negligible. We walk in at two fourteen. I hate that you are right, Bill muttered. The getaway cart is parked behind the truck. Perfect. Speed is secondary to discretion. We grab the tech first. Small things are dead weight. Dead weight might be valuable, Bill argued. Collectibles sell for hundreds online. We are not stealing collectibles, Molly snapped. This is a professional extraction. We need to be out before six. Six is tight. Six gives us forty minutes to load and exit. I say we stretch it. The guard is a retired accountant named Gerald who naps. We move on schedule. Pack the duffels. I would never touch the fondue fountain, Bill lied. This job goes exactly as planned. No alarms. Drama is already happening, Molly noted. They forgot the punch bowls. The parents are distracted. We have our window. Window is open, Bill confirmed. Should I bring the ladder? The folding step stool. The ladder is too obvious. Leave the trowels. Noted. Step stool only. I will remember the trash bags. Actually, toss the wrappers in the neighbor bins. We need plausible misdirection. You are enjoying this far too much, Molly said, finally standing. Let us move now. Do not trip. Enthusiasm is my greatest asset, Bill said. Next to my moral compass. Just walk, Molly replied. Do not make eye contact with mascots. Never seen mascots before, Bill muttered, already scanning the yard. This is absolutely glorious.
created_at
"2026-04-27T10:19:41.269Z"
4/27/2026, 10:19:41 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
608
moderation_latency
(null)
generation_time
60848
tokens_prompt
77
tokens_completion
11929
native_tokens_prompt
76
native_tokens_completion
12993
native_tokens_completion_images
(null)
native_tokens_reasoning
12320
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0195085
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-82b62fee-6e55-9e39-b649-a9d8f6ebbb63"
is_byok
false
latency
608
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777285181-EefIBUCY4LIx9wmWAici"
session_id
(null)
api_type
"completions"
id
"gen-1777285181-ObT0lJQdbcua4SmSJv4g"
upstream_id
"chatcmpl-82b62fee-6e55-9e39-b649-a9d8f6ebbb63"
total_cost
0.0195085
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
9.61%
Matches word count
n/a
neededClean
false
words
522
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
524
wordsDialogue
0
4.8041%