NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ782W72E1XSAP248DHHX008
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
53.4s
Cost:
$0.015631
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
87.27%
Dialogue to Total Word Ratio
(details)
86.8234%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the strap of her oversized tutu, the itchy tulle rubbing harshly against the Kevlar weave of her tactical vest. The van smelled of stale pretzels and damp wool. A crayon map on the seat highlighted the route in red marker. Outside, the suburban lawn of 42 Maple Drive pulsed with manic energy as twelve six-year-olds chased a dragon piñata that had already lost three limbs. Molly checked her watch, encased in pink silicone to mimic a bouncy castle clock. The dial read 2:14 PM. This required surgical precision. The napkin drop was the cover. Once the parents finished the salmon, the guard dog would drift into a salmon-induced coma. "We're cutting it close," she whispered, tapping a gloved finger against the dashboard. "That's our window. Bill, are the suction cups charged and the glue loaded? Carefully, Bill." Bill fumbled with a device that resembled a modified glue gun, his fingers trembling as he twisted the battery casing. He wore a ruff of lace that scratched his chin, and his wig was slightly askew, revealing nervous hair matted by sweat. The van shook as a delivery truck rumbled past. Bill swallowed hard, his Adam's apple bobbing visible through the camouflage paint on his face. "They're charged and loaded. I hope no one sees us. If you say 'Go,' I fire. If I miss, we're stuck to the fireplace and then we die." Molly rolled her eyes, adjusting her propeller hat. She checked her pockets, fingers brushing the noise-canceling headphones and the heavy canvas of the loot sacks. The plan was simple. She extracted the tree while Bill created a diversion. He was the distraction; she was the ghost. "You will not miss. Please don't fail me. Just remember the plan. I extract the tree. You create the diversion. Do not engage the piñata. It has sharp sticks. We need clean hands, not puncture wounds." Bill nodded, eyes fixed on the door slats. The laughter drifted in, cruel and bright. He watched the host mother juggle a bottle of juice, her smile twitching with desperation. The streamer dangled from her fingers like a surrender flag. The juice bottle sloshed precariously. The backyard was a labyrinth of plastic furniture and discarded wrapping paper. Molly gripped her bag, the weight of the empty sacks promising glory. This was the target. No security. Just a mother running on caffeine, surrounded by shiny loot. "Bill, focus. Glitter is tactical. It blinds the birthday boy for four seconds. Do you have the blackout hood?" Bill patted his vest. "I have the hood. And the juice." Molly murmured, peering at Bill's reflection. He looked ridiculous, a caricature of a thief, but he was hers. "On my mark, we roll. You handle the glitter. I handle the loot. And remember, no glitter in the car. And check your zipper. Stay alive out there." Bill's voice was barely audible. "Understood. Glitter and go. I'll try not to trip." Molly popped the door. She stepped out. Bill followed, holding the glitter cannon like a holy relic, trembling with criminal absurdity.
created_at
"2026-04-27T10:34:42.798Z"
4/27/2026, 10:34:42 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
638
moderation_latency
(null)
generation_time
53229
tokens_prompt
77
tokens_completion
10251
native_tokens_prompt
76
native_tokens_completion
10408
native_tokens_completion_images
(null)
native_tokens_reasoning
9710
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.015631
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-d9c2f9c9-6007-9bfc-95b9-dae6b4c93bb0"
is_byok
false
latency
638
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777286082-suohXuIhfNBX1Wj3kcqo"
session_id
(null)
api_type
"completions"
id
"gen-1777286082-vX4oryZsYrbSsp51zsc9"
upstream_id
"chatcmpl-d9c2f9c9-6007-9bfc-95b9-dae6b4c93bb0"
total_cost
0.015631
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
511
87.27%
Dialogue to Total Word Ratio
Ratio: 28.35%, Deviation: 1.65%
neededClean
false
wordsTotal
515
wordsDialogue
146
86.8234%