NC Bench
Run: 2026-04-27-01KQ76RVFS8VWABM52Z29SV4YC
Run Details
Model: Qwen 3.6 Flash
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 50.3s
Cost: $0.014356
Provider: Alibaba
Evaluation summary
Score     Evaluator
0.00%     Matches word count
14.94%    Dialogue to Total Word Ratio
7.4694%   Overall
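The Overall figure appears to be the unweighted mean of the two evaluator scores. This is an inference from the numbers shown on this page, not documented NC Bench behavior. The sketch below uses hypothetical variable names and the scores as displayed (the ratio score is presumably rounded for display, which would explain the small gap to 7.4694%).

```python
# Evaluator scores as displayed in the summary, in percent.
scores = {
    "Matches word count": 0.00,
    "Dialogue to Total Word Ratio": 14.94,
}

# Assumption: Overall is a plain average of the evaluator scores.
overall = sum(scores.values()) / len(scores)

print(f"Overall (mean of displayed scores): {overall:.2f}%")
```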
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her camouflage, a daisy onesie and a plastic tiara threatening her temple, while Bill wrestled a grappling hook loose from a potted fern. Sour gummy worms scattered across Bill's tactical kneepads as he scrambled to retrieve them, his breath hitching in the lavender-scented air beyond the hedge. Techno-kids music vibrated through the chain-link fence, synchronizing with the frantic pulse on Molly's hair-clip monitor. She checked her pouch: rubber duck foghorn, zip-tie grapples, and a net for runaway piñatas. The living room window revealed the target, a spruce groaning under wrapped cardboard rectangles, guarded by a mother with a spatula and the trajectory of a dangerously full bouncy castle. "Molly, we need to discuss the extraction metrics before-" "Focus, Bill." Bill dropped the gummy worms, but a butterfly drone snatched three before he could recover, leaving a purple stain on his glove. He wiped his face, smearing the residue across his goggles. "The mother is moving toward the cake. Her grip on the spatula is tightening. This is a high-risk variable." "Leo's pulse is elevated. Once the cake hits him, his cognitive defenses drop. That's our window." Bill adjusted his tiara, which slid over his eye, plunging him into darkness until he flailed and caught a balloon arch. He straightened, breathing heavily as he stared at the flaming torches being juggled by a woman in a crown made of helium-filled latex tubes. "Are those real fire? My insurance doesn't cover fire-related liquidation." "We are shadows. We move now." A donkey piñata swayed menacingly from the roof, its rope groaning as shadows stretched across their hiding spot, mimicking grasping claws. Molly felt sweat slick her stealth-diaper fabric, the moisture compromising the grip rating significantly. "Negative on the hesitation. The donkey is a decoy. On three." Bill gripped a twenty-gallon gift bag, the plastic crinkling loudly against the silence of their crouch. "One. Two..." "Wait! 
Did you check the bag topology?" "It fits the heuristic limits. The volume is sufficient for the payload density." "Bill. That bag has a bottom. The presents don't have bottoms. You'll carry three thousand dollars of plastic in a sack that collapses under its own volume. Do you have backup containment units?" Bill froze, his mouth opening and closing like a landed fish. "I assumed the structural integrity would hold." "Bill, my stealth-diaper has the capacity of a thimble and the grip of a wet sock. You grab the high-value small stuff." "The small stuff? You want me to loot crayons and plastic rings?" "Bouncy balls and stickers! Get off the ground before Leo's father notices our daisy pattern is a tracker for a drone strike. On three! One! Two!" Molly raised the rubber duck. Bill prepared to lunge, his knees cracking audibly. The piñata swung lower, casting a long shadow over the target tree. Somewhere, a kazoo wailed the theme to a show they were both pretending not to hate. Molly took a deep breath, centered her chi, and prepared to unleash the sound that would signal the end of a childhood. "Three!" Molly screeched the honk with the force of a retired operative, unleashing a sonic blast that shattered the illusion of calm and sent a flock of decorative paper cranes scattering like panicked spirits.
created_at
"2026-04-27T10:11:45.792Z"
4/27/2026, 10:11:45 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
646
moderation_latency
(null)
generation_time
49744
tokens_prompt
77
tokens_completion
9219
native_tokens_prompt
76
native_tokens_completion
9558
native_tokens_completion_images
(null)
native_tokens_reasoning
8800
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.014356
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-9baba1aa-feb9-9057-b55c-4aeaae2e6d86"
is_byok
false
latency
645
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777284705-JvPJFnHWiXCEXTm1E7Xd"
session_id
(null)
api_type
"completions"
id
"gen-1777284705-NthWT8SdRxlsu70kf1jh"
upstream_id
"chatcmpl-9baba1aa-feb9-9057-b55c-4aeaae2e6d86"
total_cost
0.014356
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
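A few derived figures help when reading the metadata above. The sketch below is illustrative only: it assumes `latency` and `generation_time` are in milliseconds (the page does not state units) and that `native_tokens_reasoning` is counted inside `native_tokens_completion`. The variable names are hypothetical.

```python
# Values copied from the metadata above.
native_tokens_completion = 9558
native_tokens_reasoning = 8800
latency_ms = 646            # assumed unit: milliseconds
generation_time_ms = 49744  # assumed unit: milliseconds

# Assumption: reasoning tokens are a subset of completion tokens.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Rough throughput over the generation window only.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Time to first token plus generation time, in seconds.
total_seconds = (latency_ms + generation_time_ms) / 1000

print(f"non-reasoning completion tokens: {visible_tokens}")
print(f"reasoning share of completion: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.0f} tokens/s")
print(f"latency + generation time: {total_seconds:.2f} s")
```

Under these assumptions, 758 completion tokens remain for the visible scene, and the summed time of 50.39 s is close to the reported Duration of 50.3s.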
Evaluation details
Result    Evaluator                       Details                            Meta Data
0.00%     Matches word count              n/a                                neededClean: false, words: 541
14.94%    Dialogue to Total Word Ratio    Ratio: 36.17%, Deviation: 6.17%    neededClean: false, wordsTotal: 553, wordsDialogue: 200
7.4694%   Overall
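The reported ratio and deviation can be reproduced from the word counts in the metadata. The sketch below assumes the deviation is the absolute difference between the measured ratio and the 30% target named in the scenario. How the evaluator maps that deviation to its 14.94% score is not shown on this page, so it is not modeled here.

```python
# Word counts from the "Dialogue to Total Word Ratio" metadata.
words_dialogue = 200
words_total = 553

# Target from the scenario: 30% of words should be dialogue.
target_percent = 30.0

ratio_percent = words_dialogue / words_total * 100

# Assumption: deviation is the absolute gap to the target, in percentage points.
deviation_percent = abs(ratio_percent - target_percent)

print(f"Ratio: {ratio_percent:.2f}%, Deviation: {deviation_percent:.2f}%")
```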