NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ78CY5KWFSAJF362BMXW4G7
Run Details
Model:
Qwen3.6 Max Preview
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3m 23s
Cost:
$0.052928
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0152%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly smoothed the wax-paper blueprint across the sticky laminate of the diner booth, weighing down the corners with salt and pepper shakers. The schematic wasn't drawn in ink, but in wobbly purple crayon, detailing a bouncy castle labeled FORTRESS OF FUN. Bill adjusted his black turtleneck, which was unfortunately dusted with a humiliating layer of pink glitter from a reconnaissance mission involving a rogue piñata. He nervously polished a grappling hook fashioned from a bent coat hanger and dental floss, his knee bouncing beneath the table. "Focus," Molly hissed, tapping a red X marked near the cake table with her butter knife. "Target is Gift Mountain. Intel confirms a high concentration of LEGO sets and at least one Nintendo Switch. We strike during musical chairs. That's our window of opportunity." Bill leaned in, his eyes wide and manic behind the ski mask he refused to remove despite the sweltering heat. "Musical chairs? Molly, that's a kill zone. I saw a six-year-old elbow-drop a toddler for a plastic throne last week. We need heavy armor. Or a diversion. I brought a whoopee cushion the size of a manhole cover. It creates a sonic boom of embarrassment." "Stow the cushion, Bill. We need stealth, not flatulence. The primary threat is the entertainer. Captain Giggles. He's a professional." Bill shuddered, rattling the sugar dispenser. "The mime? I heard he can trap a grown man in an invisible box until they wet themselves. How do we neutralize a mime without touching him?" "We bypass. While he constructs balloon animals for the weeping masses, we hit the table. You're on bag duty. I'll manage the parents; they'll be too paralyzed by cheap boxed wine and existential dread to notice us." "What about the birthday girl? She has hawk eyes and a scream capable of shattering glass. She'll spot the bags." Molly reached into her duffel and slammed a bright red foam nose onto the table. It wobbled mockingly between them. "You wear this. If spotted, you honk and dance. It's perfect camouflage. Nobody questions a dancing clown at a party." "I draw the line at the nose. It chafes my spirit. And what about Fluffy? The invitation specifically warned of an aggressive dog." "Fluffy is a three-legged pug with asthma. You can outrun him in your sleep. Extraction is through the bouncy castle, over the fence, and into the minivan." Bill stared at her, his mouth agape beneath the wool. "The minivan? The one with the Baby on Board sticker and the dent in the bumper?" "It's the ultimate getaway vehicle. Nobody looks twice at a minivan. Now check your gear. Do you have the candy bribes?" Bill patted his cargo pockets, producing a handful of crushed lollipops and a lint-covered gummy worm. "Sustenance secured. I also sharpened my letter opener. Just in case the piñata fights back." Molly rubbed her temples. "We're stealing gifts, not assaulting party favors. Be professional. Grab the loot, leave the cake, and do not open the LEGO Death Star. Clear?" "Crystal. But I'm calling dibs on any lightsabers. A thief has needs." "Bags ready. We roll in ten. Wipe that glitter off. You look like a nervous disco ball."
created_at
"2026-04-27T10:40:12.473Z"
4/27/2026, 10:40:12 AM
model
"qwen/qwen3.6-max-preview-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1360
moderation_latency
(null)
generation_time
203142
tokens_prompt
77
tokens_completion
6921
native_tokens_prompt
76
native_tokens_completion
6773
native_tokens_completion_images
(null)
native_tokens_reasoning
6036
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0529282
router
(null)
provider_responses
0
endpoint_id
"ca7fc0c7-f23d-4ac5-b2e0-945be5b61787"
id
"chatcmpl-93443f08-ad7f-9b51-a777-35dd781736a5"
is_byok
false
latency
1360
model_permaslug
"qwen/qwen3.6-max-preview-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777286412-jTCNLEiF1dnTpIDECzjV"
session_id
(null)
api_type
"completions"
id
"gen-1777286412-SdSetpMuVIwhPbCcSlSW"
upstream_id
"chatcmpl-93443f08-ad7f-9b51-a777-35dd781736a5"
total_cost
0.0529282
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
530
0.00%
Dialogue to Total Word Ratio
Ratio: 66.60%, Deviation: 16.60%
neededClean
false
wordsTotal
536
wordsDialogue
357
0.0152%