NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ79RV6RVY07RYKMSGJRRPK8
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
51.5s
Cost:
$0.015770
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
11.82%
Dialogue to Total Word Ratio
(details)
5.9106%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her nose against the cold chain-link fence, flattening it into a grotesque pancake that smelled of chlorine and regret. Inside the backyard, a bouncy castle thumped rhythmically against a house draped in streamers that suggested a rainbow had suffered a violent malfunction. "Status report," Bill whispered, adjusting his tights which were two sizes too small and chafing with the intensity of a thousand suns. "Are the parents distracted?" "The host is blind," Molly mumbled, her voice muffled by the metal links digging into her cheekbone. "Sir Hops-a-Lot just ate a mime. The mime is still moving. The presents are visible. Thirty-six bags. A mountain of greed." Bill peered over her shoulder, clutching his velvet sack with white-knuckled desperation. "Mimes are Class-A distractions. This is smoother than the Barnabus job. Don't mention Barnabus to the feds." "Focus. The perimeter is tricky. The birthday father holds a flare gun and drinks tequila from a sippy cup." "Tequila dads are volatile. They might gift-wrap us in duct tape. What's the prize?" "Nintendo Switch. Extra controllers. Sitting alone on the sideboard like a lonely diamond." "Jackpot. The loot is secure and portable. How do we extract? The coat-hanger grappling hook is fraying." "No grappling hook. Plan Alpha. Sir Hops falls off the unicycle. You deploy the napkin dispenser smoke screen. I snatch the Switch and vanish into the rhododendrons." "Napkin dispenser? You want me to weaponize cellulose? Bill, you cry if you open a pickle jar." "It's tactical fluff! Chaos dust! It blinds their sinuses just like the popcorn incident!" "The popcorn incident resulted in a food coma and a lawsuit for assault with a condiment." "Semantics. We got the cash. What about the dog? The beagle sleeps in the center. If it wakes, we're toast. Literally. With marshmallows." "The castle vibration keeps the beast comatose. Window is four minutes twelve seconds. The clown's magic act ends in ninety ticks. Movement required." Bill checked his wrist, adorned with three pedometers and a glittery watch reading 'Party Animal' in Comic Sans. "Sir Hops is wobbling. The unicycle lists left. I'm initiating the bio-hazard aerosol event." "A sneeze? That's your cover? You're not going to sneeze on the velvet sack?" "Fear me, children! This sneeze has range and volume! It will clear the room!" "Do not sneeze on the Nintendo. Smudges devalue the resale market on the dark web. And if you sneeze on me, I'm docking your share." "No promises. My sinuses are keyed up. Ready?" Molly gripped the bent coat hanger and a plastic spoon, her knuckles white. "Ready. Let's make like trees and steal some toys." "Bad metaphor. The spirit is true. On three." "Bill, the clown is sliding. This is now." "Right. On three. One... Two..." Bill inhaled sharply, drawing a lungful of backyard air, his eyes widening to dinner-plate proportions, his face contorting into the mask of a sneeze destined to destroy all known civilizations. Molly counted down, her heart hammering against her ribs like a prisoner in a bouncy castle, her hand trembling not with fear but with the raw anticipation of plastic joy. "SNEEZE!" A confetti cannon hidden in Bill's sleeve detonated prematurely, blasting a thick cloud of pink ribbons and gold stars directly into Molly's face, blinding her instantly. "You idiot!" Molly shrieked, tugging frantically at her eyelashes as ribbons wrapped around her like a pink constrictor. "That was the distraction! You blew the element of surprise!" "Creative flair!" Bill yelled, backing away as a mother turned, confused by the sudden party explosion. "Go! Get the Switch! I'll negotiate with the beast!" Molly sprinted, tripping over a judging garden gnome, and slid across the damp grass like a baseball player stealing home, her hand outstretched toward the purple satin table and the glorious prize waiting just beyond her grasp, while behind her, Bill attempted to bribe a beagle with a piece of duct tape.
created_at
"2026-04-27T11:04:11.231Z"
4/27/2026, 11:04:11 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
656
moderation_latency
(null)
generation_time
51401
tokens_prompt
77
tokens_completion
10084
native_tokens_prompt
76
native_tokens_completion
10501
native_tokens_completion_images
(null)
native_tokens_reasoning
9552
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0157705
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-c21d4d50-07c1-96a8-8229-532493c0c028"
is_byok
false
latency
656
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777287851-FxbtRMKTFz2VFp4H7dI2"
session_id
(null)
api_type
"completions"
id
"gen-1777287851-rldtdBDEtwAM5jSsGpVN"
upstream_id
"chatcmpl-c21d4d50-07c1-96a8-8229-532493c0c028"
total_cost
0.0157705
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
648
11.82%
Dialogue to Total Word Ratio
Ratio: 56.53%, Deviation: 6.53%
neededClean
false
wordsTotal
658
wordsDialogue
372
5.9106%