NC Bench
Run: 2026-03-25-01KMJ8GMBZKYXJ4AKCYN25HXBD
Run Details
Model: Qwen 3 32B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 21.1s
Cost: $0.000875
Provider: Alibaba
Evaluation summary
| Score | Evaluator |
| --- | --- |
| 0.00% | Matches word count |
| 48.61% | Dialogue to Total Word Ratio |
| 24.3051% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a stack of oversized building blocks, eyeing the bouncy castle like it was a bank vault. Bill, draped in a neon-green pirate costume with a parrot hat that looked like it had been assembled by a toddler, adjusted his monocle with a flourish. “Right, luv,” he whispered, “I’ve scoped it. The loot’s under the table where the cake is. Covered in a *tarp*. You know what that means.” “That means it’s probably just a tarp,” Molly said, peering through a plastic telescope (leftover from a previous job involving Renaissance fairs). “And if it *is* presents, they’re probably tiny and wrapped in non-stop rainbows. Stop using ‘loot’ like you’re in *Treasure Island*.” Bill puffed out his chest. “We’re professionals. And professionals use *themes*. Besides, if we get caught, I’m just a pirate collectin’ ‘loot’ for the cake table. Logical.” “Logical? You tried to ‘swoop in’ during a cupcake demonstration last week and triggered a smoke alarm. You don’t *swoop* in a preschool, Bill. You *blend*.” A loud giggle punctuated the conversation. From their hiding spot, they saw kids darting past the bouncy castle—a sea of glittery sneakers and dinosaur backpacks. A clown in polka-dots and a name tag (Bubbles, 100% Real) tossed confetti into the air, momentarily blinding two toddlers. Molly pulled a pair of jingle bells from her utility vest and clipped them to her wrist. “Plan’s simple. You become Bubbles. I’ll pose as the DJ. When the cake smash starts, you hit the tarp with my *special* piñata stick.” Bill saluted. “And pop out the presents like a giant Easter bunny who’s had one too many!” He brandished a neon rod covered in sticky notes labeled *“Do Not Wave This Around Like a Lightsaber.”* “You’ll *gently* puncture the tarp. I’ll cause a distraction with the music. Then we swap the gifts for the decoys in the bounce house.” Bill gasped. “Molly! That bounce house is fortified. 
There’s *streamers*—the modern-day equivalent of razor wire!” “They’re *strung with* decoys,” she hissed. “Now shut up before I replace your monocle with a noisemaker.” The clown—Bubbles—ambled over, grinning. “Hey, uh, bubbles?” a boy asked, tugging his leg. “Can I have another balloon?” Bill mimed vomiting into his hat. “That’s your cue. Break the DJ’s console. *Gracefully*.” “I’ll ‘accidentally’ drop my walkie-talkie,” Molly said, hurling a device at his boots. Bill yelped as it bounced off his shins. “And you get the crowd excited about the ‘magic balloon explosion’ so no one notices us swapping the stash.” “Magic balloon explosion? I’m in. I’ve *always* wanted to explode balloons.” Ten minutes later, Molly stood at the DJ booth, fingers trembling. This wasn’t her element. She’d robbed art galleries, not… whatever this was. A kazoo cover of “Twinkle, Twinkle” played over the speakers. Molly mashed buttons until the music shifted to a thumping remix of “Hey, ho, let’s go!” The kids erupted into a frenzy. Meanwhile, Bill waddled toward the cake table as Bubbles, waving a glitter bomb disguised as a “birthday siren.” “And now, mates,” he boomed, voice cracking midway, “I’ll show ye the *proper* way to celebr—*ahem*—rate!” He dropped the bomb. A burst of rainbow mist engulfed the room, and somewhere a parent yelled, “Is that food-safe glitter?!” Molly slipped off her DJ headphones, revealing stolen presents stacked like bricks. “Move, move, *move*,” she said, shoving boxes into Bill’s arms. Bill dove into the bouncy castle, which immediately deflated under the weight of his panic. “IT’S TOO BOUNCY,” he screeched as he launched into a ceiling. “Land on the trampoline!” Molly shouted. Bill did, then slid across it like a penguin on an ice rink. Security, a lanky man with a lollipop necklace, lumbered toward them. “Uh, guys?” he said. “The kid’s dad’s a lawyer. And the piñata’s… full of kittens.” Molly yanked Bill into a cake-decorated escape van. 
“Kittens,” she deadpanned. “Aww, you’ve *got* to be kitten me,” Bill said, cradling the stolen toys. “No, but I’ve got these,” she said, tossing him a box. Inside, a dozen tiny gold watches read, “World’s Best Baby Collector.” They sped off as the sun set, the van filled with candy wrappers and Bill’s parrot hat. “This was a success, innit?” Bill asked, nibbling a gummy worm. Molly slumped. “We’ll be rich by… 3 PM. Once we outrun the parents chasing us in Ugg boots.” “Speak for yourself,” Bill muttered, fiddling with a stolen plush crocodile. “My contingency plan is this here. Name’s Kevin.” Molly sighed. “Next job, we hit a teen’s party. Less glitter. More electronics. And *fewer* sentient balloons.” Bill tossed Kevin the crocodile at her head. It hit the dashboard. “Hey! Santa’s leaving *notes* now!” Outside, the parents on the chase realized the “magic glitter explosion” had glued their hair to their heads.
created_at: "2026-03-25T10:26:46.788Z"
model: "qwen/qwen3-32b-04-28"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 381
moderation_latency: (null)
generation_time: 20711
tokens_prompt: 93
tokens_completion: 2348
native_tokens_prompt: 72
native_tokens_completion: 2086
native_tokens_completion_images: (null)
native_tokens_reasoning: 825
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.000875264
router: (null)
provider_responses: 0
endpoint_id: "344ccdcc-6e59-464d-89ab-8e5b92fe2adc"
id: "chatcmpl-b06c5ab2-317d-94cd-8d08-3e3713a83790"
is_byok: false
latency: 381
model_permaslug: "qwen/qwen3-32b-04-28"
provider_name: "Alibaba"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1774434406-tkg2ZOvJV4E1zhcmDlue"
upstream_id: "chatcmpl-b06c5ab2-317d-94cd-8d08-3e3713a83790"
total_cost: 0.000875264
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
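The timing fields in the metadata appear consistent with the 21.1s Duration shown in Run Details. The sketch below checks that arithmetic. It assumes that latency and generation_time are both in milliseconds and that Duration is their sum; the page does not state either, so treat both as assumptions. The tokens-per-second figure is a derived illustration, not a value the page reports.

```python
# Values copied from the run metadata above.
latency_ms = 381                  # "latency"
generation_time_ms = 20711        # "generation_time"
native_tokens_completion = 2086   # "native_tokens_completion"

# Assumption: the reported Duration is latency + generation time, in ms.
total_seconds = (latency_ms + generation_time_ms) / 1000

# Derived illustration: completion tokens per second of generation time.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"Duration: {total_seconds:.1f}s")
print(f"Throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions the summed duration rounds to the 21.1s displayed for the run.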
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.00% | Matches word count | n/a | neededClean: false, words: 796 |
| 48.61% | Dialogue to Total Word Ratio | Ratio: 46.20%, Deviation: 3.80% | neededClean: false, wordsTotal: 803, wordsDialogue: 371 |
| 24.3051% | Overall | | |
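The figures in the evaluation details can be reproduced from the reported word counts. The following is a minimal sketch, not the benchmark's actual evaluator code: the function names are hypothetical, the 50% target comes from the scenario description, and treating the overall score as the plain mean of the two evaluator scores is an assumption that happens to match the reported number. How the 48.61% evaluator score is derived from the 3.80% deviation is not stated on the page, so it is not modelled here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Percentage of all words that fall inside dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 50.0) -> float:
    """Absolute distance, in percentage points, from the target ratio."""
    return abs(ratio_pct - target_pct)


def overall_score(scores_pct: list[float]) -> float:
    """Assumption: the overall score is the unweighted mean of evaluator scores."""
    return sum(scores_pct) / len(scores_pct)


# Values copied from the evaluation details (wordsDialogue, wordsTotal, scores).
ratio = dialogue_ratio(371, 803)
deviation = deviation_from_target(ratio)
overall = overall_score([0.00, 48.61])

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Overall (mean of rounded scores): {overall:.3f}%")
```

The mean of the two rounded evaluator scores lands within rounding distance of the reported 24.3051%, which suggests the page averages the unrounded scores.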