NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ7T22JQ3BN01BTN18HK1W7
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
10.5s
Cost:
$0.000442
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
13.00%
Dialogue to Total Word Ratio
(details)
6.5009%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a stack of deflated bounce houses, adjusting her neon tiara. "Bill," she whispered, "the cupcakes are laced with glitter. *Glitter.*" Bill peered over the edge of a helium tank. "So we just avoid the cupcakes, right? And the cake? And the glitter?" "Bill," Molly said, gripping his collar, "this isn’t a *normal* party! They’ve got motion sensors on the piñatas. And a kid named ‘Detective’ who carries a magnifying glass." Bill adjusted his trench coat. "No worries. I’ve got a plan. You see, the presents are stacked in a gift tower—30 feet high, laser-guarded by rainbow-colored strings. But the lasers are powered by the clown’s walkie-talkie. So we just fake a balloon animal emergency. He’ll sprint to the main stage, radio, *boom.*" Molly raised an eyebrow. "And what makes you the expert on clown logistics?" "I once robbed a circus," Bill said, straight-faced. "Didn’t get caught. But I didn’t count on the *camel* with a Sudoku addiction." "Focus," Molly snapped, gesturing to the bounce houses. "The guests are 60% cupcakes, 40% hyperactive. We need stealth. Sneak in during ‘Pass the Hashimoto’s Disease Song’ or whatever they’re playing." Bill pulled out a gadget resembling a glitter cannon. "What if we use my ‘Confetti Decoy 3000’? Fires cinnamon rolls filled with smoke powder!" "*Smoke* powder?" Molly hissed. "We’re in a room with a *sprinkler system*! One spark and we’ll drown in vanilla-scented water!" "Okay, fine," Bill muttered. "What’s *your* idea?" Molly tapped her temple. "The DJ’s playlist. I’ll pose as a ‘Birthday Fairy’ and swap the gift tower’s base with my cake-dispensing briefcase. The lasers’ll short-circuit. You’ll gather the loot while the kids sing ‘Happy Birthday’ at me." Bill saluted. "Brilliant. I’ll be the ‘Ballooner of Secrets’ and hide in the—uh—that giant cupcake floatie." "Floatie?" "Relax! The escape tunnels are lined with balloon animals. They’re *quiet*." Molly sighed. "Just don’t get hit by the piñata lasers. And for God’s sake, stop eating party favor candy before we go. Your plan to ‘blend in’ ended with you in a glitter bomb." Bill popped a gummy worm in his mouth. "What’s the code phrase to trigger the decoy?" "‘Surprise!’" Molly said, rolling her eyes. "It’s a **children’s** party, Bill. We’re stealing *fuzzy dice* and *overpriced* socks." "Then I call dibs on the socks," Bill said, already preening. Molly’s eye twitched. "Let’s just hope the ‘world’s toughest 8-year-old’ isn’t your future boss."
created_at
"2026-03-25T10:14:27.158Z"
3/25/2026, 10:14:27 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
340
moderation_latency
(null)
generation_time
10436
tokens_prompt
93
tokens_completion
1142
native_tokens_prompt
72
native_tokens_completion
1044
native_tokens_completion_images
(null)
native_tokens_reasoning
365
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000441792
router
(null)
provider_responses
0
endpoint_id
"344ccdcc-6e59-464d-89ab-8e5b92fe2adc"
id
"chatcmpl-67d9d67e-4ff4-92f9-89df-c84c6591a8f8"
is_byok
false
latency
340
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774433667-iGxegyTZgysFsa2gTmJH"
upstream_id
"chatcmpl-67d9d67e-4ff4-92f9-89df-c84c6591a8f8"
total_cost
0.000441792
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
403
13.00%
Dialogue to Total Word Ratio
Ratio: 78.05%, Deviation: 11.95%
neededClean
false
wordsTotal
410
wordsDialogue
320
6.5009%