NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ80TRB938CZDDSPZYJ6SC2
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
21.9s
Cost:
$0.000451
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
13.92%
Dialogue to Total Word Ratio
(details)
6.9601%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against a stack of pizza boxes in the dimly lit storage unit, twirling a lockpick like a baton. “Alright, Bill. The party starts in two hours. We’re crashing the Stevenses’ backyard bash, swapping out every gift with dummy versions, and vanishing before the piñata’s last stanza.” Bill adjusted his glasses, peering at a blueprint he’d printed from a suspiciously child-friendly website. “Dummy versions? You sure we can’t just, I don’t know, cart out the whole Amazon box and call it a day?” “Dummy versions,” Molly repeated, enunciating like he was a particularly dense toddler. “We don’t want the parents noticing their kids opened $5 plastic☞” “Hey, my sister’s wedding favor was $5 plastic!” “This isn’t a wedding. It’s a *children’s party*. The only thing they care about is whether the bouncy castle has a squirrel in it.” She tossed him a pair of sequined elf costumes. “You’re playing an elf. I’m Santa. Easy, right?” Bill held up the green tunic, which looked like it had been stained by existential dread. “Easy. What could go wrong? I mean, aside from the inherent wrongness of this entire enterprise? Or the fact that I’m allergic to glitter?” “Glitter’s a temporary setback. So’s prison. Now, here’s the plan: You distract the kids with your *charming, non-allergic* presence. I’ll sneak into the tent, switch the presents. Done.” Bill stared at a neon sign reading “DON’T GET CAUGHT” that he’d duct-taped to a cereal box. “Distract the kids? What if they ask me questions? Like, why I’m sweating through my hat? Or why I urinated in a plant pot three years ago?” Molly paused, considering. “If they ask about the plant pot, tell them it’s a holiday tradition. Works every time.” “Great. So I’m a walking urban legend. What if a parent sees us? What if someone recognizes Santa’s… *modest proportions*?” “I’m proportionally flawless. You, however, need to stop eating the ‘healthy’ snacks I bring.” She tossed him a mini drumstick. “Chew quietly. Also, if things go sideways, we’re leaving via the clown’s cart. Don’t question it.” Bill gaped. “You hired a clown?” “Borrowed. His name’s Mr.ubbles. He’s sleeping it off in the van. Now, practice your elf emoji.” Bill struck a pose, costume snagging on a rogue balloon. “Is this convincing enough for a six-yearer?” “Convincing enough for a six-yearer who’s been fed three scoops of confetti ice cream. Relax. Worst-case scenario, we’re stuck with a bunch of toddlers quoting *The Polar Express* at us.” Bill paled. “That *would* be the worst case. I can’t handle kids reciting my mom’s maiden name.” “Then don’t let them touch your phone. Also, remember: The replacement gifts are in the duffel. Toy frogs, bubble machines, that one weirdo action figure I found at the thrift store. Nothing too… *engaging*.” Bill furrowed his brow. “What’s the point of stealing toys no one wants?” “They’re not *stealing*. They’re… *rearranging*. For profit. Now, if you’re done philosophizing, we’ve got a party to crash.” Bill took a shuddering breath. “What if we get caught?” Molly handed him a fake mustache. “Then you’re a confused dad. I’m your *wife*. We brought cupcakes. Also, eat this mustache if you panic. It’s cinnamon.” “Why?” “In case you need to, uh, *incognito* into the dessert table. Now. Move. Or we’re Santa-ing without you.” Bill saluted, then tripped over a rogue teddy bear. “Lead the way, Your柑橘!” “*Your柑橘?*” “It’s Santa! I thought you liked Christmas!” She shoved him toward the door. “I *hate* Christmas.”
created_at
"2026-03-25T10:18:09.041Z"
3/25/2026, 10:18:09 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
108
moderation_latency
(null)
generation_time
21781
tokens_prompt
93
tokens_completion
1809
native_tokens_prompt
73
native_tokens_completion
1589
native_tokens_completion_images
(null)
native_tokens_reasoning
788
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00045076
router
(null)
provider_responses
0
endpoint_id
"6b8c829d-3094-45e7-8139-0a67e09060c3"
id
"cmpl-ac069a851d8944a4b392739dfb1f4767"
is_byok
false
latency
55
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774433889-TRJ4ClKetLlM4bbFvtoM"
upstream_id
"cmpl-ac069a851d8944a4b392739dfb1f4767"
total_cost
0.00045076
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
579
13.92%
Dialogue to Total Word Ratio
Ratio: 76.28%, Deviation: 6.28%
neededClean
false
wordsTotal
586
wordsDialogue
447
6.9601%