NC Bench
Run: 2026-03-25-01KMJ8H3J6VA5Y9J45ER8XV7TD
Run Details
Model: Qwen 3 32B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 15.7s
Cost: $0.000352
Provider: DeepInfra
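The 15.7s Duration lines up with the metadata recorded for this run: latency (109) plus generation_time (15588) gives 15697, or about 15.7 s if both fields are in milliseconds. The unit is an assumption; this page does not state it. Under that assumption, and assuming native_tokens_completion (1236) already includes the 419 reasoning tokens, a rough Python sketch of two derived figures follows. The function names are hypothetical.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Output throughput, assuming generation_time is in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000)


def reasoning_share(reasoning_tokens: int, completion_tokens: int) -> float:
    """Fraction of completion tokens spent on reasoning (assumes they are included)."""
    return reasoning_tokens / completion_tokens


# Values copied from this run's metadata.
print(round(tokens_per_second(1236, 15588), 1))  # about 79.3 tokens per second
print(round(reasoning_share(419, 1236), 3))      # about 0.339
print((109 + 15588) / 1000)                      # 15.697 s, shown as 15.7s
```

If reasoning tokens are billed and counted separately by the provider, the share figure would need a different denominator.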
Evaluation summary
Score      Evaluator
0.00%      Matches word count
99.62%     Dialogue to Total Word Ratio
49.8110%   Overall
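The Overall figure is consistent with an unweighted mean of the two evaluator scores: (0.00 + 99.62) / 2 = 49.81. A minimal Python sketch of that reading follows. The aggregation rule is inferred from the numbers, not stated on this page, and overall_score is a hypothetical name.

```python
from statistics import mean


def overall_score(scores: list[float]) -> float:
    """Unweighted mean of evaluator scores in percent (assumed aggregation)."""
    return mean(scores)


# Scores as displayed in the evaluation summary.
print(round(overall_score([0.00, 99.62]), 2))
```

The extra digits in 49.8110% suggest the page averages unrounded scores, so a result computed from the displayed values may differ in later decimals.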
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her wire-rimmed glasses and scribbled equations on a napkin. “The key to this heist isn’t stealth, Bill. It’s *chemistry*.” She tapped the napkin. “We’ll need 3.2 liters of glow-in-the-dark slime, a motorized piñata, and a whoopee cushion calibrated to mimic a dying platypus.” Bill, holding a stolen party store map upside down, squinted. “Why a platypus?” “Because the cake is a *surprise*. We need distraction. Also, we’re 90% sure the venue’s owned by a guy named ‘Bubba’ who collects taxidermied wombats. Don’t ask.” She gestured to her outfit—a glittery unitard covered in holographic butterflies. “I’m the magician’s assistant. You’re… the magician.” Bill poked his head out of a neon-green tuxedo coat. “I look like a bag of Skittles exploded on a zebra. And what’s the plan again? Steal presents, or steal the *presents*?” “*Steal the presents*, genius. The parents are handing out a hundred gift baskets— оргiastic haul of stuffed animals, tech toys, and those tiny bottles of soda that kids never get to open.” She paused. “We pretend to be part of the entertainment. You ‘perform’ while I disarm the security system behind the bounce house.” Bill mock-whispered. “But the bounce house is *inflated*. How do you ‘disarm’ airflow?” “Thermal tripwires. Motion sensors. And *please* don’t bounce. Just stand there and look mysterious.” They crouched behind a row of plastic alligators near the birthday castle—balloons swaying, children’s laughter bouncing off bounce house walls. Bill adjusted a monocle he’d “borrowed” from Molly’s costume kit. “What’s the signal if things go sideways?” “The platypus sound.” Bill mimed a gasping sob. “Perfect. If I emit a dying platypus noise, you trigger the slime?” “Exactly. Now stay focused. The birthday kid’s name is Liam. He’s allergic to squirrels, so if you see a man in a squirrel suit, run.” A djembe drumroll erupted. Bill straightened, adjusted his monocle, and strode toward the stage, tripping over a balloon snake. 
He face-planted into a table of cupcake toppings. Molly hissed, “Bill! You’re supposed to be charming danger!” Bill, frosting in his hair, waved a spatula like a sword. “This is the *preliminary heist phase*! Wait for the dying platypus!” A toddler tugged his leg. “Are you the magician? Can you make my goldfish bigger?” Bill’s eyes widened. He fumbled a deck of cards, accidentally setting off the whoopee cushion rigged to his belt. The room fell silent. Molly, fumbling with the bounce house’s wires, whispered, “Bill, *do something*!” Bill pulled a鸽子 from his sleeve—then realized he’d packed pigeons and platypus inflatables by mistake. The bird flew into a piñata, which swung into a cake decorator’s face. “CODE PLATYPUS!” Molly yelled. Bill hit the slime dispenser. A tidal wave of glowing goo splattered across the room. Kids shrieked with delight. Parents screamed in horror. “GO!” Molly hissed, yanking a duffel bag labeled “PRIZES.” Bill waded through slime, tripped over a wombat statue, and vanished into the chaos. Molly vaulted over a table, snatching a basket of toys mid-air. As they bolted, she muttered, “Next time, *you* handle the math. I’m getting a headache from your physics.” Bill, now half-submerged in a pool, called back, “But the platypus was *on point*!” Molly sighed. Somewhere, a whoopee cushion honked.
created_at: "2026-03-25T10:27:02.348Z" (3/25/2026, 10:27:02 AM)
model: "qwen/qwen3-32b-04-28"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 109
moderation_latency: (null)
generation_time: 15588
tokens_prompt: 93
tokens_completion: 1372
native_tokens_prompt: 73
native_tokens_completion: 1236
native_tokens_completion_images: (null)
native_tokens_reasoning: 419
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00035192
router: (null)
provider_responses:
  0:
    endpoint_id: "6b8c829d-3094-45e7-8139-0a67e09060c3"
    id: "cmpl-a01cfc64f8b24d1c827dfae4cee5c9a9"
    is_byok: false
    latency: 47
    model_permaslug: "qwen/qwen3-32b-04-28"
    provider_name: "DeepInfra"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1774434422-I4ljUXVQMZGt3fameKFZ"
upstream_id: "cmpl-a01cfc64f8b24d1c827dfae4cee5c9a9"
total_cost: 0.00035192
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepInfra"
Evaluation details
Result: 0.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean = false; words = 536

Result: 99.62%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 49.72%, Deviation: 0.28%
Meta Data: neededClean = false; wordsTotal = 545; wordsDialogue = 271

Overall: 49.8110%
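The reported ratio follows from the metadata: 271 / 545 = 49.72%, a deviation of 0.28 percentage points from the 50% target. To illustrate how such a count might be produced, the Python sketch below treats whitespace-separated tokens as words and counts those inside curly double quotes as dialogue. This is an assumed method, not the evaluator's actual implementation, and dialogue_ratio is a hypothetical name. How the 99.62% score is derived from the deviation is not shown on this page, so the sketch stops at the ratio.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue words, total words, ratio) for text using curly quotes."""
    words_total = len(text.split())
    # Each match is the content between an opening and a closing curly quote.
    spoken = re.findall(r"“([^”]*)”", text)
    words_dialogue = sum(len(segment.split()) for segment in spoken)
    ratio = words_dialogue / words_total if words_total else 0.0
    return words_dialogue, words_total, ratio


sample = "“Go now,” said Bill. “Fine.”"
print(dialogue_ratio(sample))

# Check of the figures reported for this run.
ratio_pct = 271 / 545 * 100
print(round(ratio_pct, 2), round(abs(50 - ratio_pct), 2))
```

The two evaluators report different totals for the same response (536 and 545), which suggests they tokenize differently; a whitespace split is only one possible choice.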