NC Bench
Run: 2026-02-16-01KHM5F8JEX0VYQH8M86CTVMGF
Run Details
Model: DeepSeek V3.2
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 2m 17s
Cost: $0.000392
Provider: Parasail
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air in the dank basement was thick with the smell of old coffee and desperation. Blueprints of a suburban McMansion were spread across a rickety table, but these weren’t the usual security schematics. They were a child’s crayon drawing of “Princess Sparklehoof’s Palace,” meticulously annotated by Molly in red pen.
Bill squinted at it, his beefy finger tapping a wobbly turret. “I still don’t get it. The take is, what? Squishy-brained dolls and plastic junk?”
“It’s not junk, it’s leverage,” Molly hissed, her pencil-thin frame coiled like a spring. “Little Ava Hendricks is turning six. Her daddy is Clayton ‘The Claw’ Hendricks. You know, the guy who ‘imports’ vintage sports cars? He’s got a vault, Bill. A real one. Behind a painting of a sad clown in his study.”
“Okay, so we case the clown. Why the tea party?”
Molly sighed, the long-suffering sound of a genius tethered to an anchor. “Because on a normal day, that house has more silent alarms than a museum. Today, it will have twenty sugar-crazed kindergarteners, a petting zoo in the backyard, and a magician named Zoltan. Chaos is a ladder, Bill. A sticky, glitter-covered ladder.”
Bill scratched his stubble, a slow grin spreading. “So we use the party as cover. Sneak in, bypass the clown, clean out the vault.”
“No! We *are* the party,” Molly corrected, holding up two garish polo shirts. “We’re ‘Cheery Chuckles,’ the backup party entertainers. My research indicates Zoltan has a dubious relationship with both punctuality and sobriety. We arrive ‘just in case,’ looking official. We blend. I’ll do balloon animals.”
“I’m not doing balloon animals.”
“You’ll be my assistant, ‘Bumbling Billy.’ Your job is to be conspicuously clumsy. Spill the punch on the security panel by the kitchen. Create a diversion involving the pony and the magician’s rabbit. Keep all adult eyes on you.”
Bill puffed out his chest. “A distraction gig. I can do that. But how do we get the presents? They’re not in the vault.”
“The gifts are phase one,” Molly said, her eyes gleaming. “We load the ‘Cheery Chuckles’ gift box into our van. Standard procedure, everyone will think. Inside will be every overpriced toy from that pile. But we also leave behind a listening device in a specific present—the one from Aunt Margaret, which is always a hideous, oversized teddy bear. We hear where The Claw stashes the real goods after the party, when he thinks the coast is clear.”
Bill leaned back, a grudging respect in his eyes. “So the kids’ loot is just…practice? And bait?”
“Exactly. A test run for the big score. And if we get caught, we’re just incompetent party clowns who accidentally took the gift box. The worst they do is make us do the chicken dance.”
“I’m not doing the chicken dance.”
“You will if it keeps us out of prison!” Molly snapped. She folded the crayon blueprint with military precision. “Now, the hard part. You have to learn to make a poodle. A simple, three-balloon poodle.”
Bill stared at the long, twisting balloon she thrust into his hands as if it were a live serpent. “This is beneath me, Moll. I once cracked a time-lock in Belgrade.”
“And in Belgrade, did you have to avoid a six-year-old in a tiara demanding a unicorn while her father’s armed guards ate cake? This is the big leagues. Now twist. And try not to pop it. The sound will startle the pony.”
created_at: "2026-02-16T21:27:17.356496+00:00" (2/16/2026, 9:27:17 PM)
model: "deepseek/deepseek-v3.2-20251201"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 778
moderation_latency: (null)
generation_time: 137094
tokens_prompt: 77
tokens_completion: 915
native_tokens_prompt: 67
native_tokens_completion: 830
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00039226
router: (null)
provider_responses
0
id: "d7519249e5f04a5091757ae0d6ac8d87"
status: 200
is_byok: false
latency: 778
endpoint_id: "4c18a672-9069-469d-9e6b-8af2300fe7b7"
provider_name: "Parasail"
model_permaslug: "deepseek/deepseek-v3.2-20251201"
api_type: "completions"
id: "gen-1771277099-JWtQQ5OpR8cPeyCyJ0h2"
upstream_id: "d7519249e5f04a5091757ae0d6ac8d87"
total_cost: 0.00039226
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Parasail"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false, words: 584
0.0000% | Dialogue to Total Word Ratio | Ratio: 75.00%, Deviation: 25.00% | neededClean: false, wordsTotal: 584, wordsDialogue: 438
0.0000%
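The reported ratio and deviation follow directly from the word counts: 438 dialogue words out of 584 total is 75%, which is 25 points away from the requested 50%. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. It is an assumption, not the benchmark's actual evaluator: it treats any text inside double quotation marks (curly or straight) as dialogue and splits words on whitespace, and every function name is hypothetical. How the deviation is mapped to the 0.0000% score is not shown on this page and is not modeled here.

```python
import re

# Assumption: dialogue is any span enclosed in curly or straight double quotes.
DIALOGUE_RE = re.compile(r'“([^”]*)”|"([^"]*)"')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(scene: str) -> float:
    """Fraction of the scene's words that fall inside double quotes."""
    total = count_words(scene)
    if total == 0:
        return 0.0
    spoken = sum(count_words(curly or straight)
                 for curly, straight in DIALOGUE_RE.findall(scene))
    return spoken / total


def ratio_from_counts(words_dialogue: int, words_total: int) -> float:
    """Ratio computed from precomputed counts such as wordsDialogue/wordsTotal."""
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the observed ratio and the requested one."""
    return abs(ratio - target)


ratio = ratio_from_counts(438, 584)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation_from_target(ratio):.2%}")
```

A quote-based counter like this would also count quoted non-speech, such as the name of the crayon drawing in the opening paragraph, so its result on the scene above may not match the evaluator's 438 exactly.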