NC Bench
Run: 2026-02-16-01KHM62G8G4CVXEZTVQ8MKMR2A
Run Details
Model: DeepSeek V3 (2025-03-24)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 18.6s
Cost: $0.000721
Provider: AtlasCloud
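The duration reported here, together with the token counts recorded in this run's metadata, allows a rough throughput estimate. The following is a minimal sketch, not part of the benchmark itself. It assumes that `generation_time` (18602) is in milliseconds, which agrees with the 18.6s duration, and it uses the `native_tokens_completion` value (804) rather than `tokens_completion` (833). The function name `tokens_per_second` is hypothetical.

```python
# Values copied from this run's metadata.
generation_time_ms = 18602        # assumed to be milliseconds (matches Duration: 18.6s)
native_tokens_completion = 804    # provider-native completion token count


def tokens_per_second(tokens: int, elapsed_ms: float) -> float:
    """Return average generation throughput in tokens per second."""
    return tokens / (elapsed_ms / 1000.0)


tps = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"{tps:.1f} tokens/s")  # prints "43.2 tokens/s"
```

Using the normalized `tokens_completion` count instead would give a slightly higher figure, so the choice of counter should be stated when comparing runs.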
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
**The Great Party Heist**

Molly adjusted the oversized clown nose she’d crammed onto her face while Bill struggled to tie a balloon animal that looked more like a mutated sausage. They crouched behind a hedge outside the brightly decorated Henderson house, where the sounds of children screaming and a tinny rendition of *Happy Birthday* blared from a cheap speaker.

"This is stupid," Bill muttered, wrestling with the balloon until it popped in his face. "We’re gonna rob a *kid’s* party?"

Molly shot him a glare. "Not just any kid—*Tiffany Henderson’s* kid. Her dad owns half the city. You think those presents under the sparkly *Happy 6th Birthday* banner are dollar-store junk? That’s primo loot, Bill."

"I still feel bad," Bill grumbled, rubbing his ear where the balloon had startled him.

"Nobody’s making you steal the kid’s favorite teddy bear, Bill. Just the expensive stuff." Molly pulled out a crumpled map of the backyard drawn in crayon. "Okay, here’s the plan: We go in disguised as party entertainers—"

"Clowns? Really?"

"You got a better idea, genius?"

"Yeah. Not being clowns."

"Too late for that," Molly said, shoving a rainbow wig onto his head. "We mingle, pass out candy, then sneak the presents to the ‘clown car’—which is *your* car, by the way. I put a sign on it."

Bill squinted toward the driveway where his rusty sedan now bore a slapped-on sign: *SMILES & GIGGLES ENTERTAINMENT*. "We’re gonna get caught."

"No, *you’re* gonna get us caught with that attitude." Molly straightened her polka-dot suspenders. "Remember: distraction is key. Once I start the *Big Magic Show*, you grab the loot. Simple."

Bill sighed. "What’s the *Big Magic Show*?"

"The sprinklers."

Bill groaned.

Inside the party, squealing children ran between sugar-fueled chaos as parents sipped lukewarm punch, looking exhausted. Molly bounded in with exaggerated enthusiasm. "HELLO, LITTLE FRIENDS! WHO WANTS TO SEE A *MAGIC TRICK*?"

The kids shrieked, circling her like tiny, feral wolves.

Bill shuffled behind, eyeing the glittering pile of presents.

Molly whipped out a cheap plastic wand. "BEHOLD! I SHALL MAKE… *WATER APPEAR FROM THE SKY!*" She flipped a hidden switch on her belt.

The sprinklers erupted.

Screams. Absolute pandemonium. Bill seized the moment, shoving presents into a sack while parents scrambled to rescue soggy cupcakes.

A small boy tugged at Bill’s sleeve. "Are you a bad clown?"

"Uh. No?" Bill said, sweating.

"You look like a bad clown."

Bill turned just as a furious mother in dripping mascara zeroed in on him. "Who *are* you?"

Molly grabbed Bill’s arm. "ABRACADABRA—*BYE!*"

They bolted for the "clown car," arms full of loot, balloons trailing behind them like a ridiculous getaway flag.

Bill wheezed as they peeled out. "We left the maps! The crayon maps!"

Molly cackled. "Relax, Bill. Those were *decoys*."

Bill stared. "You drew *fake* crayon maps?"

Molly grinned. "Welcome to the big leagues, partner."

The sedan sputtered away, leaving only the faint echo of a deflating balloon behind.
created_at: "2026-02-16T21:35:49.132809+00:00" (2/16/2026, 9:35:49 PM)
model: "deepseek/deepseek-chat-v3-0324"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 732
moderation_latency: (null)
generation_time: 18602
tokens_prompt: 77
tokens_completion: 833
native_tokens_prompt: 67
native_tokens_completion: 804
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.000721344
router: (null)
provider_responses:
  0:
    id: "b556cc9d2fd045d29ceb6a83560d3dc0"
    status: 200
    is_byok: false
    latency: 732
    endpoint_id: "5bb598f5-fd97-4f8b-91e4-ceab3743a03b"
    provider_name: "AtlasCloud"
    model_permaslug: "deepseek/deepseek-chat-v3-0324"
api_type: "completions"
id: "gen-1771277730-2vvY2MEbixaAhqd5Aab3"
upstream_id: "b556cc9d2fd045d29ceb6a83560d3dc0"
total_cost: 0.000721344
cache_discount: 6.48e-7
upstream_inference_cost: 0
provider_name: "AtlasCloud"
Evaluation details

Evaluator: Matches word count
Result: 97.6276%
Details: n/a
Meta Data: neededClean: false; words: 493

Evaluator: Dialogue to Total Word Ratio
Result: 0.0758%
Details: Ratio: 41.99%, Deviation: 11.99%
Meta Data: neededClean: false; wordsTotal: 493; wordsDialogue: 207

48.8517%
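
The ratio and deviation reported by the dialogue evaluator can be reproduced from the recorded counts (207 dialogue words out of 493, against the 30% target in the scenario). The sketch below is an illustration only, not the benchmark's actual evaluator. The helper names are hypothetical, the quote-matching rule is a naive assumption, and the functions that map a deviation to the 97.6276% and 0.0758% results are not shown on this page, so they are not modeled here. The final figure of 48.8517% appears consistent with an unweighted mean of the two results, which the last lines check.

```python
import re


def dialogue_word_count(text: str) -> int:
    """Count whitespace-separated words inside straight double quotes.

    Naive assumption: every quoted span is dialogue. A scare-quoted phrase
    in narration (such as "clown car,") would be counted as well.
    """
    spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int, target_pct: float):
    """Return (dialogue ratio in percent, absolute deviation from target)."""
    ratio_pct = 100.0 * words_dialogue / words_total
    return ratio_pct, abs(ratio_pct - target_pct)


# Counts taken from the evaluator's Meta Data for this run.
ratio_pct, deviation = ratio_and_deviation(207, 493, 30.0)
print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation:.2f}%")
# prints "Ratio: 41.99%, Deviation: 11.99%"

# Assumption: the final figure is the plain average of the two results.
mean_score = (97.6276 + 0.0758) / 2
print(f"{mean_score:.4f}%")  # prints "48.8517%"
```

For example, `dialogue_word_count('"Hi there," she said. "Go."')` returns 3, since only the quoted words are counted.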