NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5X6GX3T2Y335J9S4FEFBB
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
26.0s
Cost:
$0.000533
Provider:
ModelRun
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse basement smelled of mildew and old cardboard, but Molly and Bill hardly noticed. They were hunched over a hand-drawn blueprint of Maplewood Estates, their latest—and strangest—target. "Lemme get this straight," Bill said, scratching his stubble. "We’re robbing a *six-year-old’s birthday party*?" Molly grinned, tossing a glittery party hat onto the rickety table. "Not just any six-year-old. Timmy Wellington III. His dad owns half the banks in the city." "Yeah, but—*presents*?" Bill flung his arms wide. "Last job was a diamond exchange!" "Mood rings and action figures ain’t our usual haul, sure," Molly admitted. "But think about it—kid that rich? His friends are bringing *gold-plated* LEGOs. Plus, it's a *soft* target. Cake coma, zero security, just a clown who smells like regret." Bill sighed. "Alright, fine. What’s the play?" Molly tapped the blueprint. "We go in disguised. You’re ‘Un uch’ Ch’les”—she coughed—“*Uncle Charles*, distant relative from outta town." "You want me to fake an accent?" Bill scoffed. "What accent? Just mumble and pat kids on the head awkwardly. Kid’ll buy anything if you’re holding cotton candy." Bill narrowed his eyes. "And you?" Molly twirled a pink feather boa around her neck. "I’m your lovely wife, *Barbara*. I’ll distract the parents with tragic stories about our fictional cat’s gluten allergy while you case the gift table." Bill groaned. "Any shot we skip the acting and just grab-and-go?" "Too risky. Security’s light, but witnesses are everywhere. Kids are like tiny drunk people—unpredictable and loud." Bill rubbed his temples. "Alright, fine. But what’s the exit?" Molly folded the blueprint with a flourish. "Piñata chaos. I’ll ‘accidentally’ knock over the cake right before presents. You grab the loot during the meltdown, stash it in a gift bag marked ‘TIM,’ and meet me by the bounce house." "And if someone notices?" "Please. Parents are either on their third mimosa or praying for bedtime. We’ll be in Fiji before they realize Timmy got two identical scooters." Bill stared at her for a long moment, then shrugged. "Worst comes to worst, I *can* outrun a six-year-old." Molly clapped him on the back. "That’s the spirit! Now c’mon—we gotta practice your ‘uncleMPa smile.’ It’s creepier than I remembered." Bill bared his teeth. "How’s this?" Molly winced. "…Maybe stick to the mThreshold." Outside, a car backfired, sending them both diving for cover. Bill peeked over the table. "We’re gonna get *destroyed* by actual criminals one day." Molly grinned. "Yeah, but not today."
created_at
"2026-02-16T21:33:03.112835+00:00"
2/16/2026, 9:33:03 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1240
moderation_latency
(null)
generation_time
26015
tokens_prompt
77
tokens_completion
721
native_tokens_prompt
66
native_tokens_completion
675
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
17
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00053295
router
(null)
provider_responses
0
id
"chatcmpl-e5f688657eaa490093d37788adea6331"
status
200
is_byok
false
latency
90
endpoint_id
"a3938710-1b99-4a1f-85f4-2494f3f659d3"
provider_name
"ModelRun"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277556-7D9SDku5JrWd8WrLojmx"
upstream_id
"chatcmpl-e5f688657eaa490093d37788adea6331"
total_cost
0.00053295
cache_discount
(null)
upstream_inference_cost
0
provider_name
"ModelRun"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
415
98.2019%
Dialogue to Total Word Ratio
Ratio: 69.40%, Deviation: 0.60%
neededClean
false
wordsTotal
415
wordsDialogue
288
49.1009%