NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5ZXTF08MKHWX5BEAM1M97
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
25.8s
Cost:
$0.000652
Provider:
DeepInfra
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dimly lit basement of Molly’s apartment smelled faintly of mildew and stale coffee. A single bulb swung above the cluttered table, casting erratic shadows on the blueprints spread out before them. Molly leaned over the map of the neighborhood park, her finger tracing the path from the parking lot to the party pavilion. Bill sat across from her, twirling a pen between his fingers and wearing a grin that suggested he’d just eaten something he shouldn’t have. “So,” Molly began, her voice low and serious, “we hit them during the piñata. Everyone’s distracted, kids screaming, parents arguing over whose turn it is to swing the bat. We slip in, grab the presents, and vanish.” Bill snorted, leaning back in his chair. “You make it sound so easy. What if someone sees us? What if little Timmy decides to wander off and catches us red-handed with his Paw Patrol Mega Mixer?” Molly rolled her eyes. “Timmy’s six. We’ll distract him with a candy bar or something. Kids are easy.” She tapped the blueprint emphatically. “The real problem is the cake. It’s right next to the presents. Someone’s bound to notice if we’re elbow-deep in gifts while the frosting’s still moist.” Bill scratched his chin, staring at the ceiling as if it held the answers. “What if we dress up? You know, blend in. I could be a clown. You could be… I don’t know, a balloon animal maker or something.” Molly shot him a withering look. “Bill, you’re terrified of clowns. Last time we drove past a circus, you locked the car doors and started reciting the Lord’s Prayer.” “That’s irrelevant,” Bill said, waving a hand dismissively. “Besides, it’s the perfect disguise. No one questions a clown at a kids’ party. It’s like a free pass to do whatever you want. Plus, I could juggle or something. Keep ’em entertained while you grab the loot.” Molly sighed, pinching the bridge of her nose. “Fine. You can be the clown. But if you start hyperventilating in the middle of the job, I’m leaving you behind.” Bill grinned triumphantly. “Deal. Now, about transportation. We gonna use your car?” “My car’s got a ‘Check Engine’ light that’s been on since the Bush administration,” Molly muttered. “No, we’ll use yours. It’s less conspicuous.” “Mine?” Bill protested. “It’s bright green and smells like burnt popcorn!” “Exactly,” Molly said with a smirk. “No one’s gonna suspect a car that ridiculous.” She folded the blueprint and slid it into her bag. “We hit at noon sharp. Meet me here at 11. And for the love of all things holy, don’t forget your clown shoes.” Bill saluted mockingly. “You got it, boss.” He paused, then added, “What if we get caught?” Molly stood, slinging her bag over her shoulder. “Then we improvise. Worst case scenario, we blame it on the clown.” She shot him a mischievous grin. “Ready to ruin some kids’ birthday?” Bill chuckled, shaking his head. “Ready as I’ll ever be.”
created_at
"2026-02-16T21:34:32.102024+00:00"
2/16/2026, 9:34:32 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
116
moderation_latency
(null)
generation_time
25721
tokens_prompt
77
tokens_completion
799
native_tokens_prompt
66
native_tokens_completion
709
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00065213
router
(null)
provider_responses
0
id
"chatcmpl-RRsuQM1b3SOuhYDvYgTblSVB"
status
200
is_byok
false
latency
18
endpoint_id
"5294d55f-9012-496b-8f22-8cc919432dcd"
provider_name
"DeepInfra"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277645-S4IJM0prw3FM8iwcCgof"
upstream_id
"chatcmpl-RRsuQM1b3SOuhYDvYgTblSVB"
total_cost
0.00065213
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9190%
Matches word count
n/a
neededClean
false
words
497
0.0000%
Dialogue to Total Word Ratio
Ratio: 63.18%, Deviation: 33.18%
neededClean
false
wordsTotal
497
wordsDialogue
314
49.9595%