NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5Y6WE7DB6CQ62E18FYNEE
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
31.3s
Cost:
$0.000762
Provider:
DeepInfra
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The dim light of their hideout flickered weakly, casting odd shadows on the walls. Molly leaned over a table cluttered with crumpled napkins, half-empty coffee cups, and a map of the neighborhood. She tapped the map with a pen, her brow furrowed in concentration. "Alright, Bill, listen up. The party's at three PM sharp. Kids will be hopped up on sugar, parents will be distracted, and that’s when we strike." Bill, sprawled on a rickety chair with his feet propped up on a box labeled "confiscated inflatables," squinted at her. "So, what’s the plan? Walk in, grab the loot, and run? Seems straightforward." "Straightforward?" Molly raised an eyebrow. "Bill, this isn’t just *any* heist. This is a children’s party. There’s security guards disguised as clowns, a bouncy castle that doubles as a surveillance tower, and let’s not forget the piñata—rigged to explode if tampered with." Bill snorted. "Exploding piñatas? You’re joking." "Dead serious," Molly said, flipping the map to reveal a hastily sketched diagram of the piñata. "See these wires? It’s a decoy. Looks innocent, but one wrong move and boom! Candy everywhere, alarms blaring, and us—caught red-handed with a handful of lollipops." Bill scratched his head. "Alright, so we avoid the piñata. Got it. What about the clowns?" "Clowns are our biggest threat," Molly said, pacing now. "They’re trained professionals. Jugglers by day, security experts by night. We need to distract them." "How?" Molly smirked. "Easy. We unleash the inflatables." She gestured to the box under Bill’s feet. "We’ve got a dozen rogue balloons—helium-filled, untethered, and unpredictable. Release them into the party, and chaos ensues. Clowns will be too busy chasing balloons to notice us." Bill grinned. "I like it. But how do we get in? We can’t exactly waltz through the front door." Molly pulled out a crumpled invitation from her pocket. "Thanks to your cousin’s kid, we’ve got a golden ticket. We’re posing as Mr. and Mrs. Clatterbottom, professional balloon artists. Nobody questions balloon artists." Bill blinked. "Balloon artists? Molly, I can’t even tie my own shoes." "You don’t have to," Molly said, waving a hand dismissively. "Just twirl a balloon around, look artistic, and let me handle the rest. Once we’re inside, we split up. You grab the presents from the gift table while I sabotage the piñata." "Wait, sabotage? I thought we were avoiding it." "Sabotage *carefully*," Molly clarified. "See, if we disable it now, it won’t explode later. Plus, we can loot the candy. Double win." Bill nodded slowly, though he still looked skeptical. "Alright, but what if someone catches us?" Molly grinned, pulling out a pair of oversized sunglasses and a fake mustache. "That’s why we have disguises. Nobody recognizes a man with a mustache." Bill frowned. "I’m pretty sure that’s not how disguises work." "Trust me, Bill," Molly said, tossing him the mustache. "This is foolproof. Now, let’s rehearse our balloon-twirling skills before the big day." She grabbed a balloon from the box and began twisting it into an unrecognizable shape. Bill sighed, adjusting the mustache. "This is either the dumbest plan we’ve ever had or the most brilliant." "Both," Molly said without missing a beat. "That’s how we know it’ll work."
created_at
"2026-02-16T21:33:41.312895+00:00"
2/16/2026, 9:33:41 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
372
moderation_latency
(null)
generation_time
31222
tokens_prompt
77
tokens_completion
879
native_tokens_prompt
66
native_tokens_completion
832
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0007616
router
(null)
provider_responses
0
id
"chatcmpl-RqtqkYZoEj09uY55JQXeaMGI"
status
200
is_byok
false
latency
48
endpoint_id
"5294d55f-9012-496b-8f22-8cc919432dcd"
provider_name
"DeepInfra"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277589-eUtfsaSpseOdrWGHrGC5"
upstream_id
"chatcmpl-RqtqkYZoEj09uY55JQXeaMGI"
total_cost
0.0007616
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
537
0.0000%
Dialogue to Total Word Ratio
Ratio: 67.04%, Deviation: 37.04%
neededClean
false
wordsTotal
537
wordsDialogue
360
0.0000%