NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5TJAD0Y8YG8CW1N431R7G
Run Details
Model:
DeepSeek V3.1
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
10.8s
Cost:
$0.001279
Provider:
Fireworks
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air in the cramped van was thick with the smell of old coffee and desperation. Molly tapped a glittery pink crayon against a well-chewed notepad, her brow furrowed in concentration. Bill, a man built like a disappointed refrigerator, was attempting to stretch a pair of tights over his prodigious head. “The problem,” Molly said, not looking up from her schematics, “is the dragon.” Bill’s muffled voice emerged from the stretched nylon. “I thought the problem was the laser security system. Or the motion-activated sprinklers. Or the fact that we’re two grown adults planning a felony at a six-year-old’s birthday party.” “The dragon is the perimeter guard,” Molly clarified, drawing a shaky circle around a stick-figure dragon breathing what appeared to be glitter. “His name is Sir Reginald. He’s a retired show dog with a nervous bladder and a deep-seated mistrust of clowns. Which is what you’ll be.” Bill finally won the battle with the tights, his face emerging, red and squashed. “I look like a burst sausage. Why do I have to be the clown?” “Because,” Molly said with the patience of a bomb disposal expert, “you’re the ‘in’. You create the distraction. The parents will be hypnotized by your tragic balloon animals, the children will be terrified, and I, disguised as the overly-enthusiastic magician’s assistant, will liberate the gifts from the secure holding area—aka, the garden shed.” “What’s my clown name?” “Weeping William. It’s thematically appropriate. Now, focus. The cake is a seven-tier chocolate fortress arriving at 1400 hours. That’s our signal. Once the sugar high kicks in and the first child has a meltdown over a broken piñata sword, we move.” Bill picked up a tiny, rainbow-colored bicycle horn and gave it a despondent squeak. “I just feel like our skills are being wasted. We lifted the Crown Jewels of Svenborgia.” “And we’d have gotten away with it if you hadn’t tried to pay for a hot dog with a diamond the size of a quail’s egg,” Molly snapped. “This is a simple in-and-out. Low security, high reward. The guest of honor is a child named ‘Braxxlynn’. Her father is a tech billionaire. That pile of presents is a goldmine of unregistered gift cards and ethically-sourced wooden toys we can fence for a fortune.” “What’s the exit strategy?” “The bouncy castle. We create a panic, you ‘accidentally’ release the anchoring straps. While it’s careening across the lawn with nineteen screaming children aboard, we slip out the side gate with the loot stuffed into a giant, novelty goody bag.” Bill was silent for a long moment, staring at his oversized, floppy shoes. “I have a question.” “What?” “Do I get a piece of the cake? I heard it’s gluten-free vanilla bean with a passionfruit ganache.” Molly slammed her notepad shut. “Fine. You can have the top tier with the edible picture of Braxxlynn’s face on it. Now, are you committed, or am I going to have to find a new clown?” Bill sighed, the sound deflating his polka-dot ruff. He picked up a red foam nose and put it on. “Alright. For the cake.” He gave the bicycle horn another sad squeak. “Honk honk.”
created_at
"2026-02-16T21:31:21.542137+00:00"
2/16/2026, 9:31:21 PM
model
"deepseek/deepseek-chat-v3.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
270
moderation_latency
(null)
generation_time
10697
tokens_prompt
95
tokens_completion
852
native_tokens_prompt
68
native_tokens_completion
750
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
67
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00127932
router
(null)
provider_responses
0
id
"9e7e0a59-0780-4f5e-8a24-a25ecf1edf03"
status
200
is_byok
false
latency
229
endpoint_id
"0c00f0d2-8aad-4700-bd36-11cb9a717fde"
provider_name
"Fireworks"
model_permaslug
"deepseek/deepseek-chat-v3.1"
api_type
"completions"
id
"gen-1771277470-znF6hj6Vfq9ge4CKtJvC"
upstream_id
"9e7e0a59-0780-4f5e-8a24-a25ecf1edf03"
total_cost
0.00127932
cache_discount
0.00001876
upstream_inference_cost
0
provider_name
"Fireworks"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
541
0.0000%
Dialogue to Total Word Ratio
Ratio: 68.21%, Deviation: 18.21%
neededClean
false
wordsTotal
541
wordsDialogue
369
0.0000%