NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2KN2FZZQKX1GQSP2YQGY5Q
Run Details
Model:
DeepSeek V4 Pro (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
33.4s
Cost:
$0.003826
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unfolded a crinkled party invitation across the sticky Formica table, her pinkie finger tracing the cartoon dinosaur wearing a party hat. “See that? ‘Bring a gift to share the magic.’ Magic is just unguarded consumer goods with shiny wrapping, Bill. And I want all of it.” Bill sucked the last of a blue raspberry slushie through a straw, the gurgle a mournful punctuation mark. “It’s a five-year-old’s birthday, Mol. The target demographic still believes in nose-picking fairies. What’s the fence going to give us for a glitter-encrusted xylophone and a plush narwhal that farts rainbows?” “Farts rainbows, you say? That’s proprietary technology. We’re not fencing them.” Molly pulled a second sheet from her bag, a detailed blueprint hand-drawn in crayon on the back of a kids’ menu. She’d bribed the birthday boy’s older sister with a pack of stickers that smelled like grape soda. “We’re regifting them. There’s a seven-year-old’s ‘princess rave’ across town next Saturday. Ever tried to buy a light-up tutu at retail, Bill? It’ll break your soul. We swap the loot, charge a premium for last-minute parental desperation, and vanish like sugar-high phantoms.” Bill’s eyes slowly widened. “So we’re present pirates. With a triage system for joy.” He tapped the crayon drawing. A wobbly square represented the bouncy castle. “What’s our entry point?” “I’m going in as Sparkles the Mime.” Molly said it with the gravity of a general announcing a beachhead landing. A long pause hung between them, flavoured with artificial sweetener. “You hate mimes. You cross the street to avoid them. You once body-checked one who got too close with an invisible rope.” “Which is why nobody will expect it. The costume comes with white greasepaint that hides my identity and a pre-established character trait of not speaking, which eliminates the risk of me telling a four-year-old what a shakedown is. You’ll be on wheels.” “My knees make a popping sound when I crouch. The paediatric community has incredibly acute hearing for structural weakness.” Molly pointed to a jagged line on the menu. “That’s the piñata blindfold run. You’ll be dressed as a generic woodland creature, a raccoon maybe. A lovable trash bandit. Your job is to intercept the gift table during the post-candle chaos. The sugar crash hits, parents are comforting sobbing toddlers who’ve seen the clown’s real face, and you waddle in with an empty biodegradable gift bag. You’re not taking, you’re cleaning up. Eco-friendly sanitation. The raccoon suit has a false belly. With pockets.” Bill considered this, scratching his chin. “A false belly. For loot. So my cover is a marsupial with a commitment to recycling.” He nodded slowly. “I’ll need articulated paw gloves for the larger boxes. And what about the big one? The birthday boy’s main present from the parents. It’s a motorised miniature Range Rover. It has torque.” “That’s the piñata distraction. The candy isn’t just candy, Bill. I’ve laced the payload with popping candy and pure caffeine pills, crushed to a fine powder. While thirty tiny adrenaline goblins are literally bouncing off the walls and the inflatable architecture, the adults will form a human shield wall of exhaustion. You drive the Range Rover straight out the back gate disguised as a recycling truck. I’ll mime an elaborate farewell, creating a mobile no-man’s-land of awkwardness to cover your retreat.” Bill gathered their trash, dabbing a napkin at a spot of ketchup on his shirt. “You packed the mime shoes with the silent soles?” “Spray-painted them matte black myself. Look like a dance accident, sound like a cougar’s prayer.” “Then I’m in.” He crumpled the napkin with a grin. “Let’s go steal a future tycoon’s sense of entitlement and redistribute it to the raving princesses.”
created_at
"2026-04-25T15:20:41.224Z"
4/25/2026, 3:20:41 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
910
moderation_latency
(null)
generation_time
32862
tokens_prompt
77
tokens_completion
1206
native_tokens_prompt
67
native_tokens_completion
1066
native_tokens_completion_images
(null)
native_tokens_reasoning
159
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00382626
router
(null)
provider_responses
0
endpoint_id
"9b918aa8-bd4d-4489-b665-df3f6df60aec"
id
"93a336d979b4822e10c2aa93a8d6a6df"
is_byok
false
latency
910
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"Novita"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777130441-1GM2qXNmjDM7peZII2iA"
session_id
(null)
api_type
"completions"
id
"gen-1777130441-XEMdqWy01OTEqlaXyib0"
upstream_id
"93a336d979b4822e10c2aa93a8d6a6df"
total_cost
0.00382626
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
616
0.00%
Dialogue to Total Word Ratio
Ratio: 75.36%, Deviation: 45.36%
neededClean
false
wordsTotal
633
wordsDialogue
477
0.0000%