NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2F1VHZY1NCFCH6SK03QPK2
Run Details
Model:
DeepSeek V4 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
9.9s
Cost:
$0.000177
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled like stale coffee and regret. Molly was sprawled across the passenger seat, peering through a pair of binoculars at the inflatable castle bobbing in the distance. “Okay, Bill. Talk to me. What’s the vibe?” she asked. Bill, crouched in the back, adjusted a clown wig that was three sizes too small. “Vibe is… aggressive. There’s a woman with a face like a slapped ham handing out juice boxes. She’s guarding the table.” “The gift table?” “The gift table,” Bill confirmed. “And it’s not just any gift table. It’s a mountain of glitter and bad parenting. I saw a Remote Control T-Rex wrapped in dinosaur paper. That thing’s half our take right there.” Molly lowered the binoculars. “Half? Please. That’s a retirement fund right there. How do we get around Ham-Face?” Bill held up a spritz can. “I have a solution. It’s called ‘Smell My Fingers.’ ” “That’s not a plan, Bill. That’s a prank from a cereal box.” “It’s a distraction. I walk in, offer her a whiff of this,” he wiggled the can, “she gags, you scoop the presents into a bin bag, we rendezvous by the porta-potty. Simple.” “And what about the parents? The screaming kids? The tiny human with the fake lightsaber who keeps hitting himself in the face?” Bill shrugged. “We ignore the small one. He’s his own worst enemy. As for the parents, they’re all staring at their phones. They won’t notice a thing until the cake comes out.” Molly chewed her lip. “I don’t know. I feel like we’re missing a step. What if the T-Rex isn’t the real prize? What if under that dinosaur paper is a *Book*?” Bill recoiled. “A book? At a party? Who brings a book to a party? That’s sociopathic.” “Bill, we are literally planning to rob children.” “For their toys, Molly. There’s a code. A *Book* is just dead weight. It’s litter.” She sighed, rubbing her temples. “Fine. Fine. We do the Smell My Fingers gambit. But if Ham-Face has a black belt in juice-box distribution, I’m leaving you with the T-Rex and finding a bank. At least there’s dignity in armed robbery.” Bill grinned, his clown smile painted crooked. “Dignity is for people who can afford full-price presents. I’ve got a van full of jester hats and a dream. You ready?” Molly grabbed a trash bag. “No. But let’s go ruin some birthdays.”
created_at
"2026-04-25T14:00:17.219Z"
4/25/2026, 2:00:17 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
722
moderation_latency
(null)
generation_time
9809
tokens_prompt
77
tokens_completion
641
native_tokens_prompt
67
native_tokens_completion
599
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001771
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"066bf19f-7003-4167-bebb-2d32ef420d78"
is_byok
false
latency
278
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777125617-AXTl80WG3w0dOExmp7KW"
session_id
(null)
api_type
"completions"
id
"gen-1777125617-n6Fl0GJm8ccCKtazcA8S"
upstream_id
"066bf19f-7003-4167-bebb-2d32ef420d78"
total_cost
0.0001771
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
399
0.00%
Dialogue to Total Word Ratio
Ratio: 78.47%, Deviation: 28.47%
neededClean
false
wordsTotal
404
wordsDialogue
317
0.0000%