NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2HXNGWVYGCW1N8PT8BM41W
Run Details
Model:
DeepSeek V4 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
7.5s
Cost:
$0.000126
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
99.95%
Dialogue to Total Word Ratio
(details)
49.9881%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“So we go in as the entertainers?” Bill asked, frowning at the glitter-covered invitation. “Clowns, Bill. We go in as clowns.” “I hate clowns. They’re creepy.” “Creepy is good. Kids expect a little cringe. It’s the parents you gotta watch.” “And the presents. How do we get forty gift bags out without looking suspicious?” Molly tapped her temple. “You wear the balloon-animal vest. Pockets for days. I’ll do face painting. My kit’s full of zippered compartments.” “Face painting? You can’t draw a stick figure.” “I’ll do hearts. Hearts and stars. Easy. While I paint the birthday girl, you mingle. Bonbon, the unicorn-themed loot bags, the gift cards—they go in the vest.” Bill sighed. “And when little Timmy starts wailing because his Slimezilla action figure is gone?” “We blame the magician. There’s always a magician. Poor guy gets no tips and has a rabbit that bites.” “What if they search us?” “They won’t. We’re entertainers. We’ll be too busy doing the chicken dance.” Bill stared at her. “The chicken dance?” “It’s a crowd pleaser. Gets them distracted. You flap, I stuff. We’re out before the cake is cut.” “And if it goes wrong?” Molly grinned. “Then we use the emergency squirting flower. Works every time.” “Fine. But I’m wearing baggy pants. For the smoke bombs.” “You’re a natural, Bill. Now let’s go steal a unicorn-themed loot bag from a six-year-old.”
created_at
"2026-04-25T14:50:25.698Z"
4/25/2026, 2:50:25 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
979
moderation_latency
(null)
generation_time
7455
tokens_prompt
77
tokens_completion
465
native_tokens_prompt
67
native_tokens_completion
416
native_tokens_completion_images
(null)
native_tokens_reasoning
48
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00012586
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"4531f711-1e9a-4937-b741-e983d0c9905f"
is_byok
false
latency
433
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777128625-7wGo39oWLUBAEe0ocOZ9"
session_id
(null)
api_type
"completions"
id
"gen-1777128625-vCZydClXaAUhI7LCMIAt"
upstream_id
"4531f711-1e9a-4937-b741-e983d0c9905f"
total_cost
0.00012586
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
230
99.95%
Dialogue to Total Word Ratio
Ratio: 91.53%, Deviation: 1.53%
neededClean
false
wordsTotal
236
wordsDialogue
216
49.9881%