NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2DRE7JHZDSKSYZZ7DZ5WZM
Run Details
Model:
DeepSeek V4 Pro
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
12.3s
Cost:
$0.000338
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.49%
Matches word count
(details)
36.79%
Dialogue to Total Word Ratio
(details)
18.6400%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"A giant inflatable dragon." "A dragon." "It's their bouncy castle, Bill. It's a forty-foot bouncy dragon with a wading pool of ball-pit gold in its belly." "Still a dragon." "Fine. We're robbing Smaug, but he's full of toddlers. The point is the presents are piled inside the 'Treasure Hoard' themed snack tent. We need a distraction." "So we pop the dragon." "We absolutely do *not* pop the dragon. That's the first rule of kid-heisting: you pop the bouncy castle, you wake the beast. Thirty sugar-crazed five-year-olds, Bill. They'll swarm us like furious, glittery piranhas." "Fine. No popping. We blend in." "You want to blend. With the clown? Because 'Bubbles the Magic Clown' is already sitting in a police van after I may have tipped off his parole officer." "I was going to say the magician. I do a devastating disappearing coin trick." "You do a devastating dropping-the-coin-and-fumbling-it-into-a-heating-vent trick. I've booked 'Baxter the Birthday Warlock.'" "I'm Baxter the Birthday Warlock?" "You are. Robes, beard, dubious smoke effects. You'll announce a ceremony to 'enchant the presents for eternal merriment.' The parents will eat it up. You levitate the biggest gift, smoke bomb, we're out." "You mean I push it out the back tent flap while you catch it in a wagon disguised as a tiny fire truck." "That's the enchantment. What's my cover?" "Irish elf. Your name is Nibbles."
created_at
"2026-04-25T13:37:40.097Z"
4/25/2026, 1:37:40 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
991
moderation_latency
(null)
generation_time
12206
tokens_prompt
77
tokens_completion
361
native_tokens_prompt
67
native_tokens_completion
355
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000337995
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
id
"459bb4a3-1426-4145-a767-0efc9ac33d1a"
is_byok
false
latency
269
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777124260-pizEAuiY6GlxnG4mb9Gc"
session_id
(null)
api_type
"completions"
id
"gen-1777124260-5diiF2sBEqD1mgMUdYAp"
upstream_id
"459bb4a3-1426-4145-a767-0efc9ac33d1a"
total_cost
0.000337995
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.49%
Matches word count
n/a
neededClean
false
words
227
36.79%
Dialogue to Total Word Ratio
Ratio: 100.00%, Deviation: 10.00%
neededClean
false
wordsTotal
242
wordsDialogue
242
18.6400%