NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2HEF64GV21TZ5FZDTYWJQJ
Run Details
Model:
DeepSeek V4 Pro
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
16.9s
Cost:
$0.001707
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
36.79%
Dialogue to Total Word Ratio
(details)
18.3940%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"And you're absolutely sure about the clown?" "I'm telling you, Bill, the clown is the key. Little Timmy's parents hired a premium clown, a full 'Jingles the Juggler' tier package. That means giant shoes, a squirting flower, everything." "Yeah, I get that. But the map shows the present room is guarded by the clown's emotional support alpaca." "Don't be dramatic, it's a therapy llama, and his name is Barnaby. According to the brochure, he only spits if you mock his tiny hat." "Fine, but why am *I* the one hiding in the giant cake? I'm six-foot-four and deeply claustrophobic." "Because you lost the coin toss, and your legs make a surprisingly convincing tiered cake stand. I’ll be the magician’s assistant who misdirects the kids with a tragic, never-ending scarf trick." "A tragic scarf trick?" "It’ll be a real tear-jerker. Now, I’ll distract the children with unsolvable balloon animals while you shimmy out the back of the cake, avoid Barnaby, and fill the sack." "You said the loot was a mountain of tablets and dolls. Why can I only hear a frantic squeaking?" "Ah, yes. A last-minute snag. The parents are those hyper-eco-conscious types. They replaced all the plastic toys with hand-carved, ethically sourced rubber ducklings. Live ducklings, Bill. Thirty-seven of them." "Live... ducklings." "Think of the resale value to a petting zoo! It's a symphony of profit, just with more beak. Now, synchronize watches and try not to quack."
created_at
"2026-04-25T14:42:09.372Z"
4/25/2026, 2:42:09 PM
model
"deepseek/deepseek-v4-pro-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
408
moderation_latency
(null)
generation_time
14893
tokens_prompt
77
tokens_completion
373
native_tokens_prompt
67
native_tokens_completion
356
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0017071
router
(null)
provider_responses
0
endpoint_id
"d084170b-dd99-4e53-9ae1-b63881f3cfbf"
is_byok
false
latency
269
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"DeepSeek"
status
429
1
endpoint_id
"13bda129-f8a6-4a60-b1db-98082e5f7ccd"
id
"ogZjBk5-2dTqGa-9f1e1cda549ced35"
is_byok
false
latency
401
model_permaslug
"deepseek/deepseek-v4-pro-20260423"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777128129-UlryCfx2AEUnhJ51YZNg"
session_id
(null)
api_type
"completions"
id
"gen-1777128129-bx0vnsvL7VqKljJkscPM"
upstream_id
"ogZjBk5-2dTqGa-9f1e1cda549ced35"
total_cost
0.0017071
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
239
36.79%
Dialogue to Total Word Ratio
Ratio: 100.00%, Deviation: 10.00%
neededClean
false
wordsTotal
248
wordsDialogue
248
18.3940%