NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2H1FDCSH0WSAB15HKN61KS
Run Details
Model:
DeepSeek V4 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
6.7s
Cost:
$0.000123
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
8.70%
Dialogue to Total Word Ratio
(details)
4.3520%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The inflatable bouncy castle loomed like a plump, grinning fortress. Molly flattened herself against the hedge, binoculars trained on the front door. A glittering banner screamed “Happy 6th Birthday, Chloe!” in shimmering unicorn font. Beside her, Bill adjusted his fake mustache, which was already peeling at the edge. “They’re doing the piñata now,” she whispered. “Thirty seconds until the sugar crash opens the prize table.” Bill tugged at his rainbow-striped party hat, a prop purchased at considerable personal cost. “Remind me again,” he grumbled, “why we aren’t hitting a bank?” “Because banks have alarms, Bill. Kids have nap time.” Through the window, she watched a small girl in a tiara stumble away from the wrecked unicorn-piñata, clutching a bag of candy. Perfect. The parents were pouring juice, their attention scattered like the spilled sprinkles on the lawn. Molly slid a walkie-talkie from her cargo pocket. “Phase two. You go clowns, I go stolen.” Bill groaned. “I hate clowns.” “You hate not getting paid more.” He sighed, then plastered on a rictus grin and strode toward the party with a bag of balloon animals. Molly counted to ten and slipped around the side, toward the glittering mountain of presents. Five minutes, tops. She’d already found the perfect hiding spot behind the cake table. The only thing slower than a sugared-up six-year-old was a full bladder. And she had three juice boxes tucked in her belt.
created_at
"2026-04-25T14:35:01.936Z"
4/25/2026, 2:35:01 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
565
moderation_latency
(null)
generation_time
6562
tokens_prompt
77
tokens_completion
466
native_tokens_prompt
67
native_tokens_completion
407
native_tokens_completion_images
(null)
native_tokens_reasoning
73
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00012334
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"1b82a6e2-e0e7-4b66-bed0-3a1904586dff"
is_byok
false
latency
282
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777127701-JlJNdmfhdo2RCGLtlhOh"
session_id
(null)
api_type
"completions"
id
"gen-1777127701-WWGhekERkxXgWsJcJOaa"
upstream_id
"1b82a6e2-e0e7-4b66-bed0-3a1904586dff"
total_cost
0.00012334
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
234
8.70%
Dialogue to Total Word Ratio
Ratio: 22.50%, Deviation: 12.50%
neededClean
false
wordsTotal
240
wordsDialogue
54
4.3520%