NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-25-01KQ2JQ87KMSZ9XWDBWZ3TX82V
Run Details
Model:
DeepSeek V4 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
11.4s
Cost:
$0.000225
Provider:
DeepSeek
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
10.79%
Dialogue to Total Word Ratio
(details)
27.0846%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the strap of her neon-pink fanny pack and squinted at the inflatable bouncy castle across the street. “I’m telling you, Bill, it’s a goldmine. Look at all that shiny wrapping paper.” Bill, wedged behind a dumpster with a half-eaten corndog, shook his head. “It’s a five-year-old’s birthday party, Molly. They’ve got grab bags and a pinata. The most valuable thing there is the cake, and I’m not fencing a half-smashed SpongeBob.” “You’re not seeing the bigger picture,” Molly said, tapping a laminated map taped to the wall. “The parents. They bring gifts. Expensive gifts. I saw a drone box. And one of those fancy dollhouses that cost more than my car.” “Your car has a missing hubcap and a cassette player,” Bill muttered. “But fine. What’s the plan? We waddle in, do a magic show, and make off with a train set?” “Better.” Molly grinned. “You’re gonna be the balloon animal guy.” Bill choked on a piece of corndog. “I can’t make a balloon animal. I can’t even blow up a balloon without passing out.” “You don’t have to make them. You just have to *look* like you’re making them. I’ll hand you a pre-tied poodle, you hand it to a kid, and they squeal. Meanwhile, I’m ‘helping’ the parents with the presents. I’ll swap the real boxes for these.” She pulled a flattened cardboard box from her pack. “Empty replica boxes. Same size, same weight, but filled with rocks.” Bill stared. “Rocks. You’re stealing presents from children and replacing them with rocks.” “It’s a moral compromise. The kids still get the thrill of unwrapping something heavy. The parents think they’ve been generous. And we get the drone.” Bill tossed the corndog stick. “What about the birthday kid? The one whose face is supposed to light up when they see the dollhouse? You’ve just given them a rock.” “I’m prepared for that.” Molly pulled a crumpled piece of paper from her pocket. “I wrote a poem. It’s called ‘The Rock of Imagination.’ It says the rock is a magical stone that can be anything if they dream hard enough. I’ll read it at the cake-cutting.” Bill stared at the poem. “This is a grocery list. It says ‘apples, milk, and please don’t cry, the rock is for the soul.’” “It’s poignant.” “It’s a felony with a side of emotional damage,” Bill said. “We’re doing this. Fine. But if a kid starts crying, you’re the one who has to do the face. Because I don’t have a face for children. I have a face for eviction notices.” Molly clapped him on the back. “That’s the spirit. Now get your clown wig. The party starts in ten minutes.” Bill pulled the orange wig out of a garbage bag. “I hate you.” “No, you don’t,” Molly said, adjusting her fanny pack. “You just hate the rocks.” “And the poem.” “And the poem.” They crossed the street, stepping over a discarded party hat, and Molly whispered, “Remember: stay goofy, keep moving, and if anyone asks about the balloon poodle, tell them it’s a dachshund. They don’t know the difference.”
created_at
"2026-04-25T15:04:24.055Z"
4/25/2026, 3:04:24 PM
model
"deepseek/deepseek-v4-flash-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
569
moderation_latency
(null)
generation_time
11349
tokens_prompt
77
tokens_completion
836
native_tokens_prompt
67
native_tokens_completion
770
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00022498
router
(null)
provider_responses
0
endpoint_id
"722e0746-2034-4d6e-a08f-e6271849ee9e"
id
"80e6259a-d3cf-4195-b2f9-733fa54625f2"
is_byok
false
latency
302
model_permaslug
"deepseek/deepseek-v4-flash-20260423"
provider_name
"DeepSeek"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777129464-6XBuUrP41npI4UzceCBC"
session_id
(null)
api_type
"completions"
id
"gen-1777129464-l6S9kZoQyh3sgYOmRL3b"
upstream_id
"80e6259a-d3cf-4195-b2f9-733fa54625f2"
total_cost
0.00022498
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepSeek"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
517
10.79%
Dialogue to Total Word Ratio
Ratio: 76.67%, Deviation: 6.67%
neededClean
false
wordsTotal
523
wordsDialogue
401
27.0846%