NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHM5H05A8JWN1WWWGZ85HZFK
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
15.9s
Cost:
$0.000758
Provider:
AtlasCloud
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
**Scene: The Great Birthday Heist** Molly adjusted the fake mustache she’d glued above her lip and peered through binoculars at the suburban house across the street. Balloons bobbed in the yard, and a clown twisted a green balloon into a sword for a shrieking six-year-old. She grimaced. Bill crouched beside her behind a hedge, squinting at his notepad. “Alright, listen up. Here’s the plan. We blend in as—get this—*parents*.” Molly lowered the binoculars. “Parents.” “It’s genius! Who suspects parents at a kid’s party?” Bill beamed. Molly stared. “You look like someone who got lost on the way to a biker rally.” Bill glanced down at his leather vest and fingerless gloves. “This is my *blending* outfit.” Molly sighed. “We need disguises. Yours is jail, mine is actually convincing.” “Fine, fine. New plan.” Bill ripped a page from his notepad. “We go in as caterers.” “Do we *look* like we know how to work a tray of mini hot dogs?” “Okay, then, clowns.” “Too obvious.” “Mime?” “Hate mimes.” Bill groaned. “Fine. Your turn, thief extraordinaire.” Molly smirked. “Easy. We’re the entertainment—the *magic* team. Kids love magic. And magic involves *misdirection.*” Bill blinked. “You’re suggesting we *steal* all their presents *while doing magic tricks?*” “Exactly.” Molly pulled out a deck of cards and shuffled them dramatically. “We dazzle ‘em with flair, plant the idea of a ‘wishing box’ for presents, and *poof*—gifts disappear.” Bill scratched his chin. “That’s… terrifyingly devious. I like it.” Molly handed him a top hat and cape. “You’re the assistant. Try not to trip on stage.” Bill jammed the hat on his head. “Assistant? I’m the star!” Molly snapped her fingers. A pigeon flew out of Bill’s sleeve. He yelped. “Wha—*where did that—?*” Molly grinned. “Star material here.” Bill glowered but adjusted his cape. “Alright, Houdini. Let’s scope the loot.” Molly pulled out a crumpled invite. “Piñata full of candy, stack of presents by the fireplace, and—” “Wait.” Bill pointed. “What’s *that?*” A massive, pristine stuffed giraffe leaned against the gift table. Molly’s eyes gleamed. “That’s our centerpiece. The golden goose.” Bill swallowed. “That’s bigger than my escape routes.” “So improvise.” Molly stood, dusting off her sequined sleeves. “Alright—crowd work first, then the ‘wishing box’ bit. Kids put gifts in, we ‘make ‘em vanish.’” Bill gulped. “And when the *real* parents notice?” Molly shrugged. “We vanish too.” He sighed. “Remind me why we’re robbing a six-year-old’s party?” Molly adjusted her fake mustache. “Because banks have security. Birthday parties have juice boxes.” Bill nodded solemnly. “Profound.” “Now go—distract the clown. I heard he does funny voices.” Bill groaned but jogged toward the party, cape fluttering wildly. The clown turned, startled. Molly took a deep breath. “Showtime.” With a flourish, she pulled a bouquet of balloons from her sleeve and strolled toward the chaos. Somewhere behind her, Bill screamed as the clown sprayed him with a seltzer bottle. Perfect misdirection.
created_at
"2026-02-16T21:26:13.051755+00:00"
2/16/2026, 9:26:13 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
823
moderation_latency
(null)
generation_time
15823
tokens_prompt
77
tokens_completion
933
native_tokens_prompt
67
native_tokens_completion
846
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000758304
router
(null)
provider_responses
0
id
"f242c1f69ea047bbbe5672d64ec83e69"
status
200
is_byok
false
latency
823
endpoint_id
"5bb598f5-fd97-4f8b-91e4-ceab3743a03b"
provider_name
"AtlasCloud"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277156-Qn2ooLp6VzCciRcthuwe"
upstream_id
"f242c1f69ea047bbbe5672d64ec83e69"
total_cost
0.000758304
cache_discount
6.48e-7
upstream_inference_cost
0
provider_name
"AtlasCloud"
Evaluation details
Result
Evaluator
Details
Meta Data
75.1556%
Matches word count
n/a
neededClean
false
words
487
0.0000%
Dialogue to Total Word Ratio
Ratio: 50.72%, Deviation: 19.28%
neededClean
false
wordsTotal
487
wordsDialogue
247
37.5778%