NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ72XB2WBBW994CE7674YZYP
Run Details
Model:
Qwen 3.6 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 49s
Cost:
$0.014747
Provider:
AtlasCloud
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
51.47%
Dialogue to Total Word Ratio
(details)
25.7481%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her wire ears, peering through the gap in the hedge where the inflatable dinosaur wobbled menacingly. Bill kicked a plastic spade buried in the mulch, his knuckles white around the reinforced canvas sack. Molly tapped her tablet, displaying a schematic of the backyard drawn in jagged red lines. "Bill, stop fidgeting with loot bag. You're wrinkling velvet. We need stealth, not fabric softener incident." "It's not the fabric, Molly. It's the target. Are you sure the birthday boy won't notice five thousand dollars worth of electronics vanish before the piƱata even breaks? Last time we hit a garage sale, I still hear the ghost of a bargain bin." "Damp sponge. Cognitive function is roughly equivalent to a damp sponge. If we strike during the song of Happy Birthday and extinguish the candles, he won't realize his gifts are gone until morning. By then, we'll be three towns over wearing the stolen Lego sets as hats." Molly rolled her eyes, checking the timer on her wrist. Bill shrank back behind a potted fern, nearly crushing a plastic flamingo. Molly handed him a bag of glow-in-the-dark stickers and a whoopee cushion. Bill blinked, eyeing the whoopee cushion with deep suspicion. Molly pointed to a sewer grate near the oak tree, where three raccoons were already wearing tiny bandanas. "Then just trip the inflatable T-Rex. It'll roll toward the cake table, the kid will chase the apex predator, and you'll slip in like a shadow wearing cargo shorts. Timing is everything. When the mom starts the 'Blow out the candles and make a wish' chant, that's your window. You grab the tree, you roll out. If you linger, you risk the wrath of a toddler who realizes his new game console is missing. Never fight a hungover parent, and never fight a disappointed child. The screaming is worse." Bill zipped the sack open, testing the weight with his pinky. Molly nodded toward the house, where the muffled strains of "Baby Shark" began to rise from the kitchen. The air smelled of sugar and impending chaos. The backyard was a labyrinth of colorful obstacles. Bill adjusted his grip on the sack. Toys were scattered everywhere. Molly whispered a countdown, her eyes locked on the kitchen window where the silhouette of a woman began to arrange plates of frosting-covered triangles. The raccoons near the grate chittered softly, their bandana-wrapped heads twitching as they anticipated the drop of stolen berries. One of the creatures held up a paw, signaling readiness, while another gnawed enthusiastically on a particularly juicy blackberry. The wind shifted, carrying the scent of vanilla cake and the faint metallic tang of the loot inside Bill's bag. Molly checked her watch again, her expression stern as she calculated the remaining seconds. "It's a multi-species operation, Bill. Adapt or die. Also, check your goggles. One speck of glitter, and you're compromised. We don't leave traces. We leave confusion and a single, very sad birthday card signed by a raccoon." Bill slid the goggles on, the lenses reflecting the neon streamers. Bill nodded, swallowing hard. "Understood. I'll handle the dinosaur. You ensure the cake survives the initial chaos." Yes, Chief. Don't drop the stash.
created_at
"2026-04-27T09:04:18.531Z"
4/27/2026, 9:04:18 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1065
moderation_latency
(null)
generation_time
108741
tokens_prompt
77
tokens_completion
14396
native_tokens_prompt
76
native_tokens_completion
15265
native_tokens_completion_images
(null)
native_tokens_reasoning
14565
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.014746792
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
id
"4e7cb7369f74480ca1629b007c4f83b9"
is_byok
false
latency
1065
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777280658-HmAvoYSc6QWX6CznS10Q"
session_id
(null)
api_type
"completions"
id
"gen-1777280658-xJNGgWF33kVOHMHyX58U"
upstream_id
"4e7cb7369f74480ca1629b007c4f83b9"
total_cost
0.014746792
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
530
51.47%
Dialogue to Total Word Ratio
Ratio: 46.36%, Deviation: 3.64%
neededClean
false
wordsTotal
535
wordsDialogue
248
25.7481%