NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ72KQDHYBZW1J915Q17PRKH
Run Details
Model:
Qwen 3.6 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 45s
Cost:
$0.023903
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
58.50%
Dialogue to Total Word Ratio
(details)
29.2523%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The wind rustled the hedges. Molly clutched her grappling hook, a coat hanger taped to a broomstick. Beside her, Bill fumbled with oven mitts. The suburban street was eerily quiet, save for the distant whine of a lawnmower and the thumping bass of a birthday playlist. A plastic flamingo stood sentinel in the yard, missing one foot. Molly adjusted her grip on the broomstick, her knuckles white inside the fabric mitts. The target was clear: the living room window, currently obscured by a layer of condensation and children's fingers. "Bill, the perimeter," Molly hissed, squinting through the rhododendrons. "Class-A balloon security. One red pop and the whole block is on us." "Red is a decoy, Mol. We need the purple valve. The pressure gauge is visible." "Bill, the hostess has a pug. That beast can smell a criminal from three zip codes away. Plus, you're wearing tuxedo tights. You look like a gummy bear in a tuxedo." "The tights are for aerodynamics. And the pug sleeps during 'Happy Birthday.' We have thirty seconds. We swap the presents for smaller boxes with identical paper, then we vanish." "Swap them? If we leave smaller boxes, the parents open them and find bricks. We'll end a marriage. We become the reason the Johnsons' divorce happens. We can't end a marriage, Bill. The guilt would ruin my appetite for stolen cupcakes." "We're professionals. We care about the loot. There's a diamond necklace inside the Tickle-Me-Elmo. That's the prize. Everything else is filler." "The Elmo? The pile is a mountain of wrapping paper. It's Everest with glitter. How do we extract the Elmo without toppling the whole structure?" "I've modified the Shop-Vac. Reverse suction. We hit the pile, air pressure pops the presents off the table, my net catches them. Efficient. Silent." "Silent? The Shop-Vac sounds like a jet engine! And reverse suction? You'll suck the head off the birthday boy! We're stealing toys, not creating a biological hazard!" "Fine. I'll dress as the entertainer. I take the stage, do the balloon animals, while you grab the Elmo. I'll create a diversion by setting my mustache on fire." "Your mustache is wax! It won't burn!" "I'm adding a wax substitute! I'm expanding my repertoire!" "Bill, you're lactose intolerant. Fire-eating is suicide. The fumes will knock out the toddler brigade." "Then I'll do a mime act. I'll trap myself in an invisible box. Parents will be too confused to notice you rifling through the gifts. It's psychological warfare." "Okay, mime has merit. But I'm not touching the Elmo. Too risky. I want the Lego Death Star. Four hundred bucks on eBay with stickers intact." "The Death Star? That's a twelve-inch box. You can't hide that in your tights. You'll look like you have a prosthetic leg." "I'll wear a skirt. I have a skirt in the van." "Over the tights? That's not stealth, that's drag aesthetics." "Just get the vacuum ready, maniac! And hide that lighter!" The house loomed ahead, draped in festive bunting and terror. Through the window, a pinata spun wildly. 'I want that one!' drifted through the screen, vibrating in Molly's molars. The smell of vanilla cake and desperation wafted from the open window. Bill checked his colander-goggles again, squinting at a fire ant marching across his nose, completely unaware that the pug's snout was now pressing against the glass.
created_at
"2026-04-27T08:59:03.48Z"
4/27/2026, 8:59:03 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
242
moderation_latency
(null)
generation_time
93596
tokens_prompt
77
tokens_completion
11280
native_tokens_prompt
76
native_tokens_completion
11938
native_tokens_completion_images
(null)
native_tokens_reasoning
10410
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0239026
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
is_byok
false
latency
10077
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
504
1
endpoint_id
"57ebdffc-b059-4d92-a1ac-94436bae3d31"
id
"chatcmpl-b03f15faa8dc9d9d"
is_byok
false
latency
242
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777280343-zj31zECcGu4zZCr3S8Uy"
session_id
(null)
api_type
"completions"
id
"gen-1777280343-JC2oUuLKwkKxkTuoGpiD"
upstream_id
"chatcmpl-b03f15faa8dc9d9d"
total_cost
0.0239026
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
557
58.50%
Dialogue to Total Word Ratio
Ratio: 73.27%, Deviation: 3.27%
neededClean
false
wordsTotal
565
wordsDialogue
414
29.2523%