NC Bench
Run: 2026-04-27-01KQ73DBFNRS8NAKDRBW5MQZH8
Run Details
Model: Qwen 3.6 35B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 2m
Cost: $0.018505
Provider: AtlasCloud
Evaluation summary
Score    Evaluator
0.00%    Matches word count
0.20%    Dialogue to Total Word Ratio
0.1009%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the neck seal of her inflatable T-Rex costume, the PVC whistling like a dying flute with every shallow breath. The costume smelled faintly of mildew and previous failed operations. Through the porthole mesh of the dinosaur's jaw, she watched the community center basement transform into a chaotic hellscape of primary colors and excessive sugar. Confetti coated the linoleum like radioactive snow, and the air hummed with the manic screech of thirty six-year-olds fueled by adrenaline and fruit punch. Bill was stuck inside a glittering unicorn piñata, his tactical vest too wide for the craft paper enclosure. The horn drooped sadly to the left as he twisted his torso, trying to avoid triggering the dangling candy streamers that tickled his knees. "Bill, get out of the horse," hissed Molly, poking the plastic mane. "You're blocking the gummy worm shaft." "I can't," Bill muffled through the rainbow fur. "The strap got caught on a juice box. Also, I think I tasted sugar. This is humiliating." Molly checked the timer strapped over her thick rubber arm. The digital display blinked red. "Focus. The birthday boy, Timothy, just turned six. The presents are stacked near the cake. That's our target. Once we grab the loot, we swap the real stuff with identical items wrapped in newspaper and filled with lint. Classic misdirection." Bill finally extricated himself with a violent tear of velcro, sending a cloud of glitter into the stagnant air. He brushed the residue off his vest, leaving one plastic horn embedded in his shoulder pad. "Lint? Really? That's our signature? We're professional criminals, Molly, not domestic terrorists." "It's psychological warfare, Bill. You open a present expecting a Lego set and find fuzzballs? Your confidence shatters. Besides, the lint is recycled from your own socks. It adds authenticity. Now, watch the aunt. The one with the camera. She doesn't blink. I've watched her for forty minutes. She hasn't blinked." 
Bill peered over the edge of the bouncy castle, clutching the dislodged horn like a shiv. His knuckles were white, and a smear of rainbow sprinkles marred his tactical goggles. "I saw a guard dog earlier." "That's an inflatable dragon with a leaf blower attached to its rear end. It's not guarding anything but the driveway. The real threat is the aunt. My mother warned me about her. She once returned a library book three years late with a passive-aggressive note written in red ink. We can't take the presents. The risk is too high. She'll report us to the HOA." "We've come this far, Bill. We practiced this heist in my backyard using action figures and a cardboard box. We are ready. The plan is simple. I create a distraction by tripping over the snack table. You sprint to the presents, bag them, and deploy the smoke bomb." Bill's eyes widened beneath the glitter. "Smoke bomb? Molly, this is a children's party. If I deploy a smoke bomb, we're not just thieves; we're monsters. Parents have strollers with hidden compartments; they'll hunt us." "It's not regular smoke, Bill. It's cotton candy vapor. It's sweet, it's colorful, and it confuses toddlers. They'll be too busy trying to eat the air to pursue us. Plus, I already ate the trigger mechanism, so it's hard to misfire." Bill stared at her, his face pale inside the piñata fur. "You ate the trigger?" "It looked like a gummy bear. Don't judge me. Anyway, once the vapor hits, we zip-line out using the streamers. I've tied the knot to the chandelier. It should hold." Bill wiped sweat from his brow, smearing a streak of unicorn fur across his forehead. "The chandelier in a community center basement? That's not a chandelier; that's a light fixture held by prayer and duct tape. It'll snap immediately." "Exactly! It'll snap immediately, providing excellent propulsion. Now, are you in or out? I'm hungry, and I have a craving for stolen chocolate cakes. My stomach is growling so loud the T-Rex speakers are picking it up." 
Bill sighed deeply, the sound muffled by the residual piñata stuffing in his collar, and pulled a roll of heavy-duty duct tape from his belt loop. "If we survive this, I'm billing you for the horn. The cake is being cut, and that's when the pressure drops. We go in thirty seconds." Molly raised a squeaking foam fist, the rubber squeaking loudly enough to be heard over the birthday music. "Roger that. Operation Sweet Tooth is a go. Try not to lick anything." "No promises."
created_at
"2026-04-27T09:13:03.231Z"
4/27/2026, 9:13:03 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
693
moderation_latency
(null)
generation_time
120281
tokens_prompt
77
tokens_completion
18359
native_tokens_prompt
76
native_tokens_completion
19158
native_tokens_completion_images
(null)
native_tokens_reasoning
18134
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01850451
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
id
"f401c40e90a24b5db90035f89423c69e"
is_byok
false
latency
693
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777281183-qfu3qvKHhQYPCn9583ki"
session_id
(null)
api_type
"completions"
id
"gen-1777281183-ZGlr3QBVCg90Jf4cPoyF"
upstream_id
"f401c40e90a24b5db90035f89423c69e"
total_cost
0.01850451
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
response_cache_source_id
(null)
Evaluation details
Result: 0.00%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 749

Result: 0.20%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 61.14%, Deviation: 11.14%
Meta Data: neededClean: false, wordsTotal: 754, wordsDialogue: 461

Overall: 0.1009%
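The reported ratio and deviation can be checked from the word counts in the evaluation details. The sketch below is illustrative only. The function names, the whitespace-based word count, and the rule that dialogue is anything inside straight double quotes are assumptions, not the benchmark's actual evaluator. The two evaluators report different totals (749 and 754), so their counting rules evidently differ, and this sketch may not reproduce either count from the raw text. The mapping from deviation to the 0.20% score is not shown on the page and is not modelled here.

```python
import re

TARGET_RATIO = 0.50  # the scenario asks for 50% dialogue


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed definition of 'word')."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (an assumed dialogue marker)."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in quoted_spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return (ratio, absolute deviation from target), both in percent."""
    ratio = words_dialogue / words_total
    return round(ratio * 100, 2), round(abs(ratio - target) * 100, 2)


# Figures taken from the evaluation details: wordsDialogue 461, wordsTotal 754.
print(ratio_and_deviation(461, 754))  # (61.14, 11.14)
```

With the reported counts, 461 / 754 gives a ratio of 61.14%, which is 11.14 percentage points above the 50% target and matches the Details line.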