NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ743HTT7SNY9H1TS1VS5CDS
Run Details
Model:
Qwen 3.6 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
50.6s
Cost:
$0.015145
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
16.38%
Dialogue to Total Word Ratio
(details)
8.1920%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her tinfoil crown, the metallic crinkle syncing with the rhythmic thrum of "Baby Shark" vibrating through the cracked pavement behind the community center. She crouched behind an oversized inflatable unicorn that smelled faintly of PVC and desperation, her fingers working the tension dial on a grappling hook launcher assembled from a leaf blower, a garden rake, and enough silver duct tape to violate several international treaties. The label, scrawled in thick permanent marker, warned "DO NOT BLOW ON CAKE" with aggressive underlining. Inside, the disco ball spun lazily, casting fragmented light through the paper-mache dragon guarding the service entrance. Molly checked her watch, then the red crepe paper crisscrossing the doorway like a spiderweb of doom. The nanny was currently losing a war against a piñata that had fought back with surprising agility. The parents were trapped in a vortex of mandatory small talk about gluten-free cupcake alternatives and tax deductions. The window was twelve minutes wide. Behind her, Bill hyperventilated into a reusable tote bag, his 'World's Okayest Accountant' sweatshirt vibrating with the force of his existential dread. The font on the shirt was Comic Sans, chosen poorly for maximum intimidation avoidance. He clutched a burlap sack that crinkled with every tremor, the fabric woven from recycled potatoes and bad decisions. He had spent the last twenty minutes debating the ethical weight of stealing a Power Wheels jeep, a hesitation Molly viewed with the same patience one reserves for a jammed safe. He wiped a smear of blue marker from his cheek, realizing too late that the paint was waterproof and likely to mark him as a fugitive under the strobe lights. "The perimeter is a tin-can nightmare," Molly whispered, gesturing to the crepe paper laser grid. "Fishing line rigged with enough tension to launch you into the punch bowl. Mrs. Gable is asleep, but she snores like a chainsaw. If she wakes, Gary wakes. Gary has a clipboard." Bill nodded frantically, his eyes wide behind thick spectacles that reflected the flashing party lights like saucers. "I just need to know about the Lego Death Star. Is it sealed? My crypto payout depends on condition, and I have art school loans." Molly ignored the financial panic, her gaze locked on the door hinge. "Death Star is sealed. Power Wheels charged. We move in sixty seconds. You cut the can-sensor array. I handle the lock. Do not touch the face paint. Do not touch the piñata. If you see a child, you become a statue. Stay silent." Bill fumbled for wire cutters, dropping them instantly. They clattered against the concrete, a sound like a gunshot in the quiet alley. He scrambled to retrieve them, face draining of color as dust motes danced in the air around him. Molly closed her eyes, counted to ten, and snatched the cutters back, handing them to him with the gentle precision of someone passing a live grenade wrapped in birthday ribbon. "Gary gets us." Bill breathed, snipped. The line snapped with a soft twang. Molly exhaled, stepped through the parting paper barrier, and vanished into the chaos, her heart pounding a rhythm that felt dangerously close to joy as the scent of vanilla frosting and stolen opportunity washed over her.
created_at
"2026-04-27T09:25:10.626Z"
4/27/2026, 9:25:10 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
186
moderation_latency
(null)
generation_time
50176
tokens_prompt
77
tokens_completion
7640
native_tokens_prompt
76
native_tokens_completion
7559
native_tokens_completion_images
(null)
native_tokens_reasoning
6810
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0151446
router
(null)
provider_responses
0
endpoint_id
"57ebdffc-b059-4d92-a1ac-94436bae3d31"
id
"chatcmpl-a9a7f0850f521aac"
is_byok
false
latency
186
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777281910-lLbYCpXxVtUCOyBU6uKz"
session_id
(null)
api_type
"completions"
id
"gen-1777281910-plx3PYwZ66CMFNZSEEO1"
upstream_id
"chatcmpl-a9a7f0850f521aac"
total_cost
0.0151446
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
538
16.38%
Dialogue to Total Word Ratio
Ratio: 23.99%, Deviation: 6.01%
neededClean
false
wordsTotal
542
wordsDialogue
130
8.1920%