NC Bench
Run: 2026-04-27-01KQ73NKCX1HHMKA1760RXJN1W
Run Details
Model: Qwen 3.6 35B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 38.7s
Cost: $0.010721
Provider: Parasail
Evaluation summary

| Score | Evaluator |
| --- | --- |
| 0.00% | Matches word count |
| 0.00% | Dialogue to Total Word Ratio |
| 0.0004% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her tactical goggles, which were actually swimming goggles she'd drawn mustaches on with a sharpie. Bill stood trembling in the oversized beige hamster suit, his knees knocking together with a sound like maracas. The backyard stretched out before them, illuminated by chaotic glow of battery-operated candles. The target was the living room window, currently occupied by a sleeping dog and a mountain of wrapped cardboard rectangles. Molly checked her watch, a digital stopwatch she'd bought at a gas station. "Five minutes until the song," she hissed, clutching the burlap sack labeled 'Emergency Socks' in permanent marker. "Bill, stop adjusting your hamster ears. You're making a whirring noise." "It's part of the costume, Molly. It adds authenticity." "It sounds like a lawnmower. The security system will trip. Did you get the cutters?" "I have the cutters. I also have the pretzels." "Pretzels? We're on a heist." "I'm hungry. Did you see the cake? Triple-layer chocolate." "I will throw you into the bouncy castle if you don't zip the bag. We go in at the climax of 'Happy Birthday', right?" "Right. Chaos ensues." "Exactly. I distract the mom with a fake choking incident." "You will not fake a choking incident on the hostess. That worked at the jewelry store because it was a diamond dispenser, not a toddler's juice box. We need subtlety." "Subtlety is for librarians. We are going for the grand haul. All the presents. Even the socks." "The socks? Why the socks?" "Cash value, Bill. Everyone hates socks. We can return them to three different stores in forty-five minutes. It's liquid assets." "This is insane. We're risking our necks for socks? I'm risking nothing. You're the one dressed as a giant hamster." "It's for camouflage. Kids love hamsters. Adults ignore hamsters. Zip me up. And if anyone asks, I'm a therapeutic stress animal." "Bill, that suit is too big. Your knees are sticking out. Now, about the distraction. I vote for the magic show gone wrong." 
"You can't do magic. You pulled a carrot out of your ear and gave yourself a concussion." "That was a prop carrot! Look, just get the cutters. We have four minutes before the piñata drops." "Four minutes? That's enough time for the heist, the sock run, and a snack break." "Bill!" "Fine. Cutters. Bag. Hamster suit. Let's steal some birthday loot and go." "Wait. Do we take the piñata?" "Absolutely not. That's just candy and sticks, Molly. Stick to the strategy." "Strategic candy acquisition is valid. Also, I have a question. Why do we need the socks? I have toes." "Focus, Bill. We are professionals. Professionals exploit market inefficiencies. Socks are the undervalued asset of the gift economy. Now, help me lift the tree. One, two, heave!" "I can't heave in this suit. The fur has no structural integrity. And you can't go in as the dog. That is a terrible idea." "It's genius. Everyone loves the dog. The dog can bark, knock over the tree, and create a diversion while you swipe the gifts. You swing the stick. I bark." "The dog doesn't have opposable thumbs, Molly! And who swings the stick? You want me to hit the piñata? The mom is walking over!" "Freeze. Cower adorably! That's a hamster pose." "I'm cowering. Why is the toddler holding a juice box? He's staring at me." "Don't look at the target. Targets are distractions. The presents are the target. If we get caught, Bill, you'll be the one explaining why a hamster is trying to loot a piñata." "Fair point. Ready? If we get caught, I'm blaming the socks." "Ready. Three. Two. One. Bork."
created_at: "2026-04-27T09:17:33.479Z" (4/27/2026, 9:17:33 AM)
model: "qwen/qwen3.6-35b-a3b-20260415"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 212
moderation_latency: (null)
generation_time: 38209
tokens_prompt: 77
tokens_completion: 5456
native_tokens_prompt: 76
native_tokens_completion: 5347
native_tokens_completion_images: (null)
native_tokens_reasoning: 4507
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0107206
router: (null)
provider_responses: 0
endpoint_id: "57ebdffc-b059-4d92-a1ac-94436bae3d31"
id: "chatcmpl-b433f4ea8854b2b9"
is_byok: false
latency: 212
model_permaslug: "qwen/qwen3.6-35b-a3b-20260415"
provider_name: "Parasail"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777281453-q7HyI3112J5k5MqZb7jg"
session_id: (null)
api_type: "completions"
id: "gen-1777281453-qaXcKR41Zo9F6lWLW3dJ"
upstream_id: "chatcmpl-b433f4ea8854b2b9"
total_cost: 0.0107206
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Parasail"
response_cache_source_id: (null)
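The token fields above explain why a roughly 600-word scene consumed several thousand completion tokens. The following is a minimal sketch of that arithmetic, not part of the benchmark itself. It assumes that `generation_time` is in milliseconds (which is consistent with the 38.7s duration reported for the run) and that `native_tokens_completion` includes the reasoning tokens; neither assumption is confirmed by the page.

```python
# Values copied from the run metadata above.
generation_time_ms = 38209          # assumed to be milliseconds
native_tokens_completion = 5347     # assumed to include reasoning tokens
native_tokens_reasoning = 4507

# Tokens left over for the visible response, under the assumption above.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Share of the completion spent on reasoning rather than on the scene.
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Rough generation throughput over the whole completion.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"visible tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions, most of the completion budget went to reasoning rather than to the scene text.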
Evaluation details

| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| 0.00% | Matches word count | n/a | neededClean: false; words: 603 |
| 0.00% | Dialogue to Total Word Ratio | Ratio: 85.31%, Deviation: 15.31% | neededClean: false; wordsTotal: 606; wordsDialogue: 517 |

Overall: 0.0004%
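The reported ratio and deviation can be reproduced from the evaluator's own counts. The sketch below assumes that the deviation is the absolute difference, in percentage points, between the measured ratio and the 70% target in the prompt; the function names are hypothetical, and the page does not show how a deviation is converted into the final score, so that step is left out.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Percentage of all words that fall inside dialogue."""
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 70.0) -> float:
    """Assumed definition: absolute gap to the target, in percentage points."""
    return abs(ratio_pct - target_pct)


# Counts taken from the evaluation details above.
ratio = dialogue_ratio(words_dialogue=517, words_total=606)
deviation = deviation_from_target(ratio)

# Word-count check: 603 counted words against the 500 requested.
word_overshoot = 603 - 500
word_overshoot_pct = 100.0 * word_overshoot / 500

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Word count overshoot: {word_overshoot} words ({word_overshoot_pct:.1f}%)")
```

The first printed line matches the Details column for the dialogue-ratio evaluator. The second shows that the response ran 103 words over the requested length.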