NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ72NVEJ9GY9S7286MXZ871J
Run Details
Model:
Qwen 3.6 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
1m 5s
Cost:
$0.015647
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her night-vision goggles over her rainbow-colored birthday hat. She tapped a manicured nail against the dashboard, eyes fixed on the suburban house where festive lights pulsed in a rhythmic, judgmental display. The van smelled of stale coffee and high-stakes felony. "Do you have the socks?" Bill squeaked from the passenger seat, clutching a burlap sack that smelled aggressively of wet dog. His knuckles were white, his posture suggesting he expected the vehicle to spontaneously combust at any moment. "The socks?" Bill asked, his voice cracking. "I have the socks. But Molly, why do I need the socks? The targets are wearing socks. Why would I need my socks?" "Distraction, Bill. Basic misdirection. You trip, you look like a clumsy dad, you offer a spare pair. By the time they're arguing about the sock quality, I've bagged the LEGO set." Bill shifted his weight, his knees knocking together like maracas in a bad salsa record. He stared at the sack as if it contained radioactive plutonium. "I'm not tripping, Molly. I have balance. I'm a former accountant. I don't trip; I calculate." Molly checked the timer on her wrist. The display glowed green, marking the critical window of vulnerability. "You tripped over the sidewalk yesterday, Bill. You blamed the pavement's aggression." "That pavement was an accomplice to chaos! Anyway, what's the timeline? You said 'nap hour,' but the parents are still up. I saw a father doing craft. Why is the father doing craft?" "Because it's a unicorn-themed bash, you amateur. Craft is mandatory until 6:30. We move at 6:31. That's our window. The PiƱata will be the perimeter security. Once the kids take out the donkey, everyone descends. That's when we hit the pile." Bill's breath hitched. He looked at the rearview mirror, half-expecting a swarm of toddlers to be lurking in the backseat. The air in the van felt thick with anticipation. "The pile? You mean the mountain of wrapped plastic rectangles? Molly, we're talking forty pounds of merchandise. How are we getting that out? The exit is through the ball pit." Molly reached under the seat and heaved up a grey, wrinkled jumpsuit that pulsed faintly with the hum of industrial suction. She held it out like a sacred relic. "We use the vacuum suit." Bill recoiled, pressing himself against the door handle. The handle clicked ominously. "The what?" "The vacuum suit. I stole it from a janitor's closet in 1998. It can compress presents by eighty percent. We pack the loot, we suck the air, we look like deflated pancakes, and we roll out." "I don't want to roll out as a pancake. I have a reputation. Last week I was 'The Silhouette.' I can't be 'The Deflated Pancake.'" Molly tossed the suit into Bill's lap. She leaned across the console, her face inches from his, the green glow of her goggles reflecting in his terrified pupils. "You can be the Deflated Pancake with a Silhouette interior. Just get in the suit, Bill. And bring the socks." Bill fumbled with the sack, nearly dropping it on the floor mat. He looked at the house, then at the suit, then back at the house. "I still don't get the socks." "Trust the process. Now, check the audio feed. Are they playing 'Baby Shark'?" Bill tapped a dial on his belt. A muffled, repetitive shark sound leaked through the speakers, accompanied by the frantic yapping of a small dog. "They are. And I think the dog is barking in the key of C-sharp." Molly grinned, a sharp expression that lacked any warmth. She popped the locks on the van doors. "Perfect. The audio covers our scuttling. Grab the magnetized gloves. We're going shopping." Bill slowly pulled on the grey gloves, wincing as the elastic bit into his wrists. He stood up, the vacuum suit rustling like a dying wind instrument. "I hate children." "Love you too, Bill. Let's roll."
created_at
"2026-04-27T09:00:13.146Z"
4/27/2026, 9:00:13 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
222
moderation_latency
(null)
generation_time
54832
tokens_prompt
77
tokens_completion
7569
native_tokens_prompt
76
native_tokens_completion
7810
native_tokens_completion_images
(null)
native_tokens_reasoning
6563
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0156466
router
(null)
provider_responses
0
endpoint_id
"179c6985-3a1c-48f1-a009-9f3e3e402cdd"
is_byok
false
latency
10077
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"AtlasCloud"
status
504
1
endpoint_id
"57ebdffc-b059-4d92-a1ac-94436bae3d31"
id
"chatcmpl-bd9b6d4077d51384"
is_byok
false
latency
222
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777280413-fhSOyc7gOIQxNCutw0GT"
session_id
(null)
api_type
"completions"
id
"gen-1777280413-eicu2lzq4wQsraCoelQN"
upstream_id
"chatcmpl-bd9b6d4077d51384"
total_cost
0.0156466
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
652
0.00%
Dialogue to Total Word Ratio
Ratio: 51.29%, Deviation: 18.71%
neededClean
false
wordsTotal
657
wordsDialogue
337
0.0000%