NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ72YWMM2G2XG5WZHWPJGMTH
Run Details
Model:
Qwen 3.6 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 51s
Cost:
$0.024099
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The tactical blueprint of the Henderson residence was spread across a sticky motel table, weighed down by a half-eaten bag of beef jerky and a plastic party horn. Molly traced the layout of the suburban living room with a grease pencil, her brow furrowed in the concentrated manner of a surgeon preparing for a routine appendectomy. Bill sat opposite her, meticulously sharpening a butter knife with a pocket whetstone. The rhythmic scraping echoed in the cramped room like a metronome set to anxious. The target was unprecedented. A children’s birthday party. Specifically, the unwrapped gifts arranged beneath a paper-laden tree in the Henderson parlor. The sheer audacity of it still made Molly’s fingers twitch with anticipation. Professional infiltrators avoided pediatric celebrations like plague zones. The sugar levels alone could destabilize a tactical grid. Yet the payout was substantial. High-value merchandise. Untraceable electronics. Limited edition plush bears. Outside the motel window, a rusted van sat idling in the gravel lot, its rear doors stacked with heavy-duty canvas sacks and infrared cutters. The vehicle smelled faintly of pine air freshener and old gym bags. Molly ran a diagnostic on the encrypted tablet, cross-referencing traffic patterns with the local school bus schedule. Every second counted when navigating a suburb saturated with minivans. Bill inspected the velvet sacks, checking each seam for tears. The fabric muffled the clinking of toy packaging and smelled like lavender and regret. Molly synced the tablet to the van’s ignition system, programming a fake traffic alert to delay emergency responders if the silent alarm tripped. The algorithm was elegant. She calibrated the garage door opener replica, ensuring the frequency matched the Henderson household’s outdated remote. The plastic casing felt cheap, but the circuitry was military grade. Bill tested the slingshot tension, firing a rubber band at a fly on the wall. It missed. He tried again. It hit. Molly noted the trajectory in her log. Precision mattered more than luck. The neighborhood streets were quiet. She visualized the escape route, mapping every intersection and impatient commuter. The plan was flawless on paper. Reality usually disagreed. She packed the final toolkit, aligning every wrench with obsessive symmetry. There was no room for error. Only silence, speed, and velvet. The tactical map featured annotated fire escapes and a highlighted kitchen window. Molly adjusted her goggles, checking polarization settings. Bill tightened his silent boots, the rubber soles specially treated with sound-dampening polymer. Every piece of equipment had been stress-tested. Every contingency had been simulated. The margin for failure was nonexistent. They operated in a domain of confetti and cake. Molly tapped the diagram. We bypass the patio doors. We wait for the sugar crash. Bill stopped sharpening. The window is forty minutes. I have timed it. He adjusted his utility belt. What is the extraction route? Molly flipped the map. We use the laundry chute. It dumps into the neighbor bin. Bill nodded slowly. I packed the velvet sacks. Molly glanced at him. For stolen toys? Bill shrugged. The parents will suspect nothing. We are curators. Molly sighed and reached for the blueprints again. The party starts at seven. We breach at six forty-five. I need you to disable the smart speaker. It alerts the mother’s phone. Bill stood and pocketed the whetstone. Consider it handled. I will reprogram the playlist to autoplay polka. She raised an eyebrow. Polka? Bill grinned. Chaos is a ladder. Molly rolled her eyes but allowed a faint smile. Check the timing on the dog. The beagle. Bill held up a small vial. Comedown drops. He will be snoring by the time we cross the lawn. Molly traced the kitchen window on the map. We move fast. We grab the tree. We retreat. No hesitation. The children will wake to empty wrists. Bill zipped his duffel bag. I will bring the truck. Molly picked up the party horn and blew a single, tragic note. Tomorrow night, we become legends. Bill shouldered his coat. The job is simple. The payout is sentimental. And I always collect what is mine.
created_at
"2026-04-27T09:05:09.275Z"
4/27/2026, 9:05:09 AM
model
"qwen/qwen3.6-35b-a3b-20260415"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
250
moderation_latency
(null)
generation_time
109678
tokens_prompt
77
tokens_completion
12583
native_tokens_prompt
76
native_tokens_completion
12036
native_tokens_completion_images
(null)
native_tokens_reasoning
11531
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0240986
router
(null)
provider_responses
0
endpoint_id
"57ebdffc-b059-4d92-a1ac-94436bae3d31"
id
"chatcmpl-a74100e46484866e"
is_byok
false
latency
250
model_permaslug
"qwen/qwen3.6-35b-a3b-20260415"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777280709-sNVdL1jq83Z1dHFhfBsB"
session_id
(null)
api_type
"completions"
id
"gen-1777280709-apOh3xDLpzMAvG1A07F4"
upstream_id
"chatcmpl-a74100e46484866e"
total_cost
0.0240986
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
671
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
679
wordsDialogue
0
0.0000%