NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ747D1SBRH7GBQ11QSCYZYC
Run Details
Model:
Qwen 3.6 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
3m 22s
Cost:
$0.044742
Provider:
Morph
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her tactical vest and studied the suburban house across the street. The driveway was plastered with neon streamers while a plastic flamingo stood guard by the mail slot. Through the open garage, they could hear the muffled shrieks of six children running loose. Bill leaned against the idling van, polishing his lockpicks with a damp handkerchief. He looked entirely too calm for a man about to commit armed burglary at a birthday party. Alright, Molly whispered, tapping her clipboard. Let us review the manifest. Sixteen unwrapped boxes, three gift bags, one massive plush unicorn currently guarding the living room. We swipe the lot, we load the van, we vanish. Standard playground extraction. Bill chewed a granola bar. You realize we are stealing children gifts, right. Not vintage watches. Not bearer bonds. We are going to be walking away with three dozen coloring books and a remote control dinosaur that barks. It is not about the objects, Bill. It is about the principle. The market for unopened joy is surprisingly liquid. Besides, the insurance payout on missing presents during a party is astronomical. Parents assume they are lost in the shuffle. We just need to be the shuffle. I was thinking we go in through the back window. Drop a smoke pellet shaped like a cupcake, cause a distraction, sweep the room, and we are out before the punch bowl is refreshed. You actually built one of those? Obviously. It lasts four seconds. Long enough to grab the presents and vanish. Four seconds for sixteen packages and a giant stuffed animal. Those things are basically bricks wrapped in polyester. My back is not a crane. Wear the exoskeleton. The what? I printed it out of bicycle parts and expensive Velcro. You strap it on, carry the loot, and walk like a determined mailman. I do not want to look like a determined mailman during a heist. We are professionals. We need elegance. We must not trigger a security camera disguised as a wind chime. That is why I have the electromagnetic disruptor. I will throw it at the wind chime, the camera goes dark, you slide in the door, and we clear the room on three. One. Two. Three. You know, Molly, that three year old throwing a tantrum over a missing juice box could accomplish the same chaos in half the time without the Velcro. Do not even joke about letting a toddler run our operation. Last time we outsourced planning to a preschooler, we ended up stealing a trampoline and a half eaten pizza. We are sticking to the plan. I will handle the unicorn. You take the gift bags. We coordinate on the walkie talkie. Do not drop the mic when you yell at the children to go outside. You will. They will absolutely question why two adults in tactical vests are aggressively carrying away their birthday presents. We need a cover story. We are the party cleaners. Early departure special. We take the gifts, leave with a smile, and pretend the parents invited us. It is just business. Grab your tools. Adjust your goggles. And for the love of all that is holy, do not track glitter on the clean carpet. Glitter is the kryptonite of organized crime. Then stick to the windows. On my mark. Three. Two. One. Go. They moved as one unit, slipping past the plastic flamingo with the grace of two grocery shoppers who had just remembered they left the oven on.
created_at
"2026-04-27T09:27:16.804Z"
4/27/2026, 9:27:16 AM
model
"qwen/qwen3.6-27b-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
785
moderation_latency
(null)
generation_time
201717
tokens_prompt
77
tokens_completion
17207
native_tokens_prompt
76
native_tokens_completion
18625
native_tokens_completion_images
(null)
native_tokens_reasoning
16345
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0447418
router
(null)
provider_responses
0
endpoint_id
"826cab4e-dc26-4472-83be-e4ddd5ebd2ab"
id
"393a71c282c046ceaf149377e0fc32ff"
is_byok
false
latency
607
model_permaslug
"qwen/qwen3.6-27b-20260422"
provider_name
"Morph"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777282036-WrrTpd1mq8cZoGnewUYu"
session_id
(null)
api_type
"completions"
id
"gen-1777282036-y6ckCaZqU4smr79BjThS"
upstream_id
"393a71c282c046ceaf149377e0fc32ff"
total_cost
0.0447418
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Morph"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
581
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
581
wordsDialogue
0
0.0000%