NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ76YS1T0HANR2FV00JCC7W7
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
31.9s
Cost:
$0.009991
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
52.72%
Dialogue to Total Word Ratio
(details)
26.3620%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprint lay spread across the cracked linoleum, illuminated by the sickly buzz of a vending machine. Molly traced the perimeter of the community center with a grease pencil, her movements sharp and practiced. She adjusted her dark sweater and checked the tension on her utility belt. "Right. The main hall. Three exits. One blocked by a bouncy castle the size of a minivan." "I factor in the bounce. If we go through the gift wrap station, we clear the perimeter in forty seconds." "Forty seconds gives us a window for exactly two gift bags. We need every last one. The ledger is specific. No half-measures." "I know the ledger. The ledger costs me sleep. The ledger keeps me from buying another thermal imaging camera I don't need." Bill tapped the pencil against his chin, leaving a faint white streak across his cheekbone. He studied the floor plan like a military general surveying a particularly confusing kindergarten, occasionally muttering about ventilation shafts that did not exist. He unrolled a second map featuring snack tables and juice box distribution routes. "What about the security?" "Low-level threat. A woman named Brenda with a clipboard and a terrifying knowledge of party favors. She watches the clock like a hawk. Or a very tired mother. Same difference." "We need a distraction. Something that draws her attention away from the tree." "I have a plan. We trigger the piñata protocol." "The piñata protocol involves a stuffed donkey and a blindfolded seven-year-old named Timothy." "Exactly. Timothy swings. Brenda cheers. We slide under the folding chairs. Silent. Swift. Presently loaded." "Loaded how? We don't have the duffels." "I packed them in the recycling bin. The municipal ones. The blue ones. Everyone assumes they belong to the city." "Bill. That is either brilliant or completely unhinged." "Is there a difference on Tuesday nights?" Molly exhaled through her nose, folding the blueprint into a neat square. She slid it into her jacket pocket and checked her watch, rubbing a smudge of paper residue against her thumb. She tapped her boot twice against the floor and glanced toward the alley door. "Check the equipment one more time." "Velcro gloves, anti-static dusters, six complimentary juice boxes for the road, and my lucky squeaky toy." "The squeaky toy stays in the bag." "It boosts morale. You have heard the acoustics in a church basement. They are brutal." "We are not in a church basement. We are invading a six-year-old birthday. Keep the squeak dampened." "Dampened squeak, silent approach, full extraction of all wrapped assets." "And remember the rules of engagement. Nothing breaks. Nothing cries. If a child wakes up, we leave them a coupon for free video games." "They do not have video games at four years old, Molly." "Then leave them a lollipop. And we move at eight-oh-seven. Right after the cake cuts. Right before the goodbyes. We are ghosts. We are wind. We are aggressively organized." "Aggressively organized. I love it." "Do not love it too much. Just do it right." "I always do it right. You know that. I will grab the van. Meet you at the dumpster behind the multiplex." "I will be there. And Bill?" "Yeah?" "Wear the sensible shoes." "Always. For the presents."
created_at
"2026-04-27T10:14:59.904Z"
4/27/2026, 10:14:59 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
621
moderation_latency
(null)
generation_time
31820
tokens_prompt
77
tokens_completion
6923
native_tokens_prompt
76
native_tokens_completion
6648
native_tokens_completion_images
(null)
native_tokens_reasoning
5907
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.009991
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-d45c1af8-c172-9abe-b45e-0229515e6a94"
is_byok
false
latency
621
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777284899-sikLSQGp6tC0fYi5qJev"
session_id
(null)
api_type
"completions"
id
"gen-1777284899-tE6lss0MzMKsjNTXtfyF"
upstream_id
"chatcmpl-d45c1af8-c172-9abe-b45e-0229515e6a94"
total_cost
0.009991
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
537
52.72%
Dialogue to Total Word Ratio
Ratio: 73.58%, Deviation: 3.58%
neededClean
false
wordsTotal
545
wordsDialogue
401
26.3620%