NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ73BP11AES9NMSGMFV318AR
Run Details
Model:
Qwen 3.6 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 3s
Cost:
$0.031776
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The neon diner sign buzzed overhead, casting a sickly pink glow across the sticky vinyl booth. Molly tapped a crumpled invitation for Timmy Hendersons seventh birthday bash. You realize the target is fundamentally a bouncy castle surrounded by screaming toddlers right. The diner radio crackled with static. Outside, rain tapped against the windshield. Bill slowly stirred his lukewarm coffee. It is a high security facility disguised as a suburban backyard. The perimeter is guarded by parents armed with aggressive photography and expired patience. The loot is lightweight, highly portable, and packed with zero anti theft technology. We are talking about wind up dinosaurs, glow sticks, and forty three goody bags filled with cheap plastic jewelry. My grandmother could crack the safe with a knitting needle. Your grandmother is seventy eight and currently serving a suspended sentence for stealing a community garden gnome. I prefer professionals. Bill adjusted his tweed cap over thinning hair. We execute during the cupcake distribution window. Everyone will be distracted by frosting on their chins and the sudden onset of sugar induced chaos. We sweep the present table, load the duffels, and exit through the side gate before the piñata breaks. What about security. Security is a golden retriever named Barnaby who sleeps under a folding chair alongside two teenagers hired to monitor the hot dog bun situation. We toss the dog a meatball. We distract the teenagers by asking if they have seen a missing party favor named Steve. They will waste forty minutes looking. And if a child spots us. We hand them a juice box and tell them the magic elves need supplies. Children believe in magic elves. It is our greatest exploit. You have been reading too many parenting blogs. We need disguises. Something that signals we belong in a space decorated with paper lanterns and aggressive streamers. I already bought the outfits. Bill slid a plastic bag across the laminate table. Inside were bright yellow ponchos, unicycle party hats, and oversized foam dice. We are the piñata wranglers. No one questions the people responsible for swinging a donkey shaped cylinder. That is either brilliant or deeply concerning. I prefer brilliant. What is the extraction signal. When I scream about a surplus of glitter bombs. You roll the duffels out the side gate. I will cover with confetti. Confetti causes sneezing not panic. It is psychological warfare. You are thinking too small. We are redistributing duplicate Lego sets to the underground economy. You and I are going to get tackled by the homeowners association for excessive joy. Fine. Route mapped. We enter through the compost bin, cover the distance in four strides, and load up. Total operation time is ninety seconds. Long enough for one tantrum, short enough that nobody notices the presents vanished. Start the van.
created_at
"2026-04-27T09:12:08.487Z"
4/27/2026, 9:12:08 AM
model
"qwen/qwen3.6-27b-20260422"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
763
moderation_latency
(null)
generation_time
123360
tokens_prompt
77
tokens_completion
8165
native_tokens_prompt
76
native_tokens_completion
8814
native_tokens_completion_images
(null)
native_tokens_reasoning
8240
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.031776
router
(null)
provider_responses
0
endpoint_id
"82097366-3a73-4790-ac36-f96ba9f6f577"
id
"chatcmpl-e68bafa1-6a5e-91db-b219-882a5a493b22"
is_byok
false
latency
763
model_permaslug
"qwen/qwen3.6-27b-20260422"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777281128-d5aYTNHHJd2z27divXNz"
session_id
(null)
api_type
"completions"
id
"gen-1777281128-kAHVy8EugnrRRkzHI7yw"
upstream_id
"chatcmpl-e68bafa1-6a5e-91db-b219-882a5a493b22"
total_cost
0.031776
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
465
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
465
wordsDialogue
0
0.0000%