NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ7A07ZW0C7ZHC4Z6CX5HS7J
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
46.7s
Cost:
$0.014005
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her wire earpiece and squinted at the blueprints spread across Bill’s kitchen table. The plans depicted a suburban living room, a plastic piñata, and a suspiciously vulnerable pile of wrapped cardboard boxes. The kitchen smelled of stale coffee. She traced the perimeter with a grease-stained finger against the laminated surface. We have a three minute window, Bill said, tapping the diagram. Two adults will be manning the inflatable bouncer. They will be drinking wine and checking their phones. Three minutes is tight. I need at least four for the good stuff. The Lego castle weighs seven pounds. If we drag it through the cat flap, the frame will split. She shifted the blueprints. Then we use the delivery route. The truck comes at nine. They stack the presents near the back door. Perfect staging area. He poured two cups of coffee onto chipped ceramic mugs. I will need a distraction. Something irresistible to children. Try the juice boxes. The kind with plastic spouts. If you leave three on the staircase, they will swarm. Parents get confused. Chaos ensures. He slid a napkin across the table. Excellent. What about the extraction? I have a stroller. It has a hidden compartment. We stuff the bags in, pull the canopy down, and roll out the front door like any other exhausted mother. They will not question the stroller. She folded the map into a tight, nervous square. I am concerned about the noise. Cardboard tears loudly. Tape peels like a dying seagull. Wear the thick gloves. The ones lined with felt. Move slow. Breathe. Remember why we are doing this. For the retirement fund. For the waterfront condo. For the absolute necessity of unopened board games. He checked his watch, noting the fading daylight. I am doing it for the vintage action figures. The ones still in plastic. Pristine condition. She adjusted her wireless receiver, checking the battery indicator. I need the heavy duty cart. The wheels are quiet. Rubber treads. No squeaking on hardwood. She pulled a heavy catalog from her leather jacket. Same thing, Bill. Just heavier inventory. What is the drop time? Eleven forty five. Party ends, parents start loading cars, we slip out the side gate before they realize the tree has been stripped bare. I will bring the magnetic lock pick. And the velvet pouches. I always bring extra pouches for the smaller gift boxes. Pack extra duct tape. And maybe a snack. You get hangry when you are sweating through a paper crown. I am always hangry. It keeps me sharp. Dave is not involved. He is just the unwitting delivery volunteer. Exactly. A flawless alibi wrapped in disposable cups. Now check your gloves. We move Friday.
created_at
"2026-04-27T11:08:13.7Z"
4/27/2026, 11:08:13 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
684
moderation_latency
(null)
generation_time
46536
tokens_prompt
77
tokens_completion
9659
native_tokens_prompt
76
native_tokens_completion
9324
native_tokens_completion_images
(null)
native_tokens_reasoning
8725
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.014005
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-8f756ba6-61e3-9e93-8393-7518484a3dcf"
is_byok
false
latency
684
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777288093-OMLr0gY59XYNlUVP7Wz0"
session_id
(null)
api_type
"completions"
id
"gen-1777288093-0aPGVoPxXIUv1g5xrSKM"
upstream_id
"chatcmpl-8f756ba6-61e3-9e93-8393-7518484a3dcf"
total_cost
0.014005
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
452
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
453
wordsDialogue
0
0.0000%