NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK2906PGA2G9T0SP68YYE3H
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 31s
Cost:
$0.021242
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly peeked through the cheap vinyl blinds. Colorful streamers hung limp against the drywall like depressed snakes. It is quiet, she whispered. Bill adjusted his black ski mask. It was too tight around the ears. Too quiet. Where are the parents? McDonald's run. Twenty minutes max. They ordered the Happy Meal bundle. We are stealing from toddlers, Mol. That is low. Even for us. It is high yield. Lego sets are basically currency. And cash envelopes. Parents love giving cash now. I feel sick. My stomach is doing flips. Put the vomit away. We need the grab bags too. Do not leave the partial bags. Grab bags? Party favors. Plastic whistles and sticky hands. Resale value on eBay is surprising. Bill sighed heavily. The fabric of his mask stretched. What about the clown? What clown? The booking confirmation said Binky the Bonker. I hacked the email. Molly paused. Her hand hovered over the latch. Clowns know security systems. Exactly. He could be watching us right now. Behind the balloon arch. Do not be paranoid. He is probably inflating balloons in the van. Balloon animals are weapons, Mol. Twist one wrong and it is a shank. Focus. The tree is loaded. Wrapped gifts everywhere. Gold foil paper. It is a six-year-old birthday. What are we going to do with a dollhouse? Fence it. Or live in it. Rent is high in this city. Bill checked his digital watch. Seventeen minutes until the nuggets arrive. Go time. You take the kitchen table, I will hit the tree. Wait. Did you hear that? Hear what? Laughter. Evil laughter. It is a cartoon on the TV. Leave it. I cannot do this. My mother raised me better than this. Your mother raised you to be a thief. She taught you the lockpick trade. She said not to steal from kids. Specifically kids. She cried when I stole that bicycle. That was a community bike. It had a basket. Focus. Okay. Bill kicked a red toy truck. It squeaked loudly in the silent room. Great. Now the Lego Avengers know we are here. Shh. Did you secure the perimeter? I tied the dog up with a jump rope. A jump rope? It was accessible near the door. You are an amateur. What if the dog chews through it? He is a pug. He will just fall asleep. They sleep twelve hours a day. Molly shoved a large canvas bag into Bill chest. Start stuffing. Velcro wraps are easiest to rip. This feels wrong. Like stealing Christmas. It is July. Metaphorically. Less metaphoring, more larceny. Bill picked up a large rectangular gift. It is heavy. Gold bullion? No. It is a play kitchen. Complete with plastic sink. Throw it. We want high value, low weight. Cash and electronics. What about this box? Legos. Perfect. Star Wars set. Mol, the dog is barking. The pug? Yeah. Ignore it. He sounds angry. Pugs do not get angry. They get wheezy. Something is coming up the driveway. Headlights. Molly looked out the window. Her eyes widened. It is the clown. I told you. He is holding balloons. And he is running. Run? Run. Wait, grab the cake money. No time. Just the envelope. Fine. He is at the door. Back window. Now. Where is the getaway car? Around the corner. Is the engine running? Always. Good. Okay. Move.
created_at
"2026-02-16T11:12:25.35059+00:00"
2/16/2026, 11:12:25 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
834
moderation_latency
(null)
generation_time
150505
tokens_prompt
77
tokens_completion
5855
native_tokens_prompt
76
native_tokens_completion
5888
native_tokens_completion_images
(null)
native_tokens_reasoning
5054
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0212424
router
(null)
provider_responses
0
id
"chatcmpl-5e9accaf-9fd1-9056-ab6a-b70ea90151e5"
status
200
is_byok
false
latency
834
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771240194-OKGHeyVvbksmmW3Szc1X"
upstream_id
"chatcmpl-5e9accaf-9fd1-9056-ab6a-b70ea90151e5"
total_cost
0.0212424
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
561
0.0000%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
561
wordsDialogue
0
0.0000%