NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK3401G04QQJA6R8SP9VTA3
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
4m 25s
Cost:
$0.036758
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread blueprints across the sticky diner table. The paper smelled like syrup. Bill stirred his cold coffee. They looked like criminals, minus the masks. It is literally a birthday party, Bill said. I know exactly what it is, Molly replied sharply. For kids. Small humans. Yes, Bill. Children. That is where the unguarded loot is stored. They get LEGO. It is mostly plastic bricks and cheap cardboard. She tapped the map with a manicured nail. High value sets are directly under the tree. We go in at peak sugar crash. The parents will be zonked out on the patio drinking wine. What about the clown? Molly paused mid-sentence. She had not considered the clown. A rogue variable in an otherwise tight operation. We neutralize the clown immediately. How do you neutralize a clown? Distraction. Balloon animals. He cannot resist a very professional challenge. What if he is good? Then we tie him up with his own ribbons. Bill sighed and sipped cold coffee. This felt like a new low for their long career. They had stolen diamonds from a museum last year. Now they were targeting a six-year-old's birthday bash in the boring suburbs. Why are we doing this again? Because the client pays double for untraceable goods, she said firmly. Toy resale market is booming globally. I feel bad about it. Do not feel bad. Feel rich instead. He looked at the grainy photo of the house. It had a massive inflatable castle in the front yard. That thing looked like a bouncy obstacle course designed by a demon. The fence is six feet tall. I can jump six feet easily. There are dogs inside. I brought special treats for them. Are they spicy treats? Just enough to make them sleep. Bill shook his head slowly. He really needed to find a new partner. Someone who wanted to rob banks, not Kindergarteners. But the rent was due and Molly had a way of making insanity sound completely logical. Okay. What is the entry signal? When the cake comes out. Everyone sings happy birthday. That is our window. And the getaway vehicle? The ice cream truck. It is perfect camouflage. Bill stared at her wide-eyed. She thought they could escape in a music truck. That is the stupidest plan ever made. It is brilliant strategy. It is completely insane. It is work for us. Do we wear masks? Only if necessary. Ready? Always. Promise? Promise. Good. He finished his coffee and stood up. He grabbed his leather jacket. Robbing toddlers required better gear. Let us go now. Molly smiled. She folded the map and put it in her purse. Even if the loot was mostly action figures and gift cards.
created_at
"2026-02-16T11:29:04.535299+00:00"
2/16/2026, 11:29:04 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
823
moderation_latency
(null)
generation_time
264600
tokens_prompt
77
tokens_completion
10894
native_tokens_prompt
76
native_tokens_completion
10198
native_tokens_completion_images
(null)
native_tokens_reasoning
9585
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0367584
router
(null)
provider_responses
0
id
"chatcmpl-0a264402-5005-92dd-83fd-9e570bc4e013"
status
200
is_byok
false
latency
823
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771241078-qbdKpZxAanJcWTUhZgAE"
upstream_id
"chatcmpl-0a264402-5005-92dd-83fd-9e570bc4e013"
total_cost
0.0367584
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
455
0.0000%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
455
wordsDialogue
0
0.0000%