NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK28DV8PNT3ZBMRERBZVBBF
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4m 8s
Cost:
$0.031988
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the hydrangea bush, adjusting her night vision goggles until they sat comfortably. The suburban lawn stretched out before them, illuminated by inflatable castle lights. Bill checked his lockpick set, clicking the metal tools together for comfort. This is low, even for us, Bill whispered while watching the window. It is easy money, Molly replied without looking away. There are clowns, Mol. I really hate clowns. Focus on the gifts, not the entertainers or their shoes. What if there is a magician too. Then we steal his rabbit as well. Bill shuddered. He shifted his weight, causing a twig to snap loudly under his boot. Molly froze, glaring at him. She pointed toward the back door. The smell of cheap pizza drifted over the fence. It was the scent of childhood innocence, which Molly intended to monetize. She pulled out a blueprint drawn on a napkin. The kitchen layout was complex. A golden retriever named Buster guarded the perimeter near the fridge. What is the plan for the dog inside. I brought treats in my left pocket. Will that work on a guard animal like that. He is a lab. Food is his only god ever. Okay. I will watch the perimeter closely. Good. Do not break any more gnomes. Bill nodded. He looked at the inflatable castle again. It mocked him, a giant purple dragon smiling down with plastic teeth that gleamed in the moonlight. Molly began to crawl forward, her gear rustling against the wet grass. She signaled for Bill to cover the shed. He moved with the grace of a falling refrigerator. The figure shattered quietly. Molly did not look back. She reached the sliding glass door. It was a standard child-proof latch. She produced a small tool and jimmied the latch. The door slid open with a smooth hiss of rubber seals against the frame. Inside, the chaos of the party aftermath awaited them fully. Tables were draped in sticky cloths. Chairs were overturned. A piƱata hung from the ceiling, decapitated and empty. Is it clear to enter now. Clear. Just grab the big boxes near the tree. What about the greeting cards on the table. Cash goes in the bag. Cards stay there. Roger that. I am loading the sack now. Did you get the laptop. Yes. It is in the bag. Bill entered behind her. It was a fortress of cardboard. Molly started loading her sack. Bill picked up a dollhouse. Molly slapped his hand away. They moved quickly. The job was too easy. A light flicked on upstairs. A voice called out. The floorboards creaked under their heavy tactical boots as they navigated the minefield of toys. Molly ignored the Lego sets in favor of the larger boxes. Bill hesitated as if checking the weight for value. Daddy. Is that you out there. Run. They sprinted into the night, sacks thumping against their sides, leaving behind a trail of shredded wrapping paper.
created_at
"2026-02-16T11:13:43.383965+00:00"
2/16/2026, 11:13:43 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1063
moderation_latency
(null)
generation_time
247679
tokens_prompt
77
tokens_completion
8858
native_tokens_prompt
76
native_tokens_completion
8873
native_tokens_completion_images
(null)
native_tokens_reasoning
8226
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0319884
router
(null)
provider_responses
0
id
"chatcmpl-57544882-490b-96f5-a8f8-c57777f845b5"
status
200
is_byok
false
latency
1063
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771240175-eyCLBnMX5xHuPwxntYag"
upstream_id
"chatcmpl-57544882-490b-96f5-a8f8-c57777f845b5"
total_cost
0.0319884
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
95.9868%
Matches word count
n/a
neededClean
false
words
492
0.0000%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
492
wordsDialogue
0
47.9934%