NC Bench
Run: 2026-02-16-01KHK39RYDBDQE83AVBG02CVQF
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5m 20s
Cost:
$0.043804
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind the hedge, adjusting her black beanie. The lawn stretched out before them, dotted with colorful tables and shrieking children. Bill adjusted his grip on the burlap sack, looking very uneasy. He shifted his weight nervously. The air smelled of cut grass and sugar. A dog barked in the distance. "This feels morally wrong, Mol," Bill whispered, wiping sweat from his brow. "It is just logistics, Bill. Think of it as supply and demand," Molly replied, eyes fixed on the pile of wrapped boxes. "They are innocent children. We are stealing magic right from under their noses." "We are redistributing wealth. Look at that Lego set. Do you know what that fetches?" Bill sighed, shifting the sack. "I think that kid is watching us closely." "He is looking at the magician. Keep your focus on the prize." The magician pulled a rabbit from a hat, distracting the crowd. Molly saw her window. She pointed toward the unlocked patio door. The handle gleamed in the moonlight. Moths circled the porch light above. "We go in through the kitchen. The parents will be drinking wine by now." "What about the clown? He looks suspicious and is staring at our position." "Clowns are distracted by balloons. It is their weakness and we will exploit it." "We need a bigger bag. There are simply too many presents here." "We do multiple trips. Efficiency is the key to success." "I did not sign up for trips. I thought this was a smash and grab." "Nothing is smash and grab here. There is too much glass." Molly checked her watch. The cake ceremony was starting. The lights dimmed as candles were lit. This was the moment. The shadows lengthened across the yard. "Now or never, Bill. Move your feet." "I think we should reconsider the ethics of this situation." "Ethics do not pay the rent. Grab the sack." Bill stood up, brushing grass off his knees. He looked at the house. His heart raced in his chest. He took a deep breath. "If we get caught, I am telling them you made me do it." 
"That is protocol. Let us move." They crept across the lawn. A ball rolled toward them. Bill kicked it away. The night was quiet otherwise. "Nice save," Molly muttered. "I played striker in high school," Bill said. "Focus on the loot." They reached the door. Molly picked the lock. It clicked open. The mechanism surrendered easily. "Inside. Grab the big boxes." Bill hesitated. "Do you hear singing?" "Happy Birthday," Molly recited. "Let us go." They slipped into the kitchen. Pizza smell filled the air. Molly pointed to the room. Bill tightened his grip. The floor creaked under their boots. Dust motes danced in the light. "Remember, no noise." "I know how to be quiet," Bill whispered softly. Molly nodded slowly. The heist was underway. They vanished into the dark house. Silence returned to the yard.
created_at
"2026-02-16T11:33:08.156805+00:00"
2/16/2026, 11:33:08 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
858
moderation_latency
(null)
generation_time
319528
tokens_prompt
77
tokens_completion
11449
native_tokens_prompt
76
native_tokens_completion
12155
native_tokens_completion_images
(null)
native_tokens_reasoning
11490
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0438036
router
(null)
provider_responses
0
id
"chatcmpl-4878a189-4de4-91b6-ab0e-be2f98e7ac40"
status
200
is_byok
false
latency
858
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771241268-idgXSOdeV0fV9elFWuSA"
upstream_id
"chatcmpl-4878a189-4de4-91b6-ab0e-be2f98e7ac40"
total_cost
0.0438036
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
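The metadata above implies a few derived figures, such as how much of the completion was reasoning and how long generation took. The following is a minimal sketch of that arithmetic. It assumes that `generation_time` is in milliseconds and that `native_tokens_completion` includes the reasoning tokens; neither is stated on this page. The function name `summarize_usage` is hypothetical.

```python
def summarize_usage(meta: dict) -> dict:
    """Derive a few summary figures from the run metadata fields."""
    completion = meta["native_tokens_completion"]
    reasoning = meta["native_tokens_reasoning"]
    # Assumption: reasoning tokens are counted inside the completion total.
    visible = completion - reasoning
    # Assumption: generation_time is reported in milliseconds.
    seconds = meta["generation_time"] / 1000
    minutes, rem = divmod(round(seconds), 60)
    return {
        "visible_tokens": visible,
        "reasoning_share": reasoning / completion,
        "generation_seconds": seconds,
        "duration_label": f"{minutes}m {rem}s",
    }


# Values copied from the metadata listed above.
run_meta = {
    "native_tokens_completion": 12155,
    "native_tokens_reasoning": 11490,
    "generation_time": 319528,
}
summary = summarize_usage(run_meta)
```

Under these assumptions the duration label matches the 5m 20s shown in the run details, and roughly 94.5% of the native completion tokens were spent on reasoning rather than on the visible scene.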
Evaluation details
Result | Evaluator | Details | Meta Data
68.1022% | Matches word count | n/a | neededClean: false; words: 486
84.2427% | Dialogue to Total Word Ratio | Ratio: 48.15%, Deviation: 1.85% | neededClean: false; wordsTotal: 486; wordsDialogue: 234
76.1725%
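The dialogue ratio reported above can be reproduced from the two word counts (234 dialogue words out of 486 total, against the 50% target in the scenario). The sketch below shows one plausible way to obtain such counts from raw text. It is not the evaluator's actual implementation: it assumes that words are whitespace-separated and that dialogue is any text inside straight double quotes. The function names `dialogue_ratio` and `ratio_from_counts` are hypothetical.

```python
import re


def dialogue_ratio(text: str, target: float = 0.5) -> dict:
    """Count total words and words inside straight double quotes."""
    words_total = len(text.split())
    # Assumption: dialogue is whatever sits between a pair of '"' characters.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio_pct": round(ratio * 100, 2),
        "deviation_pct": round(abs(ratio - target) * 100, 2),
    }


def ratio_from_counts(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (ratio %, deviation in percentage points) from reported counts."""
    ratio = words_dialogue / words_total
    return round(ratio * 100, 2), round(abs(ratio - target) * 100, 2)


sample = 'Bill sighed. "Move your feet," Molly said.'
sample_result = dialogue_ratio(sample)
reported = ratio_from_counts(234, 486)
```

With the reported counts, `ratio_from_counts(234, 486)` gives `(48.15, 1.85)`, which agrees with the Ratio and Deviation values in the evaluation details. How the evaluator maps that deviation to the 84.2427% score is not shown on this page, so the sketch stops at the ratio.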