NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK21SZXMG0VHZ6F32V35VR7
Run Details
Model:
Qwen 3.5 397B A17B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
3m 36s
Cost:
$0.030340
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her mask while peering through the hedge. The lawn stretched out before them, littered with plastic chairs. Bill checked his watch and sighed loudly into the night air. Are you sure about this specific plan working? Bill asked. It is absolutely perfect for us today, Molly said. No security systems. Just sugar high parents looking at phones. They are kids, Bill said sadly. We are technically stealing from toddlers here now. They have the best stuff, Molly replied quickly. Lego sets are liquid gold on the black market. Bill kicked a stone across the pavement slowly. The noise seemed loud in the quiet street. He looked at the house where balloons bobbed against the mailbox. I feel physically sick about this job, Bill said. Put on the uniform right now please, Molly ordered. Bill pulled a clown suit from his bag. The fabric rustled like dry leaves. He stared at the red nose with resentment. This is profoundly humiliating for me specifically, Bill said. It is strict camouflage, Molly said firmly. Nobody suspects a working clown at all ever. Everyone suspects a clown, Bill argued loudly. They are inherently suspicious figures everywhere locally. Molly shoved the nose into his hand roughly. She pointed at the door where boxes gleamed under the light. Look at that massive haul there now, Molly said. That is a dinosaur, Bill said quietly. It is made of cheap plastic usually mass produced. It is a vintage model, Molly insisted strongly. Do you know what collectors pay for those items? Bill zipped the suit up slowly. The zipper sounded like a dying mosquito nearby. He waddled as the fabric bunched around his knees. I cannot run in this outfit easily, Bill said. You do not need to run fast, Molly said. You need to jiggle naturally instead mostly. Jiggle? Bill asked confused. Clowns jiggle, Molly explained calmly. It throws off their visual aim completely always. Bill looked at the house again nervously. A child screamed inside loudly. The sound was piercing and high pitched. They are wild animals, Bill said scared. Distract them with balloons, Molly instructed sharply. I hate balloons deeply, Bill said sadly. Molly checked her lockpick set carefully. The tools clinked softly in the quiet night. She felt ready for anything except parental rage. Are you ready now? Molly asked. No, Bill said honestly. Too bad for you, Molly said. Bill took a deep breath slowly. The rubber smell filled his nose completely. He stepped onto the grass. Here we go then, Bill said. Molly followed behind him closely. She carried a sack with a dollar sign. It was classic but effective. Do not trip over anything, Molly whispered. I am trying hard, Bill whispered back. They reached the porch steps. The boxes waited silently. Bill reached for a package. This feels morally wrong, Bill said. Think of the money involved, Molly said. I am thinking of the tears, Bill said. Molly opened the sack. The canvas mouth gaped open. Just put it in, Molly said. Bill dropped the box inside. It landed with a heavy thud. One down already, Molly said. Forty to go still, Bill said. Molly smiled behind her mask. This was going to be easy money.
created_at
"2026-02-16T11:09:35.336049+00:00"
2/16/2026, 11:09:35 AM
model
"qwen/qwen3.5-397b-a17b-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
869
moderation_latency
(null)
generation_time
216082
tokens_prompt
77
tokens_completion
8989
native_tokens_prompt
76
native_tokens_completion
8415
native_tokens_completion_images
(null)
native_tokens_reasoning
7681
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0303396
router
(null)
provider_responses
0
id
"chatcmpl-d250abee-eb4e-9bf3-83c1-12687da3df2c"
status
200
is_byok
false
latency
869
endpoint_id
"180484a2-e24b-495a-ba41-b34b680d5a9f"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-397b-a17b-20260216"
api_type
"completions"
id
"gen-1771239958-Z8IYHBLs4wcbpaIIxdA9"
upstream_id
"chatcmpl-d250abee-eb4e-9bf3-83c1-12687da3df2c"
total_cost
0.0303396
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
539
0.0000%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
539
wordsDialogue
0
0.0000%