NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK1DMRA3J6XXPGERDHT2B9K
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
31.4s
Cost:
$0.001961
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain slicked the alleyway behind the community center, turning the cracked pavement into a mirror for the flickering neon sign that buzzed overhead. Molly adjusted her black beanie, pulling it low over her eyes as she peered around the corner of the dumpster. Bill stood beside her, nervously checking the strap of his oversized duffel bag, which was currently empty but soon hoped to be full of plush toys and board games. "Okay, here is the plan," Molly whispered, her voice tight with professional intensity. "We go in through the side door. The janitor leaves it unlocked during the party for smoke breaks." Bill blinked, looking skeptical in the dim light. "You want us to rob a six-year-old's birthday party? Really, Molly? This feels low, even for us." "Stop whining and start focusing," she snapped, tapping a laminated floor plan she had stolen from the front desk. "The target is the gift table. It is located in the main hall, directly opposite the pinata. We have a twenty-minute window while the parents are distracted by the magician." "A magician? Great, just what I needed, more illusions while I commit felony larceny against toddlers," Bill muttered, kicking a soda can. "What if the kids see us? They have eyes like hawks, Molly. Tiny, judgmental hawks." "Then we blend in," she said, pulling two oversized clown noses out of her pocket and tossing one to him. "Put that on. We pose as entertainers. Nobody questions a clown carrying a bag of balloons." Bill caught the red foam nose and stared at it with deep resignation. "I am not wearing this. I have a reputation, you know. Well, I did, before I agreed to steal Legos." "Your reputation is currently waiting in line at the DMV, Bill. Put on the nose," Molly ordered, already affixing hers to her own face. It squeaked loudly in the quiet alley. "Now, listen. Once we are inside, you create a diversion. Start a conga line or something. Get the adults moving away from the gift table." "A conga line? Are you insane?" Bill hissed, finally shoving the nose onto his face where it sat crookedly. "I do not know how to lead a conga line. I have two left feet and zero rhythm." "Then trip over something expensive," she suggested, checking her watch. "Just get their attention. I will sweep the table. Grab the big boxes first. Ignore the greeting cards; they have no resale value unless they are vintage, which these are not." "What about the cake?" Bill asked, his stomach growling audibly. "Can we take the cake? I haven't eaten since breakfast." "No cake," Molly said firmly. "Cake is messy and melts. We are professionals, not gluttons. Stick to the electronics and the high-end dolls." "This is pathetic," Bill sighed, adjusting his bag. "We used to hit banks. Now we are targeting a theme party called 'Dino-Mite Fun'." "Times are tough, Bill. The economy is down, and children's parties are surprisingly lucrative if you know where to look," she replied, creeping toward the side door. "Ready?" "No," Bill admitted, following her anyway. "But let's get this over with before I lose my soul completely." "Good attitude," Molly whispered as she jimmied the lock. "Remember, smile big. Kids love a smiling thief." "They also love calling the police," Bill grumbled as the door creaked open, revealing a hallway decorated with streamers and the faint sound of happy birthday being butchered by a group of sugar-rushed children. "Move it," Molly said, slipping inside. "Showtime."
created_at
"2026-02-16T10:55:29.727466+00:00"
2/16/2026, 10:55:29 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
881
moderation_latency
(null)
generation_time
31377
tokens_prompt
77
tokens_completion
894
native_tokens_prompt
78
native_tokens_completion
804
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0019608
router
(null)
provider_responses
0
id
"chatcmpl-ad10d716-dfeb-9e20-8694-b885209ff740"
status
200
is_byok
false
latency
881
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771239297-5KMAcaDywm7XwbEWcxg2"
upstream_id
"chatcmpl-ad10d716-dfeb-9e20-8694-b885209ff740"
total_cost
0.0019608
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
594
0.7206%
Dialogue to Total Word Ratio
Ratio: 59.93%, Deviation: 9.93%
neededClean
false
wordsTotal
594
wordsDialogue
356
0.3603%