NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK1DA8BZ0218WMF4JGA76P8
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
30.5s
Cost:
$0.001860
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain slicked the colorful plastic slide, turning the backyard of the suburban fortress into a treacherous swamp of discarded streamers and deflated balloons. Molly crouched behind a large inflatable T-Rex, her black tactical gear soaking up the damp night air, while Bill adjusted his ski mask, which was decorated with a clumsy patch of glitter glue he had insisted was necessary for camouflage. They stared at the prize: a mountain of wrapped boxes堆积 under the patio awning, guarded only by a sleeping golden retriever and the lingering scent of cheap pizza. This was the big score, the kind of job that would set them up for life, or at least until the sugar rush from the stolen candy wore off. "Do you think the dog bites?" Bill whispered, his voice trembling slightly as he eyed the slumbering beast. "It's a golden retriever, Bill. Its primary instinct is to fetch, not to maim," Molly hissed back, checking her lock-picking set which she had swapped for a set of oversized plastic spoons earlier that day. "Besides, we have the distraction." Bill reached into his pocket and pulled out a squeaky toy shaped like a rubber chicken. He squeezed it once, testing the volume, and winced at the ear-piercing shriek that echoed in the quiet yard. "That seems loud. Maybe too loud." "Perfection is the enemy of progress," she muttered, creeping forward on her hands and knees. The mud squelched beneath her gloves, a sound that seemed deafening in the silence. She reached the edge of the patio, eyeing the sliding glass door. It was locked, naturally, but the mechanism was a child-proof latch, the kind designed to stop a toddler, not a professional criminal with twenty years of experience in high-stakes larceny. "Hand me the credit card." "I thought we were using the spoons?" Bill asked, scrambling forward to join her, his boots slipping on a stray piece of confetti. He nearly took out a stack of paper plates before catching himself on the garden gnome. "The spoons are for the cake later. The card is for the door. Keep up, Bill, we are losing daylight, or rather, night-light." She slid the plastic card into the track, jiggling it with practiced ease until the latch clicked open with a satisfying snap. The door slid inward, revealing the holy grail: a room filled with toys, games, and enough sugary treats to induce a coma. "Look at all that loot," Bill breathed, his eyes widening behind the mask. "Do you think they have those little chocolate coins?" "Focus on the presents, Bill. The coins are amateur hour." Molly stepped inside, her boots leaving muddy prints on the pristine hardwood. She began grabbing boxes, tossing them to Bill who fumbled each one like a clumsy waiter. "Careful with that one! It says 'Fragile' and it's shaped like a drone." "What if it's a trap?" Bill paused, holding a rectangular box suspended in mid-air. "What if it's an alarm?" "The only alarm here is the one going off in my head telling me we should have stuck to robbing banks," Molly snapped, snatching the box from his hands. "Now move. We have exactly four minutes before the parents wake up from their wine-induced slumber." "But what about the dog?" "If it wakes up, throw it the chicken and run." "That seems cruel." "It's business, Bill. Now grab the big one with the bow and let's get out of here before someone asks why two adults in ski masks are stealing birthday gifts."
created_at
"2026-02-16T10:55:18.321453+00:00"
2/16/2026, 10:55:18 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
784
moderation_latency
(null)
generation_time
30474
tokens_prompt
77
tokens_completion
874
native_tokens_prompt
78
native_tokens_completion
762
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00186
router
(null)
provider_responses
0
id
"chatcmpl-3750f840-270d-96f6-aa33-e37d7e069ed0"
status
200
is_byok
false
latency
784
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771239287-ZQpqVWY7Ik4UEVGJIA6z"
upstream_id
"chatcmpl-3750f840-270d-96f6-aa33-e37d7e069ed0"
total_cost
0.00186
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
594
33.4478%
Dialogue to Total Word Ratio
Ratio: 34.68%, Deviation: 4.68%
neededClean
false
wordsTotal
594
wordsDialogue
206
16.7239%