NC Bench
Run: 2026-02-16-01KHK2TQBQ76DDEY5BEZXWN600
Run Details
Model: Qwen 3.5 Plus (2026-02-15)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 27.8s
Cost: $0.001740
Provider: Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Stop pacing, you're making me dizzy," Molly hissed, peering through the binoculars at the brightly lit suburban house. "I can't help it," Bill whispered back, tugging at his black beanie. "It feels wrong. We usually hit banks or museums. This is a birthday party for a six-year-old named Timmy." "Timmy has a PlayStation 5 under that tree, Bill. And a stack of Lego sets big enough to build a actual fortress. That is not child's play; that is high-value merchandise." "But the clowns, Molly. There are two clowns. One is juggling. What if he sees us?" "Clowns are distracted by their own existential dread and bad makeup. They won't notice a thing. Now, look at the perimeter. The back door is unlocked because the caterer just went in with the cake." "This is insane. We are going to jail for stealing a My Little Pony." "It's the limited edition Princess Celestia, Bill! Do you know what those go for on the secondary market? Enough to get us out of this life and into a nice condo in Belize." "Belize has mosquitoes. I hate mosquitoes." "Focus! Here is the plan. I create a diversion near the bounce house. You slip in through the kitchen, grab the big boxes first, then sweep the smaller ones into the sack." "What about the parents? There are like thirty adults drinking punch out there." "They are drunk on cheap wine and nostalgia. They won't notice a thing until we are gone. Just don't make eye contact with the magician. Magicians have weird senses." "Why do magicians have weird senses?" "Because they notice when things disappear, Bill! It's their whole job description. If a guy who pulls rabbits out of hats sees you taking gifts, he's going to suspect something." "This is the dumbest plan we have ever had. Remember the zoo heist? At least the monkeys were predictable." "The monkeys bit you because you tried to feed them your sandwich. Stick to the script. I whistle three times, that means the coast is clear. If I whistle once, it means run." "What does two whistles mean?" 
"It means I've been caught by the bounce house operator. Just leave me and save the Legos." "I am not leaving you for plastic bricks, Molly. That's against the code." "The code says we prioritize the haul. The code also says no talking about the code while surveilling a target. Shut up and watch the window." "Okay, okay. The magician is doing a card trick. The kids are mesmerized. This might actually work." "See? I told you. Kids are easily distracted. It's their greatest weakness and our greatest asset." "What if Timmy wakes up? What if he cries?" "Then you give him a lollipop from your pocket and tell him you're the Tooth Fairy's assistant. Now, check your gear. Do you have the sack?" "Yes. It has little ducks on it. I look ridiculous." "You look like a guest who took the party favors too seriously. Let's move. Operation Happy Birthday starts now." "Wait, one last thing. If we get caught, I'm telling them you made me do it." "Standard procedure, Bill. Now move your feet before the cake gets cut."
created_at
"2026-02-16T11:20:03.261899+00:00"
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
840
moderation_latency
(null)
generation_time
27784
tokens_prompt
77
tokens_completion
798
native_tokens_prompt
78
native_tokens_completion
712
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00174
router
(null)
provider_responses
0
id
"chatcmpl-22fac124-63d0-979b-9598-da84173dce7a"
status
200
is_byok
false
latency
840
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771240775-8GYOBx60k5wHOBCJFURo"
upstream_id
"chatcmpl-22fac124-63d0-979b-9598-da84173dce7a"
total_cost
0.00174
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 535
0.0000% | Dialogue to Total Word Ratio | Ratio: 96.26%, Deviation: 26.26% | neededClean: false; wordsTotal: 535; wordsDialogue: 515
0.0000%
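
The ratio and deviation reported above can be reproduced from the two word counts. The following is a minimal sketch, not the benchmark's actual evaluator: the function names, the whitespace-based word split, and the rule that dialogue means text between straight double quotes are all assumptions made for illustration. Only the 70% target and the counts 515 and 535 come from this page.

```python
import re

TARGET_RATIO = 0.70  # from the scenario: "70% dialogue"


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from the target)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Hypothetical counter: words inside straight double quotes vs. all words.

    Assumes whitespace-separated words and balanced straight quotes;
    the real evaluator may tokenize differently.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    return words_dialogue, words_total


if __name__ == "__main__":
    # Counts taken from the evaluation metadata on this page.
    ratio, deviation = ratio_and_deviation(515, 535)
    print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With the counts from this run, the sketch prints the same ratio and deviation as the Details column. How the evaluator converts that deviation into the 0.0000% result is not shown on this page, so the sketch leaves that step out.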