NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBF8TERWBN046XTSHREV8DR
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
54.2s
Cost:
$0.016329
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
8.00%
Dialogue to Total Word Ratio
(details)
4.0018%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars and squinted through the damp ivy choking the garden wall. The backyard below was a kaleidoscope of balloons and screaming toddlers chasing a dog around a bouncy castle. Bill kicked a pebble near her heel, the sound swallowed by the muffled hum of party music drifting from the open garage. The tension in the air was palpable, thick with the scent of grease and children's candy. "That place is crawling with adults, Molly," Bill whispered, his voice strained against the wind. "Look at the bouncy castle. There are at least six of them running around down there." "It's a fortress," Molly replied, lowering the binoculars to wipe condensation from her glasses with a grim expression. "But we need the haul. Do you see the gift table? It's the only thing that matters in this entire operation." "Yes. It's a pyramid of wrapped cardboard monstrosities covered in glitter and sticky residue." "We go for the big ones. The tablets, the game consoles, maybe the expensive robot." "Who buys those for three-year-olds? That's a tax audit in the making for the parents." "It's about the resale value, Bill. Not the gift-giver's sanity or the child's developmental timeline. We take what we want." Bill sighed, checking his digital watch with a frustrated flick of his wrist against the sleeve of his coat. "The security plan is still flawed. You know this perfectly well." "Flawed? I'm a professional, Bill. We infiltrate during the piñata. That is the standard protocol for parties." "A piñata distraction is a cliché. We used that tactic in the museum last year." "It works, though. They're too busy hitting sugar sacks to notice us sneaking around quietly." "What if the clown notices? What if he's the security guard wearing a mask?" "Clowns don't notice anything until you give them money. They are legally mandated to ignore reality completely." Molly shifted her weight, crunching a dry leaf under her boot with a sharp crack. "I don't like this vibe. It's too... chaotic. Unpredictable for a high-value heist." "That's the point. Chaos is our cover. We blend in like two bored parents trying to nap." "We have no children. We aren't parents. We can't blend in." "We borrow them. We find a lost kid in the shrubbery and attach them to our hip immediately." "That's illegal kidnapping. We are thieves, not abductors. We don't hurt kids." "It's borrowing. We'll return them after the loot is secured and the bag is full." Bill pulled a crowbar from his jacket, the metal gleaming in the twilight sky. "I'm ready to smash the gate if I have to force it open." "No smashing. We need to slip in silently. Look, the gate is open." "Is it? Or is that a trap? Is someone waiting behind it to catch us?" "It's a trap. The hostess is watching us right now through the window upstairs." "I don't see her. I just see a cake on the table." "You don't see her, but she sees everything. She suspects you of being shady." Bill frowned, rubbing the stubble on his chin in thought. "You think she suspects the two of us?" "She suspects everyone. She's a mother. Mothers are the real heist masters of the modern world." "Then we need better disguises for the raid. We can't just wear normal clothes." "I have the diapers. They stretch perfectly for the floor crawl if we need to hide." "Why do you always bring diapers? That is disgusting and unsanitary for us." "For the camouflage. The security guard is wearing a dinosaur suit for the show." "I'm stealing toys, not hiding from a zoo. We should just climb the fence quietly." "I do have a ladder, yes. But only for emergencies when we are desperate." Bill grabbed her wrist, pulling her out from behind the hedge suddenly. "Let's move. The music is stopping. The singing has ceased." "What does that mean? Are we too late to get anything?" "It means cake is coming. That's our window of opportunity for the theft." "Exactly. Grab the bag. Don't drop it or the whole plan fails." "I'm in. Are you in? We go now or never again." "I'm in. Just get me inside before the candles are lit."
created_at
"2026-02-25T22:38:46.239Z"
2/25/2026, 10:38:46 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
444
moderation_latency
(null)
generation_time
54005
tokens_prompt
77
tokens_completion
8030
native_tokens_prompt
76
native_tokens_completion
8155
native_tokens_completion_images
(null)
native_tokens_reasoning
7185
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.016329
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-16b449a8-abf1-9d0b-81c3-668490ebdcd1"
is_byok
false
latency
444
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772059126-PSB6U3amQ4xqbfMud0NA"
upstream_id
"chatcmpl-16b449a8-abf1-9d0b-81c3-668490ebdcd1"
total_cost
0.016329
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
708
8.00%
Dialogue to Total Word Ratio
Ratio: 77.11%, Deviation: 7.11%
neededClean
false
wordsTotal
712
wordsDialogue
549
4.0018%