NC Bench
Run: 2026-02-16-01KHK2K6CA0ABH2HXAWEM9YYAG
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
32.1s
Cost:
$0.001944
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain slicked the alleyway behind the community center, reflecting the neon glow of the birthday banner hanging crookedly in the window. Molly adjusted her ski mask, which was far too large and kept slipping over her eyes, while Bill tried to wedge a crowbar into the service door with the grace of a drunken walrus. "This is it," Molly whispered, her voice muffled by the wool. "The ultimate score." Bill grunted, finally popping the latch. "Are you sure about this, Mol? Stealing from kids feels wrong. Like kicking a puppy but with more glitter." "Think of the resale value, Bill. Unopened LEGO sets? Limited edition dolls? It's a goldmine wrapped in colorful paper." "But they're children," Bill hissed, pushing the door open. "They believe in magic. We'd be stealing their magic." "We're stealing plastic toys they'll break in an hour. Now move." They crept into the hallway, the air thick with the scent of cheap pizza and sugar frosting. Balloons drifted lazily against the ceiling like colorful prisoners. Molly pointed toward the main hall where a mountain of wrapped boxes sat beneath a banner that read Happy 7th Birthday Timmy. "Look at that haul," Molly breathed, her eyes widening behind the mask. "That's at least three hundred dollars in retail value." "That's a My Little Pony collection," Bill noted, shivering. "Do you know how hard it is to fence those? The market is saturated." "Details, Bill. Details. We grab the big boxes first. Leave the socks and underwear." "What if there's a clown?" Bill asked, his voice trembling slightly. "I hate clowns. They know things." "There won't be a clown. It's a Tuesday night party for a seven-year-old. The budget barely covered the cake." "Famous last words," Bill muttered, stepping over a discarded party hat. "Remember the last job? The one with the gerbil?" "We don't talk about the gerbil incident." "I still have nightmares, Mol. The little guy had eyes like tiny judgmental marbles." "Focus! 
We have ten minutes before the parents come back from taking the cake photos." They reached the tree of gifts. Molly grabbed a large rectangular box, shaking it gently. "Sounds like a drone. High end." "Or a board game," Bill countered, picking up a lumpy sack. "This feels like socks. Definitely socks. Who gives socks to a seven-year-old? That's criminal in itself." "Put it in the bag anyway. Maybe they're superhero socks." "I feel sick," Bill groaned, shoving the sack into their duffel. "We're monsters. Actual monsters." "We're entrepreneurs," Molly corrected, snatching a shiny red box. "Now help me with this one. It's heavy." "It's probably a bicycle," Bill said, lifting the other end. "Why do parents buy bicycles indoors? It makes no sense." "Less talking, more lifting. We need to clear the top layer before we hit the small stuff." "What about the piñata?" Bill asked, glancing at the donkey-shaped object hanging nearby. "There's candy in there." "Candy melts. We stick to hard goods." "You're so cold, Mol. So calculating." "I'm hungry, Bill. There's a difference. Now let's go before someone hears us arguing about ethics in a room full of streamers." "Fine," Bill sighed, hoisting the duffel onto his shoulder. "But if I see a clown, I'm running. And I'm taking the LEGO set with me as severance pay." "Deal. Now move your feet before the magic wears off."
created_at
"2026-02-16T11:16:00.558346+00:00"
2/16/2026, 11:16:00 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
808
moderation_latency
(null)
generation_time
32030
tokens_prompt
77
tokens_completion
885
native_tokens_prompt
78
native_tokens_completion
797
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.001944
router
(null)
provider_responses
0
id
"chatcmpl-a389ab5a-1956-9575-a54f-517055da5721"
status
200
is_byok
false
latency
808
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771240528-ChvVx24758f2qoUj7NEp"
upstream_id
"chatcmpl-a389ab5a-1956-9575-a54f-517055da5721"
total_cost
0.001944
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
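As a rough reading aid for the metadata above, generation throughput can be derived from native_tokens_completion and generation_time. The sketch below assumes generation_time is reported in milliseconds, which the page does not state; the function name is hypothetical and not part of NC Bench or the provider API.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Return completion tokens generated per second.

    Assumption: generation_time_ms is in milliseconds (not confirmed by the page).
    """
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000)


if __name__ == "__main__":
    # Values copied from the metadata listed above.
    rate = tokens_per_second(completion_tokens=797, generation_time_ms=32030)
    print(f"{rate:.2f} tokens/s")
```

Under that unit assumption, this run works out to roughly 25 native completion tokens per second.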
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false; words: 563
0.0315% | Dialogue to Total Word Ratio | Ratio: 62.70%, Deviation: 12.70% | neededClean: false; wordsTotal: 563; wordsDialogue: 353
0.0157%
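The reported ratio and deviation follow from the listed counts: 353 dialogue words out of 563 total is 62.70%, which is 12.70 percentage points above the 50% target. The sketch below reproduces that arithmetic and shows one possible way to count dialogue words. The evaluator's actual tokenization is not shown on this page, so the quote-matching rule and all function names here are assumptions for illustration only.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated words."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (an assumed rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target_pct: float = 50.0) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from target), both in percent."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio_pct = 100.0 * words_dialogue / words_total
    return ratio_pct, abs(ratio_pct - target_pct)


if __name__ == "__main__":
    # Counts copied from the evaluation metadata above.
    ratio, deviation = ratio_and_deviation(353, 563)
    print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")

    # Counting from raw text with the assumed quote rule.
    sample = 'Bill sighed. "Fine, let us go now," he said.'
    print(count_dialogue_words(sample), "of", count_words(sample), "words are dialogue")
```

How the deviation is converted into the percentage shown in the Result column is not documented here, so the sketch stops at the ratio and deviation.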