NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBFXGC4DS71AANB4SKEYMY5
Run Details
Model:
Qwen 3.5 122B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 17s
Cost:
$0.039474
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.44%
Dialogue to Total Word Ratio
(details)
49.7189%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van smelled of stale coffee and desperation. Molly adjusted her glasses, staring at the photograph of the suburban house. A pink balloon floated in the corner of the shot, mocking their criminal intent. Bill sat in the passenger seat, checking his lockpicks for the third time. The engine idled roughly, vibrating through the worn leather seats. He noticed the balloon in the photo and asked about the unicorn. The cold air seeped through the cracked window, chilling the sweat on his brow. Molly pointed at the blueprint. The house looked innocent enough, with its manicured lawn and white picket fence. It was a perfect target for amateurs, but they were not amateurs. "Is that a unicorn?" "It's a target, Bill. A very expensive target." "We're stealing from kids? Really?" "The parents bought them. The kids haven't opened them yet. Technically, it's unclaimed property." "Technically, it's also a felony involving minors." "Don't be a buzzkill. Look at the layout." Molly tapped the blueprint spread across the dashboard. "The tree is in the living room. The presents are piled under it. We slip in through the kitchen, grab the loot, and vanish before the cake cutting." "What about the security system?" "Disabled. I hacked the Ring camera. It's looping footage of a sleeping cat." Bill scratched his chin. "And the parents?" "Drunk on eggnog. They won't notice until morning." "Molly, this feels wrong." "It feels like a payout, Bill. Do you want to eat ramen again tonight?" He sighed. "No more ramen." "Exactly. Now, gear up." Molly grabbed her duffel bag. "We need the grab-and-go strategy. No wrapping paper left behind. We take the boxes, not the toys. Less noise." "They'll be asleep. It's a birthday party. They crash hard." The van engine hummed to life. Molly put the car in gear. Bill checked the rearview mirror one last time before turning the key. The engine roared to life, shaking the entire vehicle. They drove off into the night, leaving the neighborhood behind. The plan was solid. The execution was questionable. But the money was real. Molly gripped the wheel, her knuckles white against the black leather. Molly's hands trembled slightly, but she hid it well. She knew the risks involved in such a heist. She thought about the unicorn again. It seemed judgmental. Bill shifted in the passenger seat. They were professionals, or at least they pretended to be. The stakes were high, but the moral implications were lower than the price of a cheap toy. They turned the corner, headlights cutting through the darkness like a predator's eyes. The silence in the van was heavy, broken only by the hum of the engine. Outside, the wind howled, masking the sound of their approach. Molly checked her reflection in the rearview mirror. Her eyes were tired, but determined. Bill looked at the road, his face illuminated by the dashboard lights. He wondered if the parents would notice the missing gifts. He hoped not. The guilt was already settling in his stomach like a stone. But the rent was due. And the ramen was gone. The streetlights flickered overhead, casting long shadows across the dashboard. The tires crunched over gravel as they approached the curb.
created_at
"2026-02-25T22:50:04.043Z"
2/25/2026, 10:50:04 PM
model
"qwen/qwen3.5-122b-a10b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
417
moderation_latency
(null)
generation_time
77192
tokens_prompt
77
tokens_completion
11979
native_tokens_prompt
76
native_tokens_completion
12326
native_tokens_completion_images
(null)
native_tokens_reasoning
11610
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0394736
router
(null)
provider_responses
0
endpoint_id
"33d466d1-d2f7-45ed-8ff3-fda7d2e32351"
id
"chatcmpl-84f6929a-376a-9eb9-a1dc-fb6596077386"
is_byok
false
latency
417
model_permaslug
"qwen/qwen3.5-122b-a10b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772059804-N0DIUnHQmk9WsRr6QsMm"
upstream_id
"chatcmpl-84f6929a-376a-9eb9-a1dc-fb6596077386"
total_cost
0.0394736
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
534
99.44%
Dialogue to Total Word Ratio
Ratio: 29.66%, Deviation: 0.34%
neededClean
false
wordsTotal
536
wordsDialogue
159
49.7189%