NC Bench
Run: 2026-04-27-01KQ7A3NZPE71PQ665FJ6FH9AQ
Run Details
Model: Qwen 3.6 Flash
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 42.5s
Cost: $0.011603
Provider: Alibaba
Evaluation summary
Score | Evaluator
0.00% | Matches word count
55.46% | Dialogue to Total Word Ratio
27.7309% | Overall
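The Overall figure is consistent with an unweighted mean of the two evaluator scores: (0.00% + 55.46%) / 2 ≈ 27.73%. The page does not state the aggregation rule, so the sketch below assumes equal weighting, and the function name `overall_score` is hypothetical. The displayed value of 27.7309% suggests the ratio score is carried at higher precision than the two decimals shown.

```python
from statistics import mean

def overall_score(scores_pct):
    """Unweighted mean of per-evaluator scores in percent (assumed rule)."""
    return mean(scores_pct)

# Scores as displayed in the summary, rounded to two decimals.
evaluator_scores = [0.00, 55.46]
overall = overall_score(evaluator_scores)
print(f"{overall:.2f}%")
```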
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill sat behind the wheel of the stolen ice cream truck, staring through duck-shaped binoculars at the Henderson house. Balloons bobbed against the sky while toddlers launched themselves at a plastic bouncy castle. Molly stood in the back, packing a reinforced burlap sack. She checked her watch, the red digits pulsing like a heartbeat of impending doom. She turned to Bill, her face obscured by a caked tennis-ball mask, and held up three fingers. Bill nodded, pointing to the French doors where a table groaned under wrapped presents. He mimed climbing a trellis, then a silent entry. Molly shook her head, pointing to the dog sleeping near the patio. She mimed a snoring sound, then tapped her temple. "The dog is on sedatives," Bill whispered, adjusting his utility belt. He pulled out a sonic emitter disguised as a rubber ducky. "The sedatives last two minutes. We have a window." He pointed to the window, making a grasping motion for the loot. "You forgot the distraction," Molly hissed, stopping him. She grabbed a spool of wire, tossing it to him. She pointed to a strand of red yarn near the door, then made a slashing motion. "That's not a laser grid, Bill. It's yarn. We trip, the dog wakes. We trip, Fido wakes. Fido wakes, we're cake." Bill checked the emitter, the battery glowing green. "Got it. Yarn trip, no cake. What about the piñata?" Molly hefted the sack. "The bear stays. We steal the boxes. Not the candy. Stick to the plan. On my mark, you hit the ducky. I breach the window. We roll. Speed is key. Merge with the parents. Look panicked." Bill patted his pockets, finding a whoopee cushion. "I bring this?" Molly snatched it. "Deploy when the cake hits the table. Chaos covers the exit. Split is sixty-forty." Bill gave a thumbs-up, slinging the wire over his shoulder. "Sixty-forty. Go." Molly pushed open the truck door, stepping into the blinding suburban sunlight. The countdown hit zero. She cracked the window, hearing muffled cheers. 
She checked her wire, her resolve. The sun beat metal until it groaned. Bill adjusted his mask, the mesh scratching an irritation he had long since learned to ignore. Molly gripped the sack, preparing to swing. A low hum whined from the ducky as Bill pressed the button. The dog twitched and sighed, sinking deeper into sleep.
created_at: "2026-04-27T11:10:06.335Z" (4/27/2026, 11:10:06 AM)
model: "qwen/qwen3.6-flash"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 610
moderation_latency: (null)
generation_time: 42394
tokens_prompt: 77
tokens_completion: 7509
native_tokens_prompt: 76
native_tokens_completion: 7723
native_tokens_completion_images: (null)
native_tokens_reasoning: 7163
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0116035
router: (null)
provider_responses:
  0:
    endpoint_id: "2435bec9-c808-4f27-9fc8-12f3460d7b66"
    id: "chatcmpl-84e75dd3-be98-9c4a-8c58-6fbf826e1fb3"
    is_byok: false
    latency: 610
    model_permaslug: "qwen/qwen3.6-flash"
    provider_name: "Alibaba"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777288206-KmjFB1hV4lMALo0IHfU9"
session_id: (null)
api_type: "completions"
id: "gen-1777288206-Nr9au2A0ymhASBCJHd2s"
upstream_id: "chatcmpl-84e75dd3-be98-9c4a-8c58-6fbf826e1fb3"
total_cost: 0.0116035
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
response_cache_source_id: (null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 394
55.46% | Dialogue to Total Word Ratio | Ratio: 26.57%, Deviation: 3.43% | neededClean: false, wordsTotal: 399, wordsDialogue: 106
27.7309% | Overall
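The reported Ratio and Deviation follow directly from the counts in the Meta Data column: 106 dialogue words out of 399 total gives 26.57%, which is 3.43 percentage points short of the 30% target in the scenario. A minimal Python sketch of that arithmetic is below; the function name `dialogue_ratio_pct` is hypothetical, and the mapping from deviation to the 55.46% score is not shown on the page, so it is not reproduced here.

```python
TARGET_RATIO_PCT = 30.0  # target from the scenario: "30% dialogue"

def dialogue_ratio_pct(words_dialogue, words_total):
    """Share of dialogue words in the response, as a percentage."""
    return 100.0 * words_dialogue / words_total

# Counts reported by the evaluator (wordsDialogue, wordsTotal).
ratio = dialogue_ratio_pct(106, 399)
deviation = abs(TARGET_RATIO_PCT - ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

Note that the two evaluators count words differently (394 versus 399), so the ratio uses the total reported by the dialogue evaluator itself.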