NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ77QJPB22EV1S023AJTXB8R
Run Details
Model:
Qwen 3.6 Flash
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
53.8s
Cost:
$0.015430
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
97.25%
Dialogue to Total Word Ratio
(details)
48.6269%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her grip on the window frame, her inflatable tire squeaking a frantic rhythm against the siding. She pressed her face to the glass, studying the target zone through a smear of bird droppings while Bill checked the watch taped to his wrist, his elbow clanging against a gutter like a dinner bell for crabs. He fumbled with a rubber duck headset, whimpering silently as he smoothed his burlap sack labeled Grandma's Cookies. "The security is tighter than a snare drum in a mosh pit, Molly," Bill whispered, voice cracking through the rubber bill. "Did you see the perimeter? Goldfish crackers everywhere. I stepped on one. I signed a treaty with a toddler named Brayden." Molly ignored the geopolitical implications of the snack and peered past a fern that smelled of regret. She produced a crayon-drawn diagram, squinting at the code while Bill's knees vibrated audibly. "The hostess is distracted. She's trying the dinosaur child to eat a hot dog without setting his eyebrows on fire. That's our window. Now remember the plan. In, grab, and out. No making friends. No balloon animals. We are ghosts. The wind. An aggressive cloud of desire for plastic treasures." Bill gripped the sack until the label peeled away, revealing Explosives scrawled in marker, which made his stomach perform a backflip. He pointed a trembling finger at the latch protected by a badger knocker, muttering a prayer to petty theft. "I'm ready. But what if they catch us? I have a reputation as a serious criminal, Molly. I once stole a library book and never returned it for three decades. This feels like a downgrade. Also, I'm allergic to glitter." Molly turned, her tire deflating with a mournful toot that echoed across the lawn. She slapped Bill's head with a gardening glove, leaving a tomato leaf imprint. "This is a goldmine, Bill. What's on that table? The limited edition dragon plush. The tablet with the infinite battery. The crayons that don't break. We're robbing the vault of joy. Cut the audio feed from your duck. You're broadcasting panic to my bone, and if you don't zip it, I'll leave you here to negotiate with Brayden." Molly adjusted her grip on the window frame. Bill whispered, "My grandmother would hate this." "But the precious dragon!" Bill wailed. Molly grabbed his shoulders, shaking him nearly dislodging the duck headset. "Focus, you gladiator. On three. One. Two."
created_at
"2026-04-27T10:28:32.645Z"
4/27/2026, 10:28:32 AM
model
"qwen/qwen3.6-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
647
moderation_latency
(null)
generation_time
53631
tokens_prompt
77
tokens_completion
10456
native_tokens_prompt
76
native_tokens_completion
10274
native_tokens_completion_images
(null)
native_tokens_reasoning
9725
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01543
router
(null)
provider_responses
0
endpoint_id
"2435bec9-c808-4f27-9fc8-12f3460d7b66"
id
"chatcmpl-37c1c644-2f9e-966a-84b4-127ac07bc1c4"
is_byok
false
latency
647
model_permaslug
"qwen/qwen3.6-flash"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777285712-WKuq4dBcs07QikQSupGo"
session_id
(null)
api_type
"completions"
id
"gen-1777285712-y88G5gy1DMUZ7EAdpCPg"
upstream_id
"chatcmpl-37c1c644-2f9e-966a-84b4-127ac07bc1c4"
total_cost
0.01543
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
401
97.25%
Dialogue to Total Word Ratio
Ratio: 49.25%, Deviation: 0.75%
neededClean
false
wordsTotal
402
wordsDialogue
198
48.6269%