NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ77R5RKDM73Y8F2W198AMM6
Run Details
Model:
Qwen3.6 Max Preview
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
8m 55s
Cost:
$0.103745
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
50.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprints covered the kitchen table, held down by plates. We strike at four, Molly said. The magician drops rabbits. Bill adjusted his bifocals and studied the crude sketch carefully. What about the chaperones? They guard the potato salad fiercely. Molly clicked her pen and drew a swift arrow outward. I hacked the sprinkler timer. Wet shoes create absolute panic. Bill leaned back, rubbing his tired eyes against the lamplight. And the primary objective? Are we still targeting the electronics? Molly pointed to a shaded rectangle near the oak tree. The gift table. Forty boxes. Mostly tablets. One giant dinosaur. Bill scratched his beard, considering the logistics of large toys. The dinosaur creates drag. Fluff ruins our aerodynamic escape profile. She tossed the highlighter onto the wood and crossed arms. We leave the dinosaur. Grab the games. Ignore the noise. He sighed, tracing the escape route with a calloused thumb. I still want the cake. Three tiers. Vanilla bean frosting. Molly slammed her palm flat against the table. Dust danced. Focus, Bill. We are thieves. We never risk for sugar. He raised both hands in surrender, shoulders slumping slightly downward. Fine. What about the entertainer? Clowns carry compact reinforced vehicles. Molly checked her wristwatch and nodded toward the dark window. He parks at three forty. We slip past the chickens. Bill drummed his fingers nervously against the chipped wooden edge. Seven minutes to clear the table? That is extremely tight. She dragged a red wagon from the corner and smiled. I oiled the wheels. Silent rolling. We load and vanish. He raised a skeptical eyebrow, watching the rusty axle spin. What if a toddler intercepts us? They have sharp eyes. Molly zipped her black windbreaker and adjusted the stiff collar. Hand them candy. Claim we are inspectors. Children trust uniforms. Bill stood up straight and cracked his neck loudly twice. I will wear the tactical vest. Extra pouches for batteries. She rolled the blueprints tight and secured them with elastic. Check the zip ties. Pack the duffels. Avoid the castle. He grabbed his car keys from the counter and nodded. Understood. I will bring ear protection. Screams shatter concentration fast. Molly killed the overhead light, leaving the room quite dark. Meet at the fence. Bring the wagon. Leave morals behind. Bill opened the back door, letting cool evening air inside. Morals are already boxed. Right beside the spare gloves now. She smiled faintly, listening to the distant sound of dogs. This job will be flawless. No lasers. Just wrapping paper. He stepped onto the porch and looked back just once. Do not underestimate the rabbit. Animals are unpredictable wildcards, honestly. Molly locked the deadbolt and watched her shadow move away. The rabbit stays caged. We follow the timeline. Gifts secured. Bill offered a crisp salute before descending the wooden steps. See you at the perimeter. Try not to trip outside. She pocketed her flashlight and exhaled a long, steady breath. I never trip. I advance aggressively. Start the engine now.
created_at
"2026-04-27T10:31:21.903Z"
4/27/2026, 10:31:21 AM
model
"qwen/qwen3.6-max-preview-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1146
moderation_latency
(null)
generation_time
384892
tokens_prompt
77
tokens_completion
12631
native_tokens_prompt
76
native_tokens_completion
13288
native_tokens_completion_images
(null)
native_tokens_reasoning
12587
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.1037452
router
(null)
provider_responses
0
endpoint_id
"ca7fc0c7-f23d-4ac5-b2e0-945be5b61787"
id
"chatcmpl-150d19fc-47ac-96fc-8fd0-f8165d602f7a"
is_byok
false
latency
1146
model_permaslug
"qwen/qwen3.6-max-preview-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777285881-tFtigXwlT7xO3QKR4GvE"
session_id
(null)
api_type
"completions"
id
"gen-1777285881-7hYHVgPpELztm0PsWvA7"
upstream_id
"chatcmpl-150d19fc-47ac-96fc-8fd0-f8165d602f7a"
total_cost
0.1037452
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
500
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
500
wordsDialogue
0
50.0000%