NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-27-01KQ76WHZJWNGN49N7GW3VT86Y
Run Details
Model:
Qwen3.6 Max Preview
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
5m 40s
Cost:
$0.091515
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
3.62%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
1.8117%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprint spread across the kitchen table resembled a crayon drawing of a bouncy castle. Molly tapped a red marker against a sketched clown face. Bill adjusted his black turtleneck, currently dusted with glitter from a craft store job. He studied the diagram with the solemn intensity of a battlefield commander, though the terrain was clearly a suburban backyard. The laminate surface bore dark coffee stains and sticky strawberry jam smears from breakfast. Molly circled the proposed entry point near the plastic sandbox. Bill sighed heavily and brushed a stray purple sequin from his shoulder. He deeply missed the days when targets involved thick steel vaults. The quiet neighborhood remained still, save for distant leaf blowers. A noisy flock of sparrows argued over a discarded pizza crust. Molly tightly secured her laces and checked the modified pool skimmer. Bill carefully loaded the collapsible wagon with absolute military precision. The afternoon sun glinted off shiny foil wrapping paper. They exchanged a solemn nod before crossing the lawn. One squeaky floorboard could summon a horde of toddlers. Those tiny humans possessed radar for interrupted festivities. Molly cracked her neck and signaled the advance. Bill gripped the wagon handle like a rifle. They crept past rose bushes, avoiding motion sprinklers. The mission was absurd, but the payoff was legendary. We enter at sixteen hundred hours, Molly said. Parents will be distracted by frosting and paper plates. Bill traced a line toward the gift table. What about the hired magician? Amateur tricks, Molly replied. We wear tactical vests. He cannot stop us. Bill nodded slowly. I will secure the perimeter. The inflatable castle masks zipper sounds. Three trips maximum. Make it two, Molly said. The piƱata drops at sixteen thirty. Those children move like liquid. Bill adjusted his gloves. I brought a timed whoopee cushion. Classic misdirection. Perfect, Molly said. We only take large boxes. Small ones contain socks. We are thieves, not monsters. Bill stood up. I have the manifest. Electric cars, a dollhouse, and a heavy science crate. That crate is definitely a disguised safe. Molly rolled the blueprint. Gear up now. Silent soles and bubble wrap pockets. No jingles allowed. This is professional. Check the wind direction, Bill said. It carries the scent of vanilla, Molly said. That means the cupcakes are fresh. We strike before the candle lighting. Bill saluted with a juice box. To the glorious prizes. They moved toward the door like shadows, if shadows occasionally stopped to pick glitter off their sleeves and argued about whether balloon animals counted as security measures. The suburban street awaited. Somewhere down the block, a hired DJ was testing a playlist of nursery rhymes. Molly checked her watch. Bill adjusted his balaclava, which featured cartoon bears because the tactical shop was sold out of plain black. They slipped into the afternoon light, ready to liberate forty pounds of plastic joy from unsuspecting toddlers. A gentle breeze rustled the oak leaves overhead. The back gate latch clicked. The heist of the century was about to begin, and the only thing standing in their way was a man named Chuck who made dogs out of latex tubes.
created_at
"2026-04-27T10:13:47.13Z"
4/27/2026, 10:13:47 AM
model
"qwen/qwen3.6-max-preview-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1642
moderation_latency
(null)
generation_time
340390
tokens_prompt
77
tokens_completion
11010
native_tokens_prompt
76
native_tokens_completion
11720
native_tokens_completion_images
(null)
native_tokens_reasoning
11066
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0915148
router
(null)
provider_responses
0
endpoint_id
"ca7fc0c7-f23d-4ac5-b2e0-945be5b61787"
id
"chatcmpl-e4ed1662-ce41-922d-857b-1b36f61012b5"
is_byok
false
latency
1642
model_permaslug
"qwen/qwen3.6-max-preview-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777284827-MHnI6BnRgbz9g4w19eNI"
session_id
(null)
api_type
"completions"
id
"gen-1777284827-rboU4zKJlH6orkr4wPmU"
upstream_id
"chatcmpl-e4ed1662-ce41-922d-857b-1b36f61012b5"
total_cost
0.0915148
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
3.62%
Matches word count
n/a
neededClean
false
words
524
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
524
wordsDialogue
0
1.8117%