NC Bench
Run: 2026-04-27-01KQ79NBN6KHHT4Q29SM6V4V6D
Run Details
Model: Qwen3.6 Max Preview
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 3m 57s
Cost: $0.066157
Provider: Alibaba
Evaluation summary
Score      Evaluator                      Details Link
93.65%     Matches word count             (details)
0.00%      Dialogue to Total Word Ratio   (details)
46.8248%   Overall
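The Overall figure is consistent with an unweighted mean of the two evaluator scores. That aggregation rule is an inference from the numbers on this page, not a documented formula, and the function name below is hypothetical. A minimal check, using the scores as displayed (so rounding in the 93.65% figure may shift the last digits):

```python
# Scores as displayed in the evaluation summary, in percent.
scores = {
    "Matches word count": 93.65,
    "Dialogue to Total Word Ratio": 0.00,
}

def overall_score(evaluator_scores):
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    return sum(evaluator_scores.values()) / len(evaluator_scores)

overall = overall_score(scores)
reported = 46.8248  # Overall value shown on the page

# The two agree to within the rounding of the displayed word-count score.
print(f"computed={overall:.4f}% reported={reported:.4f}%")
```

Under this assumption, a zero on either evaluator caps the overall score at half of the other evaluator's score.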
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The tactical blueprint was sketched in wax crayon on the back of a pepperoni pizza menu. Molly tapped a purple marker against the illustration of an inflatable castle. Bill adjusted his black ski mask, which he insisted on wearing indoors despite the heavy sweating and general suspicion it caused. He paced across the stained carpet, rehearsing his stealth walk while avoiding a tower of empty takeout containers. The afternoon light filtered through dusty blinds, illuminating floating particles of ambition and poor life choices. The kitchen clock ticked with ruthless precision. Dust motes danced in the stagnant air. Molly smoothed the wrinkled paper and aligned her pens. Bill stretched his hamstrings and practiced his silent breathing technique. A neighbor dog barked twice. The stakes were absurdly low yet financially critical. Molly straightened her posture and exhaled slowly. Bill cracked his knuckles and adjusted his weight from foot to foot. The neighborhood outside remained quiet, unaware of the impending felony. Squirrels darted across the lawn. A distant lawnmower droned. Destiny awaited beyond the vinyl siding. We need a solid distraction, Molly said. It is a kindergarten birthday party, Bill replied. We just slip in during the pinata rotation. The pinata rotation is heavily supervised. Parents form a defensive perimeter. We must breach the gift table before the sheet cake arrives. I calculate a nine minute window between musical chairs and total glucose collapse. Bill stopped pacing. He leaned over the crayon schematic and traced a route from the side fence to a cluster of lopsided rectangles representing wrapped loot. What about the entertainer? he asked. There is always a clown. I refuse to take a seltzer blast during a tactical extraction. The entertainer is a part time actuary named Gary. We neutralize him with a crisp twenty and a direct order to face the hedges. Neutralize Gary. Understood. How do we handle the guardians? 
Mothers will be filming everything on smartphones. Fathers will be debating charcoal techniques. We exploit the screen addiction. You wear the courier jumpsuit. I hold the aluminum clipboard. Bill nodded. He grabbed a stale breadstick and tapped the drawing of a plastic slide. I enter through the gate. I skirt the ball pit. I load the merchandise into reinforced canvas bags. You watch the patio cooler and whistle if anyone tracks the missing inventory. Perfect. We use the singing as acoustic cover. When they hit the final cheer, we clear the fence with thirty pounds of molded plastic and overpriced building sets. Do we leave a decoy? Bill asked. We leave the socks. Always leave the socks. We grab the large package with the silver ribbon. That holds the remote control monster truck. Molly rolled the menu and slid it into her coat. She checked her wristwatch. The suburban gathering began in ninety minutes. Professional criminals rarely targeted cartoon themed goodie bags, but the landlord demanded payment and ethics were negotiable. Stay sharp, she said. Avoid the scattered building bricks. Bill zipped his duffel and offered a grim salute. I was absolutely made for this specific operation.
created_at: "2026-04-27T11:02:17.007Z" (4/27/2026, 11:02:17 AM)
model: "qwen/qwen3.6-max-preview-20260420"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 3414
moderation_latency: (null)
generation_time: 237166
tokens_prompt: 77
tokens_completion: 9262
native_tokens_prompt: 76
native_tokens_completion: 8469
native_tokens_completion_images: (null)
native_tokens_reasoning: 7819
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.066157
router: (null)
provider_responses:
  0:
    endpoint_id: "ca7fc0c7-f23d-4ac5-b2e0-945be5b61787"
    id: "chatcmpl-67d6a4c6-bca4-9d5f-9c4c-7725299e293d"
    is_byok: false
    latency: 3414
    model_permaslug: "qwen/qwen3.6-max-preview-20260420"
    provider_name: "Alibaba"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777287737-Ex8pDooDy5pcKDnB48Hh"
session_id: (null)
api_type: "completions"
id: "gen-1777287737-nXd8kirzU03AphJGKOKE"
upstream_id: "chatcmpl-67d6a4c6-bca4-9d5f-9c4c-7725299e293d"
total_cost: 0.066157
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
response_cache_source_id: (null)
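The timing and token fields above can be cross-checked against the run details with a little arithmetic. The sketch below assumes that generation_time is in milliseconds and that native_tokens_completion includes the reasoning tokens; neither is stated on this page, and the helper name is hypothetical.

```python
# Values copied from the metadata above.
generation_time_ms = 237166        # assumed to be milliseconds
native_tokens_completion = 8469    # assumed to include reasoning tokens
native_tokens_reasoning = 7819

def format_duration(ms):
    """Render milliseconds as 'Xm Ys', dropping the fractional second."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"{minutes}m {seconds}s"

# Tokens left for the visible answer once reasoning is subtracted.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(format_duration(generation_time_ms))
print(visible_tokens, f"{reasoning_share:.1%}", f"{tokens_per_second:.1f} tok/s")
```

Under these assumptions the formatted duration matches the "3m 57s" shown in the run details, and most of the completion budget went to reasoning rather than to the roughly 500-word scene.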
Evaluation details
Result     Evaluator                      Details                           Meta Data
93.65%     Matches word count             n/a                               neededClean: false, words: 509
0.00%      Dialogue to Total Word Ratio   Ratio: 0.00%, Deviation: 50.00%   neededClean: false, wordsTotal: 509, wordsDialogue: 0
46.8248%   Overall
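The dialogue evaluator reports wordsDialogue as 0 even though the response contains spoken lines, and the response uses no quotation marks. One plausible explanation, stated here as an assumption rather than the evaluator's documented behavior, is that dialogue is detected by quotation marks. A minimal sketch of such a counter (the function name and the quote-matching rule are hypothetical):

```python
import re

def dialogue_ratio(text):
    """Return (dialogue_words, total_words, ratio), counting only words inside double quotes."""
    total_words = len(text.split())
    # Match straight or curly double-quoted spans; assumed delimiters for speech.
    quoted_spans = re.findall(r'["“]([^"”]*)["”]', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio

# The same sentence without and with quotation marks.
unquoted = "We need a solid distraction, Molly said."
quoted = '"We need a solid distraction," Molly said.'

target = 0.5  # the scenario asks for 50% dialogue
for sample in (unquoted, quoted):
    d, t, r = dialogue_ratio(sample)
    print(d, t, f"ratio={r:.2%}", f"deviation={abs(r - target):.2%}")
```

With a counter like this, the unquoted sentence contributes no dialogue words, which would reproduce the reported ratio of 0.00% and deviation of 50.00% for a response written entirely without quotation marks.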