NC Bench
Run: 2026-04-27-01KQ7991MWT28PCXXP0TTNSZTD
Run Details
Model: Qwen3.6 Max Preview
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 4m 38s
Cost: $0.079136
Provider: Alibaba
Evaluation summary
Score | Evaluator
51.93% | Matches word count
81.47% | Dialogue to Total Word Ratio
66.6956% | Overall
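The overall figure is consistent with a plain average of the two evaluator scores. The sketch below checks that arithmetic. It assumes equal weighting, which the page does not state, and it uses the rounded scores shown above, so it only matches the reported 66.6956% to within rounding.

```python
# Evaluator scores as displayed in the summary (already rounded to 2 decimals).
scores = {
    "Matches word count": 51.93,
    "Dialogue to Total Word Ratio": 81.47,
}

# Assumption: the overall score is the unweighted mean of the evaluator scores.
overall = sum(scores.values()) / len(scores)

reported_overall = 66.6956
print(f"computed: {overall:.2f}%  reported: {reported_overall}%")
```

The small gap between the two values is what one would expect if the site averages unrounded scores and rounds only for display.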
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprint spread across the folding table was actually a crayon drawing of a bouncy castle. Molly traced a red marker along the dotted fence line. Bill adjusted his night vision goggles over reading glasses. The surveillance van smelled like stale coffee and impending disaster. Streetlights flickered against the tinted windows. A half eaten granola bar rested near the gear shift. Molly adjusted her black turtleneck, ignoring the itch of cheap wool. Bill polished a plastic walkie talkie with a microfiber cloth. They reviewed the perimeter map again. Red circles marked the trampoline zone. Blue arrows indicated the snack table flank. Molly packed three zip ties and a pocketful of dog biscuits. Bill tested the tension on his utility belt. The leather creaked softly. Molly tapped the paper. "We need to neutralize the balloon artist first. If he starts twisting poodles, the children will swarm the patio." Bill nodded solemnly, treating the schematic like classified intelligence. "I already packed the earplugs. What about the piñata? That thing is a tactical nightmare." He smoothed his dinosaur patterned gloves. Outside, a dog barked at a sprinkler. "We cut the rope from the oak branch. The candy shower covers our approach. Simple geometry." Tomorrow afternoon, this quiet driveway would become a chaos zone of sugar fueled toddlers. The target was simple. "Brilliant. Remind me why we abandoned the diamond exchange for a suburban birthday party." A mountain of wrapped boxes sat near the patio door. Security was laughable. "Tax free inventory, Bill. Plus, my nephew wants the remote control dinosaur. We move at three." One exhausted clown, a bubble machine, and a golden retriever who surrendered for cheese. Molly checked her watch. The second hand ticked steadily. Bill unfolded a paper napkin detailing their escape route. "Check the gift table again. I spot three large boxes wrapped in silver paper." He cleared his throat. The air felt heavy with ambition.
"Those are definitely board games. Heavy, but highly resellable on the secondary market." They had cracked bank vaults and museum alarms, but this operation required different skills. It demanded absolute silence and a firm refusal to engage with toddlers. Molly zipped her tactical duffel. The metal teeth clicked sharply. "Agreed. What if the clown notices us crawling through the azaleas?" Bill stretched his fingers. They were prepared. "Toss a whoopee cushion. Clowns respect misdirection." The prize waited behind a streamer covered gate. Failure was unacceptable. "Noted. I will bring the extra cushions. Do we have a signal?" Victory meant a trunk stuffed with plastic toys and board games. They would strike during musical chairs. The distraction would be flawless. "Two whistles means grab boxes. One whistle means hide." The neighborhood would never suspect a thing. The van engine idled roughly, shaking the floorboards. "Understood. This is madness, Molly." Molly exhaled slowly, watching her breath fog the glass. The countdown had officially begun today. "Madness pays well enough."
created_at
"2026-04-27T10:55:33.539Z"
4/27/2026, 10:55:33 AM
model
"qwen/qwen3.6-max-preview-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1589
moderation_latency
(null)
generation_time
278050
tokens_prompt
77
tokens_completion
10735
native_tokens_prompt
76
native_tokens_completion
10133
native_tokens_completion_images
(null)
native_tokens_reasoning
9491
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0791362
router
(null)
provider_responses
0
endpoint_id
"ca7fc0c7-f23d-4ac5-b2e0-945be5b61787"
id
"chatcmpl-612bb07f-3b18-9af1-90b9-e88733e7edbd"
is_byok
false
latency
1589
model_permaslug
"qwen/qwen3.6-max-preview-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777287333-xVfTBetuYAtIxJ9sSUmf"
session_id
(null)
api_type
"completions"
id
"gen-1777287333-vSbMIZ21wFKXhWXKF3sI"
upstream_id
"chatcmpl-612bb07f-3b18-9af1-90b9-e88733e7edbd"
total_cost
0.0791362
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
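A few of the metadata fields can be cross-checked against the run details. The sketch below assumes `generation_time` is reported in milliseconds and that `native_tokens_reasoning` is counted inside `native_tokens_completion`. Neither is stated on the page, so treat both as assumptions.

```python
# Values copied from the metadata above.
generation_time_ms = 278050          # assumed to be milliseconds
native_tokens_completion = 10133
native_tokens_reasoning = 9491

# Convert the generation time to the "Xm Ys" form used in the run details.
minutes, seconds = divmod(generation_time_ms // 1000, 60)
duration = f"{minutes}m {seconds}s"

# Assumption: reasoning tokens are a subset of native completion tokens.
reasoning_share = native_tokens_reasoning / native_tokens_completion
visible_tokens = native_tokens_completion - native_tokens_reasoning

print(duration)
print(f"reasoning share: {reasoning_share:.1%}, visible tokens: {visible_tokens}")
```

Under those assumptions the duration agrees with the 4m 38s shown in the run details, and most of the completion tokens were spent on reasoning rather than on the scene itself.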
Evaluation details
Result | Evaluator | Details | Meta Data
51.93% | Matches word count | n/a | neededClean: false; words: 484
81.47% | Dialogue to Total Word Ratio | Ratio: 32.02%, Deviation: 2.02% | neededClean: false; wordsTotal: 484; wordsDialogue: 155
66.6956% | Overall
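The ratio and deviation in the details follow directly from the two word counts. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. The `dialogue_ratio` helper is hypothetical: it assumes dialogue is any text between straight double quotes and that words are whitespace-separated, which may differ from how the benchmark's evaluator tokenizes. How the 81.47% score is derived from the deviation is not shown on the page and is not modeled here.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a prose scene.

    Hypothetical helper: treats text inside straight double quotes as dialogue.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Small illustration on a made-up sentence in the style of the scene.
sample = 'Molly tapped the paper. "We move at three." Bill nodded.'
print(dialogue_ratio(sample))

# Reproduce the reported figures from the evaluator's own counts.
words_dialogue, words_total, target = 155, 484, 0.30
reported_ratio = words_dialogue / words_total
deviation = abs(reported_ratio - target)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {deviation:.2%}")
```

With the reported counts of 155 dialogue words out of 484, this gives a ratio of about 32.02% and a deviation of about 2.02 percentage points from the 30% target, matching the details row.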