NC Bench
Run: 2026-04-27-01KQ77ZTFXXAQQEV5FSB9K61FS
Run Details
Model:
Qwen3.6 Max Preview
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
2m 19s
Cost:
$0.040659
Provider:
Alibaba
Evaluation summary
Score      Evaluator
100.00%    Matches word count
65.73%     Dialogue to Total Word Ratio
82.8623%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Pass the blueprints, Bill." "It's a crayon drawing of a bouncy castle, Molly." "Exactly. The vault. Now, what's the guard rotation?" "Three sugar-high six-year-olds and a clown named Giggles wielding a balloon animal shiv." "Tough crowd. Did you secure the distraction?" "I've got a Piñata rigged with sleeping gas." "Amateur. You gas the kids, you wake the parents. We need chaos. Swap it for laxative-laden cupcakes." "Molly, that's a war crime against toddlers." "It's a heist, Bill. Focus. What's the extraction route?" "The plastic slide. I grease the chute, we sled out with the loot bags strapped tight." "And the primary target?" "Lego Death Star, Barbie, and a heavy pony plush." "The pony's the mark. Rest is filler. Remember: no eye contact with the birthday girl. She's the mastermind." "She's five years old, Molly." "She has eyes everywhere and a scream that shatters glass. Mask up. We move when the cake song starts." Bill adjusted his snout. "My dinosaur mask itches." "Deal with it. Now practice your happy dance. If you don't look festive, we're dead." "Do I wiggle my tail?" "Only if you want to trip the laser grid made of streamers. Keep the tail still and grab the pony."
created_at
"2026-04-27T10:33:02.738Z"
4/27/2026, 10:33:02 AM
model
"qwen/qwen3.6-max-preview-20260420"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1204
moderation_latency
(null)
generation_time
139367
tokens_prompt
77
tokens_completion
4716
native_tokens_prompt
76
native_tokens_completion
5200
native_tokens_completion_images
(null)
native_tokens_reasoning
4884
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0406588
router
(null)
provider_responses
0
endpoint_id
"ca7fc0c7-f23d-4ac5-b2e0-945be5b61787"
id
"chatcmpl-05c5a9c6-79f2-9c39-a945-3de5ad938e83"
is_byok
false
latency
1204
model_permaslug
"qwen/qwen3.6-max-preview-20260420"
provider_name
"Alibaba"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777285982-GAdAwiEEF7PwoT9zqMRu"
session_id
(null)
api_type
"completions"
id
"gen-1777285982-EmdZRAFtzURpe8Foytbi"
upstream_id
"chatcmpl-05c5a9c6-79f2-9c39-a945-3de5ad938e83"
total_cost
0.0406588
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
response_cache_source_id
(null)
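The metadata above can be turned into a few derived figures. The following is a minimal sketch, not part of the benchmark itself. It assumes that generation_time is in milliseconds (139367 ms is about 2m 19s, which agrees with the reported Duration) and that native_tokens_reasoning is counted inside native_tokens_completion. The function name derive_run_metrics is hypothetical.

```python
def derive_run_metrics(generation_time_ms: int,
                       native_tokens_completion: int,
                       native_tokens_reasoning: int) -> dict:
    """Derive throughput and reasoning share from the run metadata.

    Assumptions (not confirmed by the source): generation_time is in
    milliseconds, and reasoning tokens are a subset of completion tokens.
    """
    seconds = generation_time_ms / 1000
    return {
        # Completion tokens produced per second of generation time.
        "tokens_per_second": native_tokens_completion / seconds,
        # Fraction of completion tokens spent on reasoning.
        "reasoning_share": native_tokens_reasoning / native_tokens_completion,
        # Tokens left over for the visible answer.
        "visible_tokens": native_tokens_completion - native_tokens_reasoning,
    }


# Values copied from the metadata fields of this run.
metrics = derive_run_metrics(
    generation_time_ms=139367,
    native_tokens_completion=5200,
    native_tokens_reasoning=4884,
)
print(f"{metrics['tokens_per_second']:.1f} tokens/s")
print(f"{metrics['reasoning_share']:.1%} of completion tokens were reasoning")
print(f"{metrics['visible_tokens']} tokens in the visible answer")
```

Under these assumptions, most of the completion budget for this run went to reasoning rather than to the roughly 200-word scene.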
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
201
65.73%
Dialogue to Total Word Ratio
Ratio: 98.05%, Deviation: 8.05%
neededClean
false
wordsTotal
205
wordsDialogue
201
82.8623%
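The ratio and deviation reported above follow from the word counts: 201 dialogue words out of 205 total, measured against the 90% target in the scenario. The 4 remaining words match the only narration in the response, "Bill adjusted his snout." The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. The helper names and the straight-double-quote rule are assumptions for illustration; the evaluator's actual tokenization and the formula that maps the deviation to the 65.73% score are not given in the source.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    Assumption: dialogue is any text between straight double quotes and
    words are whitespace-separated. The real evaluator may differ.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation_from_target(ratio: float, target: float = 0.90) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Reproduce the reported figures from wordsDialogue and wordsTotal.
reported_ratio = 201 / 205
reported_deviation = deviation_from_target(reported_ratio)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")
# prints "Ratio: 98.05%, Deviation: 8.05%"
```

Here the response overshoots the target: it is almost entirely dialogue, so the ratio evaluator marks it down even though the word-count evaluator gives full credit.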