NC Bench
Run: 2026-02-16-01KHK22E23YPRPTD22P737AX1E
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
12.3s
Cost:
$0.000823
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
"Bill, are you sure this is the job?" Molly whispered, eyeing the colorful streamers. "It's a kindergarten birthday." "Trust me, Molls," Bill replied, adjusting his tiny plastic tiara. "Kids get the best loot. Unopened Lego sets? Goldmine." "But there are clowns," she hissed, pointing at a man juggling balloons. "Clowns are just cover. Nobody looks at the clown when the cake comes out." "I don't like the bounce house. It looks suspicious." "It's inflatable, Molly. That's all it is. Now, what's the plan?" "We wait for 'Happy Birthday,' then grab the pile by the slide." "Perfect. I'll create a distraction with the piñata." "You're going to hit a donkey with a stick?" "Exactly. Chaos is our friend. While they cry over broken candy, we vacuum up the gifts." "Vacuum? Bill, we don't have a vacuum." "Metaphorically, Molly. We scoop. Fast and quiet." "What if a parent sees us?" "We say we're the entertainment. The Magician and his Assistant." "You're the magician?" "Obviously. Now put on this nose. Blend in." "This feels wrong, Bill. Stealing from toddlers?" "Think of the resin figurines, Molly. Think of the gift cards. We're doing this for us." "Fine. But if a toddler bites me, I'm leaving you behind." "Deal. Here comes the cake. Get ready to magic some presents away."
Meta
created_at
"2026-02-16T11:06:31.968224+00:00"
2/16/2026, 11:06:31 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
836
moderation_latency
(null)
generation_time
12300
tokens_prompt
77
tokens_completion
322
native_tokens_prompt
78
native_tokens_completion
330
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0008232
router
(null)
provider_responses
0
id
"chatcmpl-73210752-d933-9d91-8168-ad0dd7efcfdc"
status
200
is_byok
false
latency
836
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771239979-VFfNSsWMxOp3gHobHxVa"
upstream_id
"chatcmpl-73210752-d933-9d91-8168-ad0dd7efcfdc"
total_cost
0.0008232
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result | Evaluator | Details | Meta Data
60.2752% | Matches word count | n/a | neededClean: false, words: 215
100.0000% | Dialogue to Total Word Ratio | Ratio: 90.23%, Deviation: 0.23% | neededClean: false, wordsTotal: 215, wordsDialogue: 194
80.1376%
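The Dialogue to Total Word Ratio figures reported here (194 dialogue words out of 215 total, a ratio of 90.23% and a deviation of 0.23% from the 90% target) can be checked with a short sketch. The evaluator's actual counting rules are not shown on this page, so the function below is an assumption: it splits on whitespace and treats everything between straight double quotes as dialogue. The name `dialogue_ratio` is hypothetical, and the function may not reproduce the evaluator's exact counts on the full response.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    Assumption: words are whitespace-separated tokens, and dialogue is any
    text enclosed in straight double quotes. The real evaluator may differ.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Small illustrative sample (not the full model response).
sample = '"Now put on this nose," Bill said. "Blend in."'
sample_result = dialogue_ratio(sample)

# Arithmetic check against the counts reported in the Meta Data column.
reported_ratio_pct = round(194 / 215 * 100, 2)            # reported as 90.23%
reported_deviation_pct = round(abs(reported_ratio_pct - 90.0), 2)  # reported as 0.23%

print(sample_result)
print(reported_ratio_pct, reported_deviation_pct)
```

In the sample, 7 of the 9 words fall inside quotes. Using the reported counts directly, 194 / 215 rounds to 90.23%, which is 0.23 percentage points from the 90% target stated in the scenario.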