NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK1D3HTEMSGVVMYRS3C8QSM
Run Details
Model:
Qwen 3.5 Plus (2026-02-15)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
13.5s
Cost:
$0.000782
Provider:
Alibaba
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his tiny clown nose, looking deeply uncomfortable in the oversized polka-dot suit. "This is undignified," he grumbled, tugging at the saggy trousers. Molly kicked the side door of the community center, checking the lock with a practiced flick of her wrist. "Stop whining. The payout is huge. Kids get the best loot." She pushed the door open, revealing a sea of colorful wrapping paper. "See? A gold mine." Bill stepped inside, his squeaky shoes betraying their position immediately. "Why a clown suit? I look like a nightmare." Molly grabbed a massive teddy bear, shoving it into her sack. "Because nobody suspects the clown. It's perfect camouflage." She pointed to a table groaning under video games. "Grab those consoles. Leave the cheap plastic junk." Bill hesitated near a cake shaped like a castle. "Do we take the cake? It looks expensive." Molly rolled her eyes, stuffing three remote-controlled cars into her pockets. "No cake, Bill. We are professionals, not gluttons. Focus on the electronics." Suddenly, a tiny voice echoed from the hallway. "Mommy, the clown is stealing my Switch!" Bill froze, panic widening his eyes behind the white greasepaint. "Run?" he whispered hoarsely. Molly sighed, hoisting the heavy bag onto her shoulder. "Yes, run. But try not to trip over your giant shoes this time." They sprinted toward the exit as forty angry toddlers began to chase them.
created_at
"2026-02-16T10:54:54.230587+00:00"
2/16/2026, 10:54:54 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
841
moderation_latency
(null)
generation_time
13446
tokens_prompt
77
tokens_completion
351
native_tokens_prompt
78
native_tokens_completion
313
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0007824
router
(null)
provider_responses
0
id
"chatcmpl-ccd342ea-768c-9cb8-b87b-b01a3674ac7d"
status
200
is_byok
false
latency
841
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771239280-OleP2BFiDBNslsxqQorh"
upstream_id
"chatcmpl-ccd342ea-768c-9cb8-b87b-b01a3674ac7d"
total_cost
0.0007824
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0098%
Matches word count
n/a
neededClean
false
words
231
3.1500%
Dialogue to Total Word Ratio
Ratio: 36.36%, Deviation: 13.64%
neededClean
false
wordsTotal
231
wordsDialogue
84
1.5799%