NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBEN71P4J2GYWZ5V9JATMSX
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
1m 43s
Cost:
$0.025779
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
97.9933%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted binoculars, peering through wet fabric of oversized raincoat. Bill nervously checked time on wrist, digital clock glowing sickly green in shadows of abandoned dumpster. They were supposed to be professionals, duo forged in fires of high-stakes vault heists, but here staring at suburban house decorated with plastic bunnies. Target simple. Birthday boy eight meant treasure behind wrapping paper likely filled with Lego sets and plastic blasters. Bill shifted weight, sneakers squeaking on damp pavement. House lights flickered, casting long shadows across lawn where neighbors parked sedans. Parents inside, distracted by cake cutting and inevitable screaming of toddlers. Front door unlocked, habit Molly noted from stakeout earlier afternoon. She lowered binoculars, nodded at Bill, face grim beneath hood. They slipped through gate like ghosts. They moved through hallway, footsteps muffled by thick carpet. Bill’s hand hovered over stack of gifts, trembling slightly. Molly grabbed shoulder, pointed toward wrapping table. Quickly, she said, "Grab fast. If mom yells, drop the box." They needed to vanish before first cry rang out from dining room. They grabbed pile. Bill said, "Run! Go fast. If they hear us, we die." Molly screamed, "Now! Run! Now!"
created_at
"2026-02-25T22:28:03.772Z"
2/25/2026, 10:28:03 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1318
moderation_latency
(null)
generation_time
103133
tokens_prompt
77
tokens_completion
10090
native_tokens_prompt
76
native_tokens_completion
12880
native_tokens_completion_images
(null)
native_tokens_reasoning
12615
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.025779
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-186da505-25b3-94b5-8adb-b0f6a7072049"
is_byok
false
latency
1318
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772058483-vPewO8JXb9ommznyeHJI"
upstream_id
"chatcmpl-186da505-25b3-94b5-8adb-b0f6a7072049"
total_cost
0.025779
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
192
100.00%
Dialogue to Total Word Ratio
Ratio: 10.36%, Deviation: 0.36%
neededClean
false
wordsTotal
193
wordsDialogue
20
97.9933%