NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBK55EFK36X86AN50SEB8WW
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
35.1s
Cost:
$0.013037
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The white van idled in the driveway, muffling bass from the neighbor's house. Balloons swayed against the windowpane, casting dancing shadows on the walls. Molly adjusted her visor, peering through the glass. They were two criminals standing in a suburban cul-de-sac. Bill tapped his fingers on the steering wheel, eyes darting to the rearview mirror. He was sweating. They had done banks, but this felt wrong. It felt like a joke. Are you sure about this? Bill asked, his voice trembling slightly. It feels like we’re robbing a school assembly. Molly ignored the question, leaning back in her seat. Children are careless, Bill. They value the paper more than the box. If we remove the wrapping, they won’t know what they got until morning. Misdirection. And the parents? Bill gestured toward the house. They’ll be busy cutting cake. Adults distracted by punch bowl and gorilla suit. Twenty minutes before first cry. Twenty minutes is plenty. Molly opened glove box, withdrew list. We don’t take cheap stuff. LEGO, games, dolls. Leave stuffed animals. Too soft. Crinkle in bag. But how past kids? Bill asked. Not known for security skills. Treehouse entrance. Roof low. Climb fence, drop yard, scale oak to second floor. Sliding door leads to party area. Climb oak tree? Bill looked at trunk. Slippery. Bouncy castle. Noise covers movement. Molly checked watch. Dress as entertainment. Bill blinked. What does that mean? Costumes. Ones in van. Duck and pirate. No one questions duck at birthday. Camouflage. Blend into theme. Insane. Bill shook head. Really insane. You’re planner. Tactical absurdity. Molly opened door. Move fast. Bill killed engine. Grabbed bag. What if they scream? Magic show. Bill followed her. If caught, I take fall. Always take fall. Molly pulled gloves. Steal plastic. The van door creaked open, revealing the dark interior of the vehicle. A heavy silence settled over the driveway as the party continued inside without a care in the world. They stepped into the shadows, preparing for the biggest heist in toy history.
created_at
"2026-02-25T23:46:40.728Z"
2/25/2026, 11:46:40 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
438
moderation_latency
(null)
generation_time
34953
tokens_prompt
77
tokens_completion
6711
native_tokens_prompt
76
native_tokens_completion
6509
native_tokens_completion_images
(null)
native_tokens_reasoning
6025
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.013037
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-8c8cddeb-7364-9875-a265-bbd047db9849"
is_byok
false
latency
438
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772063200-jp8fUd76eMkpciD0euLg"
upstream_id
"chatcmpl-8c8cddeb-7364-9875-a265-bbd047db9849"
total_cost
0.013037
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
334
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 30.00%
neededClean
false
wordsTotal
336
wordsDialogue
0
0.0000%