NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-25-01KJBGFBW6KYY4JWGFYNTRFEDS
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
54.7s
Cost:
$0.019733
Provider:
Alibaba
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars while Bill checked the time on his wristwatch outside the parked van. They sat in silence for a moment, the hum of the refrigerator in the background. They were watching the house. It looked easy. The gate was unlocked. They’re opening the balloons in ten minutes. That’s our window. Ten minutes is tight, Bill. The dad, Gary, moves like a linebacker when the lights go down. Exactly. That’s why we don’t go down there. We go up. The roof? We’re not doing the roof. I have bad knees. It’s a shed roof, Bill. You’re not climbing Mount Everest. You’re climbing a plastic picnic shelter. I know what it is! I just don’t like the physics of it. If we fall, we fall into the bounce house. Then we bounce. Think of the marketing opportunity. The Bouncing Brothers. I don’t want to be a meme, Molly. I want a pension. Okay, look. The plan is simple. You wait at the window. I scale the fence. We grab the pile. The pile? The Christmas pile. It’s huge. Look at that tower. It’s got to be two thousand dollars worth of plastic and cardboard. It’s illegal, Bill. It’s redistribution. It’s theft. Specifically, child theft. Not children. Presents. It’s a distinction without a difference in the eyes of the law. The law is very interested in presents when they go missing from a child’s eye line. Okay, okay. What if we leave one thing? A token gift. A token gift? You mean a bribe? A peace offering. A single teddy bear with a note that says, Sorry, we needed the Wi-Fi router. The router isn’t a toy. We can’t steal the router. Then we leave a note saying, The Wi-Fi was down anyway. That’s pathetic. It’s art, Bill. Is the fence electrified? No. It’s just angry raccoons. I hate raccoons. They smell like trash. Then we wear the scent masks. We smell like chicken. I hate smelling like chicken. Does anyone else know about this? Just us. And the raccoons. Mostly us. What if they have a dog? They don’t. The mom has a Poodle named Princess. Princess can’t run fast. We’re basing this whole heist on the speed of a Poodle? Princess is a lazy Poodle. She sleeps more than she barks. Okay, timing. When do we move? When the party starts. That’s when the adults are distracted by the cake. The cake is a lie. They always look at the cake until the second before we strike. That’s why we need to be fast. Grab, bag, go. How do we carry the presents? In the trunk. The van. It’s empty. We’re not hauling furniture. I’m hauling a van. That’s heavy lifting. You’re driving. You’re not lifting. I’m lifting. You’re shorter than me. You can’t lift a crib. It’s a toy crib. It’s lightweight. I don’t care if it’s a Lego crib! We’re stealing toys! Why are you resisting? Because I’m a good man, Molly! I used to deliver pizza. I know how to deliver happiness. We’re delivering misery. To the owners. To the owners? That’s cold. It’s the economy, Bill. High inflation. We’re helping the market. I’m going to jail for this. I’ll write it up. The Great Toy Heist. No. Don’t write it up. It’ll be a bestseller. Molly, please. Fine. We skip the book. But we grab the presents. It’s the principle. What if it’s a birthday party? Then it’s a birthday heist. That doesn’t make sense. It’s comedy, Bill. Logic left the chat. I’m the logic guy. Then you can explain to the kids why Santa is gone. Santa isn’t gone. Santa is just busy. Busy stealing our toys. I’m stealing them. You’re just helping. I thought I was the getaway driver. You’re the getaway driver. And the lookout. And the moral compass. Mostly the moral compass. I’m not a moral compass. I’m a man with a van. Then drive the van. Now. I don’t have the keys. You always have the keys, Bill. I don’t today. I left them on the table. You’re not getting out of this, Bill. We’re doing this. With the car. There’s no car. Then we walk. That’s a five-hour walk. Then run. Fast as the raccoons. We’re stealing toys. Why do we need speed? Because adults have good legs. Okay. Five minutes. Then we go. Wait. Is that a dog? No. That’s a balloon. It barked. It’s a trick. It’s a party trick. If it barks, we run. If it barks, we distract it with the chicken smell. I smell like chicken. This is bad. It smells like hope. I smell like grease and fear. Close enough. They moved as the lights flickered. The moment had arrived.
created_at
"2026-02-25T22:59:49.262Z"
2/25/2026, 10:59:49 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
416
moderation_latency
(null)
generation_time
54498
tokens_prompt
77
tokens_completion
9717
native_tokens_prompt
76
native_tokens_completion
9857
native_tokens_completion_images
(null)
native_tokens_reasoning
8666
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.019733
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-68dc0034-c7cf-9a9c-bfa4-02ecf2f4b8b2"
is_byok
false
latency
416
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772060389-muVU9sgOd5ejMaTFkfxn"
upstream_id
"chatcmpl-68dc0034-c7cf-9a9c-bfa4-02ecf2f4b8b2"
total_cost
0.019733
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
788
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
791
wordsDialogue
0
0.0000%