NC Bench
Run: 2026-02-25-01KJBF1Y2B5AW03WCWPQ6BA6JT
Run Details
Model:
Qwen 3.5 35B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
57.3s
Cost:
$0.013755
Provider:
Alibaba
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the wig in the rearview mirror of the stolen minivan while Bill was counting cash on the passenger seat. Are you counting or spending? Molly asked. I am calculating our net worth, Bill said. We are low on resources. We have this van, Molly said. And the bag. The bag is flimsy, Bill said. It will tear on the cardboard boxes. We are going in at night, Molly said. Nobody will know. The party is for six-year-olds, Bill said. They sleep like the dead. So we have a clear window, Molly said. Exactly thirty minutes. From what? Bill asked. To midnight, Molly said. After that, the dad wakes up and eats the cake. I like the cake plan, Bill said. I like the money plan, Molly said. We sell the stuff. The stuff is plastic, Bill said. It is not valuable. Toys are currency to kids, Molly said. Adults buy them for five hundred dollars. So we steal five hundred dollar toys? Bill said. Yes, Molly said. We strip the electronics from the boxes. And leave the empty boxes? Bill asked. Yes, Molly said. So the parents think they opened them. That is clever, Bill said. I am impressed. Stop the car, Bill said suddenly. Why? Molly asked. Do you smell that? Bill said. It smells like dirt, Molly said. It smells like dog, Bill said. Is it the dog? Molly asked. Yes, Bill said. We heard barking earlier. Where is the dog? Molly asked. In the garden, Bill said. He is a guard. We need to move fast, Molly said. I need the taser, Bill said. Do you have it? Molly asked. Yes, Bill said. In the glove box. Then get it, Molly said. Wait, Bill said. Do we take the dog? No, Molly said. We leave the dog. He might be lonely, Bill said. He has a crate, Molly said. We put him in the crate. Is the crate unlocked? Bill asked. Yes, Molly said. I checked. Then we are good, Bill said. Wait, Molly said. Do we have the mask? Yes, Bill said. For the face. I need the mask for my head, Bill said. That is a helmet, Molly said. I need the helmet, Bill said. You do not need a helmet, Molly said. 
You need a plan. I have the plan, Bill said. Then go, Molly said. Wait, Bill said. What about the alarm? It is not wired, Molly said. It is wireless. So how do we stop it? Bill asked. We hit the sensor with a balloon, Molly said. A balloon? Bill said. That is the distraction. Yes, Molly said. The piñata plan. You are serious? Bill said. Very, Molly said. I am going to cry, Bill said. Do not cry, Molly said. It ruins the mascara. I am a man, Bill said. You are a thief, Molly said. I am a thief, Bill agreed. Then move, Molly said. Wait, Bill said. Do we have the keys? Yes, Molly said. In the ignition. Then go, Bill said. Wait, Molly said. I forgot the bag. You forgot the bag? Bill said. Yes, Molly said. It is in the front seat. Then get it, Bill said. Okay, Molly said. Let's go, Bill said. Wait, Molly said. Is that a cat? No, Bill said. It is a dog. Okay, Molly said. Go, Bill said.
created_at
"2026-02-25T22:35:00.562Z"
2/25/2026, 10:35:00 PM
model
"qwen/qwen3.5-35b-a3b-20260224"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
425
moderation_latency
(null)
generation_time
57169
tokens_prompt
77
tokens_completion
6469
native_tokens_prompt
76
native_tokens_completion
6868
native_tokens_completion_images
(null)
native_tokens_reasoning
6108
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.013755
router
(null)
provider_responses
0
endpoint_id
"48c1210f-b7d2-4688-a340-f7424fc99d2b"
id
"chatcmpl-043290c9-4c84-973f-9153-0a38686a0ade"
is_byok
false
latency
425
model_permaslug
"qwen/qwen3.5-35b-a3b-20260224"
provider_name
"Alibaba"
status
200
api_type
"completions"
id
"gen-1772058900-QqzF8z2ON3wcfZnWbLEc"
upstream_id
"chatcmpl-043290c9-4c84-973f-9153-0a38686a0ade"
total_cost
0.013755
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
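The metadata above supports a few derived figures that help in reading the run. The following is a minimal sketch, not part of NC Bench. It assumes that native_tokens_completion includes the reasoning tokens, so the visible answer is the difference between the two counts. The function name derived_metrics is hypothetical.

```python
# Values copied from the metadata fields above.
GENERATION_TIME_MS = 57169
NATIVE_TOKENS_COMPLETION = 6868
NATIVE_TOKENS_REASONING = 6108


def derived_metrics(gen_ms, completion_tokens, reasoning_tokens):
    """Compute throughput and the share of output spent on reasoning.

    Assumption: completion_tokens already includes reasoning_tokens.
    """
    seconds = gen_ms / 1000
    return {
        "tokens_per_second": completion_tokens / seconds,
        "reasoning_share": reasoning_tokens / completion_tokens,
        "visible_tokens": completion_tokens - reasoning_tokens,
    }


metrics = derived_metrics(
    GENERATION_TIME_MS, NATIVE_TOKENS_COMPLETION, NATIVE_TOKENS_REASONING
)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Under that assumption, most of the completion budget went to reasoning rather than to the scene itself, which is worth keeping in mind when comparing cost across models.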
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false; words: 559
0.00% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 70.00% | neededClean: false; wordsTotal: 561; wordsDialogue: 0
0.0000% | Overall
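The wordsDialogue value of 0 is consistent with an evaluator that only counts words inside quotation marks, since the response contains none. NC Bench's actual evaluator is not shown on this page, so the following is only a sketch under that assumption. The names dialogue_ratio and deviation are hypothetical, and the 0.70 target comes from the scenario.

```python
import re

# Assumption: dialogue is any text enclosed in straight or curly double quotes.
QUOTED = re.compile(r'"([^"]*)"|\u201c([^\u201d]*)\u201d')


def count_words(text):
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(text):
    """Return the fraction of words that fall inside double quotes."""
    total = count_words(text)
    if total == 0:
        return 0.0
    quoted = sum(
        count_words(m.group(1) or m.group(2) or "")
        for m in QUOTED.finditer(text)
    )
    return quoted / total


def deviation(ratio, target=0.70):
    """Absolute distance from the target dialogue ratio."""
    return abs(ratio - target)


unquoted = "Are you counting or spending? Molly asked."
quoted = '"Are you counting or spending?" Molly asked.'
print(dialogue_ratio(unquoted), deviation(dialogue_ratio(unquoted)))
print(dialogue_ratio(quoted), deviation(dialogue_ratio(quoted)))
```

With this kind of rule, the unquoted line scores a ratio of 0 and a deviation of 0.70, matching the figures reported above, while the same line with quotation marks counts five of its seven words as dialogue.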