NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFHWEG79AGPZVR6C81KH4R0
Run Details
Model:
Qwen 3.5 9B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 44s
Cost:
$0.001972
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
47.9934%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his blindfold tightly and nodded at the towering Christmas tree. Molly wiped sweat from her forehead with a gloved hand, her breath quickening in the silence of the suburban house. They both knew the smell of fresh cookies was coming soon, but the tension remained high for the break-in. She adjusted her grip on the crowbar, waiting for the green light. The street lights flickered, revealing the heavy curtains of the neighbor's house across the way. She looked at Bill, and he nodded back at her again. The crowd in the street watched them closely from afar. They had to move fast, or else the whole plan could fail now. The house was quiet, save for the sound of a dog barking. A car drove by slowly outside the window, shining its headlights. It was midnight, the perfect time for a heist of this nature. The shadows were long and stretched across the garden gate. Molly felt the heat from the wind and the moon above. Bill nodded again, signaling it was time to proceed. The door was locked tight with a creak. She reached for the door, feeling the cold metal handle. The lock was old and rusty. The tumblers turned slowly, grinding together. Bill: You sure this is the right house, Molls? Molly: Yes. I double-checked the address. We need to be professional. Bill: Professionalism is our middle name. Molly: So, what is the loot plan, exactly, Bill? Bill: Keep it simple. All the boxes go to the van. Molly: That is a lot of boxes for one trip. Bill: They are gifts. They are definitely special. Molly: Breakfast presents, not lunch presents. Bill: All the same to me. I don't care. Molly: What about the kids? Bill: They are asleep in the minty room. Molly: So we sneak quietly, like cats. Bill: I have the rope. Molly: Wait, do we have the noise? Bill: We use the radio. Molly: What will everyone hear? Bill: Just music and toys crying. Molly: I hear the floorboards. Bill: You walk very loud. Molly: I meant I hear the floorboards. Bill: Fine. Floor. No carpet. Molly: My feet are hurting. Bill: We need to get fast. Molly: It isn't running, Bill. Bill: Crawl, Molly. Quietly. Molly: I'm crawling right now. Bill: Good. Good. Molly: You are silent, aren't you? Bill: I am the ghost. Molly: Okay. Bill: Now the big time. Molly: The what? Bill: The big boxes. Molly: We take them out? Bill: We take them out. Molly: Do we feed them? Bill: No, we take them. Molly: Why? Bill: Because they are ours now. Molly: That sounds bad. Bill: We share the money. Molly: Okay, but will you catch them? Bill: I catch them bags. Molly: Okay. Bill: Listen. Molly: I hear you. Bill: The door. Molly: Open it. Bill: I need a key. Molly: No need. Bill: Do we have it? Molly: No need. Bill: Okay. Bill: No. Molly: What? Bill: Don't say no. Molly: Do we go? Bill: Yes. Molly: Let's do it.
created_at
"2026-03-11T22:57:06.069Z"
3/11/2026, 10:57:06 PM
model
"qwen/qwen3.5-9b-20260310"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
735
moderation_latency
(null)
generation_time
163987
tokens_prompt
77
tokens_completion
12736
native_tokens_prompt
76
native_tokens_completion
13097
native_tokens_completion_images
(null)
native_tokens_reasoning
11935
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00197215
router
(null)
provider_responses
0
endpoint_id
"00d506d9-a5e9-4382-a2b9-d54732365594"
id
"oagnNQx-6Ng1vN-9dae27fd02810cf3"
is_byok
false
latency
735
model_permaslug
"qwen/qwen3.5-9b-20260310"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773269826-vemJMB719r76yyz2IUVY"
upstream_id
"oagnNQx-6Ng1vN-9dae27fd02810cf3"
total_cost
0.00197215
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
508
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
510
wordsDialogue
0
47.9934%