NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFCE7QQRTAZT9C3CMN27CBM
Run Details
Model:
Qwen 3.5 9B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
2m 7s
Cost:
$0.001341
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.03%
Dialogue to Total Word Ratio
(details)
0.0132%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly wiped her hands on her corduroy pants. She stood under the pale moonlight that cast long shadows on the ground and the wind blew her coat. She looked at the neighbor next door. "Bill, do you smell that? It smells like potato chips and imminent arrest." Bill popped a lentil into his mouth. He looked at the house across the street. "That’s caramel corn, Molly. It’s the smell of opportunity. Opportunity that’s going to get us booked for grand theft innocence." The wind moved the plastic leaves above them. Molly sighed and looked at the house again. The lights were dark inside. Bill whispered softly. "We don't steal joy," Bill whispered. "We steal joy's merchandise." "Joy doesn't have a barcode." "It does." "It’s on the box." Bill pointed a finger at the house. "See the yellow slide? It goes in bags." Bags get caught. Molly pulled her mask down. "We need masks." "Do masks scream? No." "They whisper. Till they don't." "Till we start screaming. They blink less. Good for us. Less motion means less noise." Bill adjusted his glasses. "I meant less blinking," Bill smirked. "Blink Rate Control." "Right. Got it. Blink Rate Control. They blink alone." Molly counted the chairs. "How many presents are we talking? Maybe fifty? One hundred." The house light flickered. "That’s a lot. The floor is slippery if we are heavy." "I have shoes with grip. You have boots." "They are too big. They trip." "They are perfect for walking." We’re leaving at midnight. Bill stated. "Midnight?" Molly asked. "That’s early for a vampire." "It’s late for the adults." "The parents will be asleep. Guests will be asleep. Babysitters will be asleep." Bill moved closer. "We move like ghosts." Bill leaned forward. "Ghosts aren't nice." "They steal hearts and leave them with nothing." "We steal toys. No one cries." "Toys are heavy. We have back support." "You said we’d go home." "Now we think." The ground was hard and cold under their boots. "We think," Bill agreed. "Why is it raining?" Molly asked the sky. "Water is good for the garage." "We need water." "We need presents." "I say we go." Bill nodded his head slowly. "Okay. I say we go." Molly stepped forward. "Okay," Bill echoed. They jumped down from the roof. "We are the cavalry." "We are." The floor wasn't slippery. "We are." "Maybe more." "We are." Bill walked down the path. "Do you think it’s working?" He walked to them. Molly followed him. "Yes, it is working perfectly." "We have the presents." "We have the present box." "We have the boxes inside." "We have the box." "We have the gift." "We have the gift."
created_at
"2026-03-11T21:21:57.507Z"
3/11/2026, 9:21:57 PM
model
"qwen/qwen3.5-9b-20260310"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
195
moderation_latency
(null)
generation_time
127085
tokens_prompt
77
tokens_completion
8966
native_tokens_prompt
76
native_tokens_completion
8887
native_tokens_completion_images
(null)
native_tokens_reasoning
8300
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00134065
router
(null)
provider_responses
0
endpoint_id
"00d506d9-a5e9-4382-a2b9-d54732365594"
id
"oagH7fL-6Ng1vN-9dad9c9e8180ccc4"
is_byok
false
latency
195
model_permaslug
"qwen/qwen3.5-9b-20260310"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773264117-FwUY8QykrQZeE6kTomyP"
upstream_id
"oagH7fL-6Ng1vN-9dad9c9e8180ccc4"
total_cost
0.00134065
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
444
0.03%
Dialogue to Total Word Ratio
Ratio: 62.84%, Deviation: 12.84%
neededClean
false
wordsTotal
444
wordsDialogue
279
0.0132%