NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFCSFKFVR09264BZB6NPXJQ
Run Details
Model:
Qwen 3.5 9B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
2m 14s
Cost:
$0.001143
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
49.27%
Dialogue to Total Word Ratio
(details)
24.6370%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her eye against the frosted glass. The blur of giggling children inside made her heart race with a feverish thrill. She turned to Bill behind her and wiped the condensation from the pane. Bill was already smiling, his grin sharp enough to cut ribbon. He wiped the dust from his shoulders before pulling his mask low. "Bill, look at that. That red teddy bear in the center. It is worth at least fifty dollars." "It is a plush." "A plush thief never sleeps." "We cannot leave without the bear." "We cannot leave anything behind, Molly." "You are always attached to the gold." "The gold is in the box under the tree." "There are three bags." "I only have two pockets." "You can wear the bag over your head." "That looks like a ghost." "And very stylish." "You hate the bags." "I told the mom about the bags." "Why did you do that?" "She asked if I was carrying anything." "She asked for a bag." "Do you need me to give you the bag?" "Yes, please." "Why does it smell like air?" "It smells like us." Molly checked her watch. The house sat silent on the hill. The moonlight filled the garden. The kids inside laughed without knowing the danger. "They will know. The parents will know." "We are not telling them." "We are telling the bear." "The bear eats dinner." "The bear hates thieves." "The bear likes thieves." "The bear is silent." "The bear is gold." "Bill, stand back." "My friend is standing back." "Who is your friend?" "The bear." "I am going through the window." "You never go through the window." "I am going through the air." "You never go through the air." "I am going through the house." "You never go through the house." "You are a professional." "I am a professional." "The mother is a professional." "The mother is not a professional." "She holds a balloon." "A balloon is a weapon." "A balloon is a shield." "A balloon is a weapon for children." "I cannot fight the balloon." "I cannot fight the balloon." "We have to move." "We have to move." "Before the cake is gone." "The cake is gone." "Then we get the cake." "The cake is gone." "We are rich." "We are broke." The car hummed quietly in the night. The door to the garage stood open. They stepped forward to the door. The wind blew softly.
created_at
"2026-03-11T21:28:06.014Z"
3/11/2026, 9:28:06 PM
model
"qwen/qwen3.5-9b-20260310"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1200
moderation_latency
(null)
generation_time
133623
tokens_prompt
77
tokens_completion
7458
native_tokens_prompt
76
native_tokens_completion
7570
native_tokens_completion_images
(null)
native_tokens_reasoning
6854
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0011431
router
(null)
provider_responses
0
endpoint_id
"00d506d9-a5e9-4382-a2b9-d54732365594"
id
"oagK19E-3pDw3Z-9dada59da387f075"
is_byok
false
latency
1200
model_permaslug
"qwen/qwen3.5-9b-20260310"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773264486-MZbRzad4RCDvamOYgdUT"
upstream_id
"oagK19E-3pDw3Z-9dada59da387f075"
total_cost
0.0011431
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
404
49.27%
Dialogue to Total Word Ratio
Ratio: 73.76%, Deviation: 3.76%
neededClean
false
wordsTotal
404
wordsDialogue
298
24.6370%