NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFC4DH3SPAKE7EWMQ86A0E9
Run Details
Model:
Qwen 3.5 9B
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
1m 22s
Cost:
$0.000961
Provider:
Together
Evaluation summary
Score
Evaluator
Details Link
13.53%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The scent of burnt sugar and yeast hung heavy over the cracked counter. "Did you see him," the woman began, "behind the glass?" "He looked like a shadow,"
the man replied
, "but he wasn't moving." "Is that right?"
she asked
, wiping flour from a finger. "He went out the back." "Before the rush?" "Yes." "He smelled of almonds." "I didn't smell anything." "Except the box." "There was no box." "I saw dust." "Dust is normal." "Not this much." "Dave was there." "Dave?" The man paused. "The baker?" "Yes." "Dave is gone." "Where?" "In the basement." "Did you see the man?" "He's not coming back." "You know who he is?" "The one who took the box." "I know." "From the corner." "It was torn." "From the shelf." "It wasn't." "It was." "The floor." "Is it there?" "No." "The floor." "It's gone." "No." "It is." "Dave." "Dave?" "Gone." "No." "Dave is there." "Leave him be." "I need to know." "You want the money?" "The money is cold." "Dave knows." "Dave is dead." "Dave is gone." "In the basement." "Yes." "Show me." "I can't."
created_at
"2026-03-11T21:16:35.76Z"
3/11/2026, 9:16:35 PM
model
"qwen/qwen3.5-9b-20260310"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
339
moderation_latency
(null)
generation_time
81748
tokens_prompt
81
tokens_completion
6332
native_tokens_prompt
78
native_tokens_completion
6356
native_tokens_completion_images
(null)
native_tokens_reasoning
5985
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0009612
router
(null)
provider_responses
0
endpoint_id
"00d506d9-a5e9-4382-a2b9-d54732365594"
id
"oagFTvq-3pDw3Z-9dad94c39168f0b5"
is_byok
false
latency
334
model_permaslug
"qwen/qwen3.5-9b-20260310"
provider_name
"Together"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773263795-BXOlnKfaa29WbV0jxJph"
upstream_id
"oagFTvq-3pDw3Z-9dad94c39168f0b5"
total_cost
0.0009612
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Together"
Evaluation details
Result
Evaluator
Details
Meta Data
13.53%
Count dialogue tags
n/a
neededClean
false
dialogueTags
0
"the man replied"
1
"she asked"