NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8MQM41HY435R7FTCAQ8ZD
Run Details
Model:
Qwen3 235B A22B Instruct 2507
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
20.2s
Cost:
$0.000200
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The morning sun slanted through the bakery window, dust motes dancing above baskets of cinnamon rolls and sourdough loaves. The bell above the door jingled as it closed. “You’re in early. Everything alright?” “Saw something strange last night. Down by the old mill path.” “Again? What this time, glowing eyes in the woods?” “Funny. No, it was Martha Hargrove’s cat. Sitting right on Judge Pembleton’s porch. Dead center. Like it was waiting.” “That cat’s been missing for weeks. Poor Martha’s putting up posters everywhere.” “And now it’s on that crotchety man’s porch like a macabre doorstop. He wasn’t home. At least, no lights.” “You think someone put it there on purpose?” “Why? What would that prove? Unless… unless it’s a clue.” “A clue? To what, a missing feline caper?” “Don’t scoff. That cat vanished the same night old man Willoughby turned up with his prize marigolds uprooted. And you remember how he blamed the Pembletons?” “You think this is marital sabotage by proxy?” “Or something worse. That cat knew things. Heard things. Pets always do.” “You’ve been reading too many detective novels.” “Or I’m onto something. Either way, somebody moved that cat. Somebody who didn’t want it found in the creek where I found it yesterday.” “You didn’t tell me that.” “Because now it’s on a porch. And nothing in this town ends up somewhere without meaning.”
created_at
"2026-03-25T10:29:01.192Z"
3/25/2026, 10:29:01 AM
model
"qwen/qwen3-235b-a22b-07-25"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1491
moderation_latency
(null)
generation_time
20196
tokens_prompt
81
tokens_completion
405
native_tokens_prompt
72
native_tokens_completion
323
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00020028
router
(null)
provider_responses
0
endpoint_id
"ce835ea8-00c7-4686-831d-7733dac5e2d9"
id
"019d248a61b0229bc263e9e3eda1ca82"
is_byok
false
latency
1490
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"SiliconFlow"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434541-XyZh1KUltUSOPx4nobON"
upstream_id
"019d248a61b0229bc263e9e3eda1ca82"
total_cost
0.00020028
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)