NC Bench
Run: 2026-03-25-01KMJ8MKN3P2Y4Y9Y0MJF0GF2H
Run Details
Model: Qwen3 235B A22B Instruct 2507
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 20.0s
Cost: $0.000203
Provider: Novita
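The Duration and Cost shown here appear to be rounded forms of the generation_time and usage fields in the response metadata further down. The sketch below checks that reading and derives a rough throughput figure. It assumes generation_time is in milliseconds and usage is in US dollars; the report states neither unit, and the function names are hypothetical.

```python
# Values copied from this run's response metadata.
GENERATION_TIME_MS = 20009       # generation_time (unit assumed: milliseconds)
USAGE_USD = 0.0002031            # usage / total_cost (unit assumed: US dollars)
NATIVE_COMPLETION_TOKENS = 339   # native_tokens_completion


def summary_duration(generation_time_ms: int) -> str:
    """Format a millisecond duration the way the run summary shows it."""
    return f"{generation_time_ms / 1000:.1f}s"


def summary_cost(usage_usd: float) -> str:
    """Format a dollar amount with six decimals, as in the run summary."""
    return f"${usage_usd:.6f}"


def tokens_per_second(tokens: int, generation_time_ms: int) -> float:
    """Rough throughput: completion tokens divided by generation time."""
    return tokens / (generation_time_ms / 1000)


if __name__ == "__main__":
    print(summary_duration(GENERATION_TIME_MS))
    print(summary_cost(USAGE_USD))
    print(round(tokens_per_second(NATIVE_COMPLETION_TOKENS, GENERATION_TIME_MS), 1))
```

Under those assumptions the formatted values match the summary ("20.0s" and "$0.000203"), and the throughput works out to roughly 17 native completion tokens per second.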
Evaluation summary
Score: 100.00%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
AI
The morning sun warmed the checkered tiles of the bakery, where the scent of cinnamon and burnt sugar curled through the air. A bell chimed above the door, and flour dust danced in the light.
“You’re in early.”
“Couldn’t sleep. Mrs. Peabody’s rooster started crowing at three again. Claims it’s a faulty internal clock, but I suspect she wound it up on purpose.”
“That woman thrives on chaos.”
“Remember the great zucchini incident of last summer?”
“Heart attack at the farmers’ market. I still can’t look at squash the same way.”
A loaf cracked in the cooling rack. The counter gleamed with jam jars and chipped teacups.
“New scones today?”
“Lemon poppy, with a hint of lavender from my garden. Though after the incident with Miss Clara’s cat and the essential oil bottle, I’ve started locking the shed.”
“She still thinks her tabby’s missing?”
“Officially vanished. Though Doris swears she saw him sunbathing on old man Hemsworth’s porch last Tuesday.”
“Hemsworth hates cats.”
“Exactly. Which is why I think he’s hiding something.”
“You always find conspiracy in pastry.”
“And you ignore it. That’s why Doris found three teacups in the recycling when she only drank one coffee last week.”
“She must’ve miscounted.”
“Three cups. One saucer. And a smear of strawberry jam that doesn’t match her usual preserves.”
Silence. The oven timer buzzed.
“You think he’s involved?”
“I think no one eats jam at eleven a.m. unless they’re hiding something.”
created_at
"2026-03-25T10:28:57.127Z"
3/25/2026, 10:28:57 AM
model
"qwen/qwen3-235b-a22b-07-25"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1158
moderation_latency
(null)
generation_time
20009
tokens_prompt
81
tokens_completion
432
native_tokens_prompt
72
native_tokens_completion
339
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0002031
router
(null)
provider_responses
0
endpoint_id
"cf186489-d252-4fb7-aca9-87ef7b557eff"
id
"eb7988af181e91b0f66df08d373af788"
is_byok
false
latency
1157
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"Novita"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434537-uCnkRPIZNuBh5LdHYI9k"
upstream_id
"eb7988af181e91b0f66df08d373af788"
total_cost
0.0002031
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result: 100.00%
Evaluator: Count dialogue tags
Details: n/a
Meta Data: neededClean: false, dialogueTags: (empty)
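The report does not show how the "Count dialogue tags" evaluator works. As a rough illustration only, a heuristic counter might look for a speech verb sitting directly next to a curly quotation mark. Everything in the sketch below is an assumption: the verb list, the patterns, and the function name count_dialogue_tags are hypothetical and are not the benchmark's actual implementation.

```python
import re

# Hypothetical verb list; the real evaluator's vocabulary is not shown in the report.
SPEECH_VERBS = ("said", "asked", "replied", "whispered", "muttered", "shouted", "answered")
_VERBS = "|".join(SPEECH_VERBS)

# Tag after a closing curly quote: ” she said / ” said Doris / ” old Doris asked
TAG_AFTER_QUOTE = re.compile(rf"”\s*(?:[A-Za-z.]+\s+){{0,2}}(?P<verb>{_VERBS})\b")

# Tag before an opening curly quote: Doris asked, “...
TAG_BEFORE_QUOTE = re.compile(rf"\b(?P<verb>{_VERBS})\b[,:]?\s*“")


def count_dialogue_tags(text: str) -> int:
    """Count speech verbs adjacent to a quotation (heuristic, curly quotes only)."""
    verb_positions = set()
    for pattern in (TAG_AFTER_QUOTE, TAG_BEFORE_QUOTE):
        for match in pattern.finditer(text):
            # Record the verb's position so a tag sitting between two quotes
            # (“Hi,” she said, “come in.”) is counted once, not twice.
            verb_positions.add(match.start("verb"))
    return len(verb_positions)


if __name__ == "__main__":
    excerpt = (
        "“You’re in early.” “Couldn’t sleep.” "
        "A loaf cracked in the cooling rack. “New scones today?”"
    )
    print(count_dialogue_tags(excerpt))                 # unattributed excerpt from this run
    print(count_dialogue_tags("“Hello,” she said."))    # attributed line
```

On the excerpt from this run the sketch finds no tags, which is consistent with the empty dialogueTags field, while the attributed line yields one. A heuristic like this will miss unusual verbs and straight quotes, so it is only a starting point.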