NC Bench
Run: 2026-03-25-01KMJ7ZX3WTSXGNRT6PJFH3JEA
Run Details
Model: Qwen3 235B A22B Instruct 2507
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 5.8s
Cost: $0.000044
Provider: WandB
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 36.79%
Overall: 18.3940%
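The Overall figure is consistent with an unweighted mean of the two evaluator scores. The page does not state how Overall is computed, so the sketch below treats the mean as an assumption; `overall_score` is a hypothetical helper name. Averaging the displayed (rounded) scores gives 18.395%, slightly above the displayed 18.3940%, which suggests the unrounded ratio score is closer to 36.788%.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores (assumed aggregation, not confirmed by the page)."""
    return sum(scores) / len(scores)

# Scores as displayed on the page, in percent.
displayed = [0.00, 36.79]
mean_of_displayed = overall_score(displayed)
print(f"Mean of displayed scores: {mean_of_displayed:.4f}%")

# Working backwards from the displayed Overall to the unrounded ratio score,
# still under the assumption that Overall is a plain mean.
implied_ratio_score = 18.3940 * len(displayed) - 0.00
print(f"Implied unrounded ratio score: {implied_ratio_score:.3f}%")
```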
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly crouched behind a bush shaped suspiciously like a cartoon dinosaur, binoculars glued to her eyes, which were also slightly crossed from prolonged use. The backyard of the birthday party shimmered with helium balloons tethered to lawn chairs like tiny, festive prisoners. Streamers flapped in the wind like flags of surrender. Children shrieked with the unhinged joy only three sugar rushes can provide. Bill adjusted his trench coat—entirely inappropriate for a children’s party in July—and checked his watch for the seventh time in two minutes. His disguise, a paper name tag that read *Hi! I’m Uncle Dan!*, fluttered in the breeze. He had even drawn a smiley face on it, slightly deranged in its enthusiasm. Molly lowered the binoculars and whispered something about a piñata full of cash, but Bill wasn’t listening. He was imagining the look on the clown’s face when he realized the goody bags had vanished. He might even cry. That would be funny. The cake arrived—a five-tier monstrosity shaped like a unicorn vomiting rainbows—and Bill saw his chance. While the children chanted, the clown wrestled with a stray Chihuahua in a superhero cape. Molly gave the signal: one sharp nod and a jazz hand. They moved like shadows trained in interpretive dance. Molly snatched wrapped boxes with the grace of a raccoon in a grocery store, while Bill stuffed party favors into a duffel bag labeled *Mom’s Yoga Gear*. The clown turned. They froze, smiling nervously. The clown frowned, then burst into laughter, thinking it was a game. By the time the last present vanished into the night, only confetti remained—a glittering testament to theft wrapped in whimsy.
created_at: "2026-03-25T10:17:38.688Z" (3/25/2026, 10:17:38 AM)
model: "qwen/qwen3-235b-a22b-07-25"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 72
moderation_latency: (null)
generation_time: 5673
tokens_prompt: 77
tokens_completion: 433
native_tokens_prompt: 72
native_tokens_completion: 367
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0000439
router: (null)
provider_responses:
  0:
    endpoint_id: "043eb28f-fc28-4b0d-9800-b249fcfcfbf9"
    is_byok: false
    latency: 8
    model_permaslug: "qwen/qwen3-235b-a22b-07-25"
    provider_name: "Parasail"
    status: 429
  1:
    endpoint_id: "c9f927db-d559-49eb-9846-80862c5e3b4e"
    id: "chatcmpl-91f4e8314443d791"
    is_byok: false
    latency: 25
    model_permaslug: "qwen/qwen3-235b-a22b-07-25"
    provider_name: "WandB"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1774433858-dlZdPEAHfa2ah9u69wDZ"
upstream_id: "chatcmpl-91f4e8314443d791"
total_cost: 0.0000439
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "WandB"
Evaluation details
Evaluator: Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean: false; words: 277

Evaluator: Dialogue to Total Word Ratio
Result: 36.79%
Details: Ratio: 0.00%, Deviation: 10.00%
Meta Data: neededClean: false; wordsTotal: 278; wordsDialogue: 0

Overall: 18.3940%
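The ratio and deviation reported for the dialogue evaluator can be reproduced from its word counts. The sketch below is a minimal illustration, not the benchmark's actual evaluator: it assumes dialogue means text inside double quotes, that words are whitespace-separated tokens, and that the target ratio is the 10% named in the scenario. The helper names (`count_words`, `dialogue_words`, `ratio_and_deviation`) are hypothetical, and this tokenization may not reproduce the page's 277 and 278 word counts exactly.

```python
import re

def count_words(text: str) -> int:
    """Count whitespace-separated tokens (an assumed tokenization)."""
    return len(text.split())

def dialogue_words(text: str) -> int:
    """Count words inside straight or curly double quotes (assumed definition of dialogue)."""
    quoted = re.findall(r'["“]([^"”]*)["”]', text)
    return sum(count_words(q) for q in quoted)

def ratio_and_deviation(n_dialogue: int, n_total: int, target: float = 0.10):
    """Return (dialogue ratio, absolute deviation from the target ratio)."""
    ratio = n_dialogue / n_total if n_total else 0.0
    return ratio, abs(ratio - target)

# Word counts reported in the evaluation details: wordsDialogue 0, wordsTotal 278.
ratio, deviation = ratio_and_deviation(0, 278)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 0.00%, Deviation: 10.00%"
```

The model response contains no quoted speech (Molly's line is reported indirectly as "whispered something about a piñata"), which is why the dialogue word count is 0 and the deviation equals the full 10% target.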