NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8MF0D55YKFMBHWTJFF5BK
Run Details
Model:
Qwen3 235B A22B Instruct 2507
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
9.9s
Cost:
$0.000089
Provider:
WandB
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0152%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a bush shaped like a startled penguin, binoculars pressed to her face. “Bill, are you *sure* this isn’t a trap? That clown hasn’t stopped smiling since we got here. It’s unnatural.” Bill adjusted his fake mustache, which had slipped to the left. “Relax. Clowns are nature’s white noise. Besides, no one suspects two concerned parents scouting a venue for their nephew’s *third* birthday.” “He’s your *nephew*, Bill. You brought the juice boxes and glitter bomb.” “That was *one* time! And it was self-defense. No one brings glow worms to a finger-painting station and lives to tell the tale.” Molly lowered the binoculars. “Focus. We’ve got seventeen kids, two screaming parents, one balloon animal warlord, and a piñata shaped like a unicorn riding a dinosaur. If we time this right, we hit during the treasure hunt.” Bill scratched his elbow, dislodging a puff of fake hair. “You think they’ll fall for the ‘lost puppy’ routine again?” “Only if you stop sneaking into the costume closet and coming out as a raccoon with emotional issues. Stick to the plan. I cause a distraction, you grab the gifts.” “What kind of distraction?” “I haven’t decided. Possibly crying. I’m thinking *dramatic* crying. With wind.” “Wind how?” “The sprinkler system, Bill! I’m going to trip the sprinklers, everyone panics, you swoop in like… like a gift-goblin.” “I prefer *artisanal retrieval specialist*.” “Fine. You swoop in like an artisanal retrieval specialist and take the loot. Then meet me at the snack table.” “The snack table? Why the snack table?” “Because no one ever checks the snack table. Also, I need backup cupcakes. I haven’t eaten since the failed cupcake heist of ’22.” “That wasn’t a *heist*, Molly. You ordered six hundred cupcakes and got cold feet.” “They had *sprinkles*, Bill. *Rainbow* sprinkles! I panicked. Who can think clearly around that much joy?” Bill checked his watch. “Party’s been going for twenty minutes. Kids are still upright. Peak vulnerability in ten—post-sugar rush, pre-tantrum.” “Then we move fast. You’ve got the duffel bag?” Bill patted a sack labeled “Mom’s Yoga Gear.” “Full of tissue paper and hope.” “Perfect. And remember, no heroics. No adopting stray ferrets.” “Those ferrets were *misunderstood*.” “And *covered in glitter*.” A sudden shriek erupted from the yard. The clown had made a giraffe that looked suspiciously like a rabid badger. Children scattered. Molly hissed, “Now! Go!” Bill stood, marched toward the gift pile, and immediately tripped over a plush triceratops. “I’m okay! Just reacquainting myself with gravity!” Molly facepalmed. “Get up! The clown’s coming!” Bill scrambled, stuffing presents into the bag. “Too many squeeze toys! Why are there so many squeaky parrots?!” “Leave the parrots! Take the gift cards!” “The monkey has a *savings account*?!” “It’s from his grandma! Just—take it!” A child tugged Bill’s pant leg. “Mister, my balloon died.” Bill froze. “I—uh—commend your loss. Very noble. Float free, Steve.” Molly darted over, grabbed the bag, and whispered, “Run. Now. Before someone sings the alphabet song and we lose our edge.” They sprinted toward the fence, duffel bag flapping like a wounded swan. Behind them, the clown waved serenely, holding a squeaky parrot that seemed to wink.
created_at
"2026-03-25T10:28:52.37Z"
3/25/2026, 10:28:52 AM
model
"qwen/qwen3-235b-a22b-07-25"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
67
moderation_latency
(null)
generation_time
9730
tokens_prompt
77
tokens_completion
954
native_tokens_prompt
72
native_tokens_completion
815
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000887
router
(null)
provider_responses
0
endpoint_id
"043eb28f-fc28-4b0d-9800-b249fcfcfbf9"
is_byok
false
latency
8
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"Parasail"
status
429
1
endpoint_id
"c9f927db-d559-49eb-9846-80862c5e3b4e"
id
"chatcmpl-9a07c3d22166b33c"
is_byok
false
latency
28
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"WandB"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434532-IoWtrePTRzbx2d7e2fwW"
upstream_id
"chatcmpl-9a07c3d22166b33c"
total_cost
0.0000887
cache_discount
(null)
upstream_inference_cost
0
provider_name
"WandB"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
530
0.00%
Dialogue to Total Word Ratio
Ratio: 74.11%, Deviation: 24.11%
neededClean
false
wordsTotal
533
wordsDialogue
395
0.0152%