NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8VDWNW1HP0Z93WARRY12R
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
48.0s
Cost:
$0.000407
Provider:
Nebius
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
43.1902%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned back in her creaky chair, twirling a lock of cherry-red hair around her finger. The flickering bulb above the cluttered warehouse table cast shadows over her blueprint of a suburban backyard. "Alright, Bill. The tree’s under the patio umbrella. Presents are stacked in a rainbow. Our window’s between the bounce house and the piñata. Simple in, simple out." Bill adjusted his wire-rimmed glasses, squinting at the blueprint. "Wait, wait—rainbow *stacked*? You’re telling me there’s a fucking *color scheme* involved here? I thought we were robbing a house, not planning a damn wedding." "Focus," Molly snapped, tossing him a pair of neon-green gloves. "Disguises are key. You’re the overly enthusiastic party clown. I’m the mom who brought ‘diapers’ but really brought Tanner’s getaway van. Keep the kids distracted with balloon animals or whatever it is you do." "I don’t *do* balloon animals," Bill muttered, stuffing the gloves into his pocket. "I do… spreadsheets. I once turned my uncle’s garage sale into a Microsoft Excel tutorial. Kids might not be into that." Molly groaned. "Just juggle the damn cupcakes. The real work’s on my end. I’ll disable the security system—motion sensors, cameras, that obnoxious ‘Happy Birthday’ fog machine—" "Wait, there’s a fog machine?" Bill’s voice climbed an octave. "Why would anyone need a fog machine for a *seven-year-old’s* party?" "Maybe the parents are secret rockstars," Molly said, shrugging. "Point is, I’ll handle tech. You handle… whatever non-technical things you handle. Just don’t let the kids near the tree. And for God’s sake, *don’t* try to befriend the puppy." "The puppy’s in on this?" "Yes, Bill, the puppy’s the *inside man*. Now shut up and eat your ‘mission briefing’ sandwich." She slid a PB&J across the table. Bill took a bite, then froze. "Molly. What if there’s a *kid* attached to the presents? Like, the actual birthday boy? You can’t just yank his toys mid-party." "Relax. The kid’s getting a new PS5 tomorrow. We’re just… redistributing. Besides, he’ll be too busy chasing the decoy drone I programmed to look like a giant, flying cupcake." "A *cupcake*?" "Tastes like guilt," she said, grinning. "Now suit up. We leave in ten." --- Ten minutes later, Bill stood in Molly’s doorway, sweat soaking through his clown shirt. His red wig slid over one eye. "I look like I just escaped a mental hospital. How’s your ‘mom’ costume?" Molly emerged in a sleek black jacket, a diaper pack strapped to her hip. "I’m leaning into the chaos. You just need to be loud and ridiculous. Oh, and *don’t* mention the balloon animal incident of ’03." "The one where I accidentally created a社会主义 commune in Arizona?" "Top secret. Now move your ass. The cupcakes juggling itself." Outside, the suburban yard buzzed with laughter and the thump of a ukulele. Bill gulped. "Molly, what if they *recognize* me?" "You’re a ghost, Bill. A *clown*. Now go make some noise." He shuffled toward the bounce house, tripped over a garden gnome, and face-planted into a tray of cupcakes. Molly facepalmed. *This is gonna be a long night.*
created_at
"2026-03-25T10:32:40.602Z"
3/25/2026, 10:32:40 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
299
moderation_latency
(null)
generation_time
37805
tokens_prompt
93
tokens_completion
1511
native_tokens_prompt
73
native_tokens_completion
1333
native_tokens_completion_images
(null)
native_tokens_reasoning
655
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0004072
router
(null)
provider_responses
0
endpoint_id
"6f9e09e5-e836-4de0-a27d-27c74b866001"
is_byok
false
latency
10093
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"SiliconFlow"
status
502
1
endpoint_id
"aaa39b6c-e488-491a-91e4-e5fb66f86601"
id
"cmpl-a7ef1f6f771a6e7a"
is_byok
false
latency
299
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"Nebius"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434760-AWNZDVniwXzuh8qehXK2"
upstream_id
"cmpl-a7ef1f6f771a6e7a"
total_cost
0.0004072
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Nebius"
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
511
0.00%
Dialogue to Total Word Ratio
Ratio: 67.83%, Deviation: 17.83%
neededClean
false
wordsTotal
516
wordsDialogue
350
43.1902%