NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8MXPSFZFQ23NYSJXX77CV
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
28.6s
Cost:
$0.000447
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.06%
Dialogue to Total Word Ratio
(details)
0.0324%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly crouched behind a towering piñata shaped like a frowning dragon, adjusting her glow-in-the-dark unicorn onesie. “Bill, you’re supposed to be a balloon artist, not a tornado,” she hissed, as he lugged a two-by-four wrapped in confetti paper past the cake table. Bill tugged his pirate hat over his ears, eyeing a cluster of wide-eyed toddlers. “Ain’t nothin’ funny about this gig. Kids are like tiny, snot-nosed bloodhounds.” He gestured to his belt, where a rubber sword jostled against a pocket of lockpicks. “They’re not gonna suspect a thing. We’re *characters*. You’re ‘Captain Bluutins,’ I’m ‘Rainbow McGiggles.’ Now repeat after me: *‘Want to see a magic balloon snake?’*” Bill mumbled, “‘Want to see a magical balloon syllable?’” Molly facepalmed. “Drop the ‘syllable’—they’re not here for *Shakespeare*. The gifts are in the green tins under the bounce house. I’ve got a gadget that opens child-proof lids in seconds.” “Seconds? What if someone sees us? Or worse—what if they *like* us?” “They won’t. You’ll be too busy dodging glitter bombs.” She tapped her earpiece, its antenna buzzing like a hornet. “The decorator’s hiding the Sicherheit alarms in those tins. We take the toys, disable the sensors, split before the piñata’s empty.” Bill peered at a giggling kid riding a plastic horse. “What if they throw confetti *on* us? I hear it’s sticky.” “It’s *sparkly* sticky. Just act normal.” “Normal? I’m a pirate with a chainsaw grin. How’s that normal?” “Not helping. Just… pivot to juggling. Distract them with your ‘whooshing’ noises.” Bill coughed up a lopsided balloon squid. “I’ve also got a balloon alligator. It hisses.” Molly’s eye twitched. “Fantastic. If we get caught, tell them the toy sharks are *borrowed*. Now move—guest of honor just saw us. She’s got a party favor and a suspicious mind.” As Bill waddled toward the bounce house, Molly muttered, “Next heist, we hit a *teen* party. Less glitter, more laxatives.”
created_at
"2026-03-25T10:29:07.421Z"
3/25/2026, 10:29:07 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1032
moderation_latency
(null)
generation_time
28549
tokens_prompt
93
tokens_completion
1100
native_tokens_prompt
73
native_tokens_completion
978
native_tokens_completion_images
(null)
native_tokens_reasoning
533
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0004474
router
(null)
provider_responses
0
endpoint_id
"d342f1c5-34b5-4a5c-b2ea-a0cdcbf72e70"
id
"b4c0e497-4cdf-40d7-988d-74a2adde0eb7"
is_byok
false
latency
1032
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"Novita"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434547-XjZ9uZLK10qSKIE0udDx"
upstream_id
"b4c0e497-4cdf-40d7-988d-74a2adde0eb7"
total_cost
0.0004474
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
317
0.06%
Dialogue to Total Word Ratio
Ratio: 66.46%, Deviation: 16.46%
neededClean
false
wordsTotal
325
wordsDialogue
216
0.0324%