NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ8S6109JTN1T4TS24PC402
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
23.4s
Cost:
$0.000395
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.01%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0049%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her inflatable flamingo costume, squinting at the bouncy castle across the yard where a cacophony of giggles and balloon animals erupted. Bill, dressed as a sentient cupcake with sparkly icing glue still smudged on his cheek, tugged at his fondant collar. “So let me get this straight,” Bill said, voice muffled by the cupcake headpiece. “We’re breaking into a *kid’s birthday party*? For *toys*? What’s the worst that could happen? A balloon pops? A toddler steals our spotlight? I’ll be famous in the underworld *and* the preschool circuit.” “We’re not *breaking in*,” Molly hissed, peering through the flamingo’s beak. “We’re *integrating*. You’re the ‘surprise entertainment.’ I’m the ‘accidental structural collapse.’ See that purple tent? That’s where the presents are. Ugly, neon-wrapped monstrosities I’ve seen from here. We swap them with the decoys in my wig’s compartment. Simple.” Bill wobbled on his cupcake feet. “Decoys? Molly, that’s *glue*. I can still taste the glitter. How are we supposed to vanish without a ‘oh no, a balloon storm’ excuse? Because I am *so* stealing that.” “Stick to the plan,” Molly said, pulling a walkie-talkie from her feathered tail. “When the magician starts his stupid rabbit trick, you trigger the fog machine. I’ll dive into the bouncy castle, chaos happens, we bolt. Easy.” Bill stared at the magician across the yard, currently sawing a child in half. “What if someone *not* a娃娃 realizes we’re not part of the decor? What if someone asks me about my *stability*?” He wobbled again. “I can’t even *bounce* without crying. I once fell out of a bounce house and got life insurance.” “Then don’t bounce,” Molly snapped. “Just… *be* the distraction. Throw a glitter bomb if you must. Just do it *quickly*.” A toddler sprinted past, trailing a rainbow of bounce castle confetti. Bill flinched. “They’ve got *security*. In the form of a six-year-old with a ninja sword. If I go in there, I’m gonna need a *different* kind of fog machine.” Molly checked her watch. “Three minutes until the magician’s finale. You’ve got two to decide if you’re a coward or acriminal. Or a terrifying cupcake, I guess.” Bill looked at his reflection in a balloon. “I’m a terrifying cupcake. Also a coward. Also a criminal. Also a *disaster*. What if the *entertainment* is me fleeing in terror? It’s a *meta* masterpiece.” The magician’s rabbit hopped free. Chaos erupted. Molly shoved the glitter bomb into Bill’s cupcake hands. “Now. Or I’m replacing you with the flamingo.” “Fine!” Bill yelled. He lobbed the bomb. A neon mist swallowed the yard. Molly sprinted toward the bouncy castle as Bill tripped over a balloon, face-planting into a piñata. “Accidental structural collapse, coming right up!” he wheezed. Somewhere in the fog, a child screamed, “IS THAT A *ALIEN CUPCAKE*?!” Molly grinned. The plan was working. Somewhere. Probably.
created_at
"2026-03-25T10:31:27.012Z"
3/25/2026, 10:31:27 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
157
moderation_latency
(null)
generation_time
23288
tokens_prompt
93
tokens_completion
1579
native_tokens_prompt
73
native_tokens_completion
1391
native_tokens_completion_images
(null)
native_tokens_reasoning
770
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00039532
router
(null)
provider_responses
0
endpoint_id
"6b8c829d-3094-45e7-8139-0a67e09060c3"
id
"cmpl-4986c22bd5644115b8b8121ea9942187"
is_byok
false
latency
54
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434687-nWsza2VTvvP6qCy5JZ51"
upstream_id
"cmpl-4986c22bd5644115b8b8121ea9942187"
total_cost
0.00039532
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.01%
Matches word count
n/a
neededClean
false
words
469
0.00%
Dialogue to Total Word Ratio
Ratio: 62.03%, Deviation: 32.03%
neededClean
false
wordsTotal
474
wordsDialogue
294
0.0049%