NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ7QRFT87DG0CAJGF6MFY27
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
15.0s
Cost:
$0.000204
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her tiara, which read *“Birthday Queen”* in glitter, while Bill fidgeted with a balloon shaped like a frowning unicycle. “You’re supposed to *look* excited,” she hissed, motioning toward the bounce house spewing confetti. “Not like you’ve never stolen six hundred dollars’ worth of toddler toys before.” Bill mopped sweat from his brow, his costume—piñata stripes and a disproportionate sombrero—swishing like a jailhouse fashion disaster. “What if someone spots us? Kids recognize voices. Or… or the cake’s rigged with alarms?” Molly rolled her eyes, slathering artificial tan over her goatee. “Relax. The parents are too busy snapping selfies with the balloon artist to notice a piñata walking off with a Jake and the Never Land Pirates DVD. Now, follow the ‘Special Surprise’ sign. Target’s under the bouncy castle—glittery box, blue ribbon.” A toddler plowed into Bill, clutching a half-eaten cupcake. “Are you the magic man?” she bleated, smearing jam on his cheek. “Y-yeah! Watch this!” Bill yanked the unicycle balloon, triggering a hiss and a limp noodle of a deflating snake. Molly dragged him away. “Stick to the plan. When the clown starts his ‘Big Surprise,’ we grab the gifts and vanish before someone realizes the piñata’s already been burst.” Bill adjusted his sombrero, eyeing a parent dragging a disco ball. “What if we get stuck in the bounce house with a mascot?” “Then you’re gonna have the worst career day ever,” Molly muttered, as the clown wheeled out a glitter cannon.
created_at
"2026-03-25T10:13:12.08Z"
3/25/2026, 10:13:12 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1158
moderation_latency
(null)
generation_time
14220
tokens_prompt
93
tokens_completion
804
native_tokens_prompt
73
native_tokens_completion
708
native_tokens_completion_images
(null)
native_tokens_reasoning
382
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00020408
router
(null)
provider_responses
0
endpoint_id
"6b8c829d-3094-45e7-8139-0a67e09060c3"
id
"cmpl-3f5d498ab2d14d83b74970324660d5cc"
is_byok
false
latency
53
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774433592-nNIvVijqvOIaXrN3jQRh"
upstream_id
"cmpl-3f5d498ab2d14d83b74970324660d5cc"
total_cost
0.00020408
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
246
0.00%
Dialogue to Total Word Ratio
Ratio: 53.04%, Deviation: 43.04%
neededClean
false
wordsTotal
247
wordsDialogue
131
0.0000%