NC Bench
Run: 2026-03-25-01KMJ7VFQ36WVDTY2DYPZ3M2H6
Run Details
Model: Qwen 3 32B
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 10.3s
Cost: $0.000469
Provider: Alibaba
Evaluation summary
Score      Evaluator
0.00%      Matches word count
75.42%     Dialogue to Total Word Ratio
37.7084%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly crouched behind a plastic castle at the edge of the inflatable moon bounce, adjusting her glittery unicorn costume. Bill, dressed as a talking balloon animal in a sequined lion suit, wriggled next to her, his tail popping unexpectedly in the quiet. *"Ten percent rule,"* she muttered, scanning the chaos of the birthday bash. Parents sipped lukewarm punch, kids chased a rogue puppy, and a piñata—stuffed with the glittering treasure they needed—hung like a siren’s call near the snacks. Bill whispered, “What if someone mistakes me for décor?” He bobbled dramatically, the sequins clacking. Molly elbowed him. “Then they’ll thank you for adding *flair.* Remember: distraction, retrieval, vanish before the cake.” The plan was elegant, absurdly fragile. Bill would juggle rubber chickens to create havoc mid-SpongeBob dance-off. Molly would leap for the piñata, her handkerchief coiled around a picklock tool. But then the party clown—a purple one with a whistle sharp enough to split atoms—herded toddlers toward karaoke, and Bill’s chicken slipped, bonking a boy with a balloon helmet. “I’m sorry, Your Highness!” he gasped, as the kid wailed. Molly lunged at the piñata, but her unicorn horn snagged a banner, unraveling a tinsel waterfall. She yanked free, sending a disco ball crashing. Presents tumbled. “*Now* we’re distractions,” Bill said, fumbling for a balloon sword. “Run!” Molly hissed, dodging a glitter bomb thrown by the birthday girl. They bolted, herding balloons as a shield, until the backyard door slammed behind them. Bill stared at the loot bag. “No cake?” “It’s a children’s party,” Molly groaned. “They’re not paying for it.” “I was hoping for cupcake-sized emergency funds.” The puppy burst past them, trailing confetti. They sprinted, the unicorn’s tail dragging like a banner from a parade gone rogue.
created_at: "2026-03-25T10:15:13.895Z" (3/25/2026, 10:15:13 AM)
model: "qwen/qwen3-32b-04-28"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 368
moderation_latency: (null)
generation_time: 10296
tokens_prompt: 93
tokens_completion: 1306
native_tokens_prompt: 72
native_tokens_completion: 1109
native_tokens_completion_images: (null)
native_tokens_reasoning: 665
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.000468832
router: (null)
provider_responses:
  0:
    endpoint_id: "344ccdcc-6e59-464d-89ab-8e5b92fe2adc"
    id: "chatcmpl-df52f435-95f2-9822-9faf-707a32b6138c"
    is_byok: false
    latency: 366
    model_permaslug: "qwen/qwen3-32b-04-28"
    provider_name: "Alibaba"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1774433713-v4zVzVM5CetLBjxJxrDz"
upstream_id: "chatcmpl-df52f435-95f2-9822-9faf-707a32b6138c"
total_cost: 0.000468832
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Alibaba"
Evaluation details
Result     Evaluator                      Details                           Meta Data
0.00%      Matches word count             n/a                               neededClean: false; words: 292
75.42%     Dialogue to Total Word Ratio   Ratio: 17.29%, Deviation: 7.29%   neededClean: false; wordsTotal: 295; wordsDialogue: 51
37.7084%   Overall
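
The ratio and deviation reported above can be reproduced from the evaluator's own counts. The sketch below is illustrative only: the function names are hypothetical, the 10% target is taken from the scenario description, and treating the overall score as an unweighted mean of the two evaluator scores is an assumption that happens to match the reported figures. How the 75.42% score is derived from the 7.29% deviation is not stated on this page, so it is taken as given rather than recomputed.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that appear inside dialogue."""
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float) -> float:
    """Absolute distance between the observed ratio and the target ratio."""
    return abs(ratio - target)


# Counts reported in the evaluation details (wordsDialogue, wordsTotal).
ratio = dialogue_ratio(51, 295)

# Target of 10% dialogue comes from the scenario description.
deviation = deviation_from_target(ratio, 0.10)

# Assumption: the overall score is the unweighted mean of the two evaluator
# scores (0.00% and 75.42%). The page does not document this formula.
overall = (0.0 + 75.42) / 2

print(f"Ratio: {ratio:.2%}")          # Ratio: 17.29%
print(f"Deviation: {deviation:.2%}")  # Deviation: 7.29%
print(f"Overall: {overall:.2f}%")     # Overall: 37.71%
```

The mean of the rounded scores gives 37.71%, while the page shows 37.7084%, which suggests the overall figure is computed from unrounded evaluator scores.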