NC Bench
Run: 2026-03-25-01KMJ7QRFVJT6QEVN2KEDK167X
Run Details
Model:
Qwen 3 32B
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
13.2s
Cost:
$0.000417
Provider:
AtlasCloud
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
"Alright, so here's the plan," said Molly, adjusting the ridiculous feather boa she was wearing for "camouflage." "We waltz in pretending we're lost parents who left their strollers at Starbucks. Easy, right?" Bill squinted from behind a pair of oversized sunglasses. "What if they do fingerprint checks? I mean, it *is* a toddler’s birthday. You know—tiny plastic toys and even tinier hope." Molly rolled her eyes. "They’re not looking for two adults smuggling mini C4 into cupcake boxes. Just smile, act confused, and grab the loot when you least expect it." Bill nodded slowly, then pointed at her hat, which had “It’s My Birthday!” embroidered on it in glitter glue. “Do you really think this disguise is foolproof or just fool-pretty?” "Fool-pretty is my middle name," Molly said. "And no one questions someone with glitter authority." Bill looked down at his tutu, which doubled as a makeshift fanny pack. “I still think we could’ve gone with bakers.” “Nope,” Molly said, tapping the party invitation like it was a mission briefing. “Bakers come after setup is done. We need to be part of the event.” Bill sighed. “So, just to recap, the heist involves cupcakes, disco balls, and a kid with a microphone yelling about dinosaurs?” “Exactly,” Molly said, grinning. “And once we’re inside, no funny stuff. This is a **children's** event.” Bill gave a mock salute. "Sir, yes, sir," he said in a squeaky voice before bursting out laughing.
created_at
"2026-03-25T10:13:12.124Z"
3/25/2026, 10:13:12 AM
model
"qwen/qwen3-32b-04-28"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
819
moderation_latency
(null)
generation_time
12807
tokens_prompt
93
tokens_completion
393
native_tokens_prompt
76
native_tokens_completion
341
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0004168
router
(null)
provider_responses
0
endpoint_id
"2693bf9a-1732-4008-acc9-d241214e936c"
id
"e28a70ad82f6470a98d0460bbc4ba6a5"
is_byok
false
latency
819
model_permaslug
"qwen/qwen3-32b-04-28"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774433592-ltXlBMqE2jLEKpHeoxfE"
upstream_id
"e28a70ad82f6470a98d0460bbc4ba6a5"
total_cost
0.0004168
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 240
0.00% | Dialogue to Total Word Ratio | Ratio: 64.05%, Deviation: 25.95% | neededClean: false, wordsTotal: 242, wordsDialogue: 155
0.0000% | Overall
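The ratio and deviation figures above can be reproduced from the reported counts. The sketch below is a minimal illustration, not NC Bench's actual evaluator: the helper names (`count_words`, `dialogue_ratio`, `deviation_from_target`), the quote-matching regex, and the whitespace-based word count are all assumptions, and the real evaluator may tokenize or detect dialogue differently.

```python
import re

# Hypothetical pattern (an assumption, not the benchmark's own rule):
# text between a straight or curly opening quote and the next closing quote.
QUOTED = re.compile(r'["“]([^"“”]*)["”]')


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_ratio(text: str) -> float:
    """Share of words that fall inside quotation marks."""
    total = count_words(text)
    if total == 0:
        return 0.0
    dialogue = sum(count_words(span) for span in QUOTED.findall(text))
    return dialogue / total


def deviation_from_target(ratio: float, target: float = 0.90) -> float:
    """Absolute distance between the measured ratio and the requested one."""
    return abs(target - ratio)


# Counts reported in the evaluation details: wordsDialogue / wordsTotal.
reported_ratio = 155 / 242
print(f"Ratio: {reported_ratio:.2%}")                             # Ratio: 64.05%
print(f"Deviation: {deviation_from_target(reported_ratio):.2%}")  # Deviation: 25.95%
```

With 155 dialogue words out of 242, the ratio is about 64%, roughly 26 points short of the requested 90%, which is consistent with the 0.00% score for this evaluator. The response also ran to 240 words against a requested 200.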