NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-25-01KMJ886X5BYNAYS42NY1H4F43
Run Details
Model:
Qwen3 235B A22B Instruct 2507
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
21.8s
Cost:
$0.000786
Provider:
AtlasCloud
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0018%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The glow of a flickering streetlamp illuminated the alley behind Chuckles the Clown’s Birthday Emporium. Bill knelt, fiddling with a lockpick and squinting at the back door. Molly leaned against a soggy cardboard box labeled “Party Hats – Do Not Sit.” Molly: “You know this is the dumbest plan you’ve ever had, right?” Bill: “It’s not dumb. It’s inspired. Birthday parties mean presents. Presents mean cash. Cash means no more eating discount yogurt with a spoon we found in the dumpster.” Molly: “We’re stealing from kids, Bill. Little kids. There’s gotta be a line.” Bill: “There is a line. And we crossed it when you sold my grandmother’s dentures to that vampire cult. This is practically charity. Think of all the trauma those kids have waiting until opening time. We’re speeding up the process.” Molly: “You’re insane. Also, you’re terrible with locks.” Bill: (grunting) “It’s a novelty door! It has a clown-shaped handle. How am I supposed to know which pick fits a clown nose?” Molly: “You should’ve brought the clown nose pick!” Bill: “We don’t *have* a clown nose pick! That’s not a thing!” Molly: “Clearly it *is* a thing! That’s an *actual* nose-shaped keyhole!” Bill: (sighs, drops lockpicks) “Fine. Plan B.” Molly: “There’s a Plan B?” Bill: “Always. I brought a rock.” Molly: “You can’t throw a rock at a door in a children’s party store. What if a kid sees you?” Bill: “Then I wave and say ‘trick or treat!’” Molly: “It’s Tuesday.” Bill: “So? I’m festive.” Molly: “Festive?! Bill, there are balloons tied to the door. And a sign that says ‘Timmy’s 6th Birthday Bash – No Strangers Allowed.’ We are strangers!” Bill: “We’re not strangers. We’re invited guests.” Molly: “We are *not* on the guest list.” Bill: (pulls out two lanyards with fake IDs) “We are now. Meet Barry and Melinda from accounting. We brought a fruit basket.” Molly: “That’s a stolen fruit basket from the gas station down the street.” Bill: “Exactly. It’s authentic.” Molly: “I can’t believe I’m saying this, but your plan might be worse than the lockpick idea.” Bill: “Relax. Kids are easily distracted. They’ll be too busy screaming over piñatas to notice us scooping up wrapped boxes.” Molly: “What if the parents notice?” Bill: “Parents don’t pay attention. They’re either on their phones or arguing about who forgot the napkins. It’s chaos. Perfect cover.” Molly: “And when we get caught covered in glitter and holding seven Transformers?” Bill: “We say we’re undercover clowns. Very method.” Molly: “You’re wearing cargo shorts and a t-shirt that says ‘I Paused My Game to Be Here.’” Bill: “Even better. Parents will think I’m committed to the bit.” Molly: “I hate you. I really do.” Bill: “You love me. You just don’t know it yet. Now, put on the lanyard. And for God’s sake, don’t eat any candy. Last time you ate free candy, you hallucinated a squirrel offered you stock tips.” Molly: “That squirrel was very convincing.” Bill: “It was a rat with a tiny briefcase.” Molly: (muttering) “Still gave better advice than you…” Bill: “Smile, Melinda. We’re here to celebrate Timmy, not steal his future.” Molly: “It’s *Molly*, you lunatic.” Bill: “No, it’s Melinda now. And I’m Barry. Remember, we work in accounting. And we *love* cake.” Molly: “I hate cake.” Bill: “Not anymore, Melinda. Not anymore.”
created_at
"2026-03-25T10:22:10.857Z"
3/25/2026, 10:22:10 AM
model
"qwen/qwen3-235b-a22b-07-25"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
713
moderation_latency
(null)
generation_time
21748
tokens_prompt
77
tokens_completion
993
native_tokens_prompt
72
native_tokens_completion
877
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00078616
router
(null)
provider_responses
0
endpoint_id
"aaae5602-0949-4bc9-bd77-588b4b285341"
id
"8f5422553e024cb283dc29303061dc2d"
is_byok
false
latency
713
model_permaslug
"qwen/qwen3-235b-a22b-07-25"
provider_name
"AtlasCloud"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1774434130-dVqNqZXQIwFiGhgvqzfE"
upstream_id
"8f5422553e024cb283dc29303061dc2d"
total_cost
0.00078616
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
552
0.00%
Dialogue to Total Word Ratio
Ratio: 84.30%, Deviation: 14.30%
neededClean
false
wordsTotal
554
wordsDialogue
467
0.0018%