NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8VSCXQV269ZZ3WKJ0F3WZ1
Run Details
Model:
Aion 2.0
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
28.0s
Cost:
$0.001782
Provider:
AionLabs
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
22.29%
Dialogue to Total Word Ratio
(details)
61.1056%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air in the van was thick with the smell of old coffee and desperation. Molly, her eyes gleaming behind a pair of rhinestone-studded reading glasses, tapped a detailed schematic spread across the fold-down table. “This,” she declared, “is the motherlode. The Smythe-Vanderbilt birthday extravaganza. Six-year-old Timmy. His father is in novelty whoopee cushions. The net worth of gifts in that sunroom is… staggering.” Bill, a man who looked permanently like he’d just bitten into a lemon, squinted at the drawing. “You dragged me here at dawn to plot the theft of a six-year-old’s Legos?” “Not *just* Legos, Bill. Think bigger. We’re looking at premium, organic, gluten-free play-doh. A miniature electric car. A signed rookie card from that kid’s show hamster, ‘Binky.’ This is a high-value target.” “It’s a backyard party. With a bouncy castle.” “Which provides perfect acoustic cover!” Molly pointed a laser pen at the blueprint. “Our window is during the piñata frenzy. Chaos. Sugar-fueled anarchy. You, dressed as ‘Binky the Hamster,’ will create a secondary diversion by ‘accidentally’ setting the clown’s pants on fire.” Bill stared. “I’m not wearing a giant hamster costume.” “It’s a key part of the plan! I’ve already procured it. It’s… ventilated.” “How do you even set a clown’s pants on fire?” “A small, theatrical smoke pellet and a whoopee cushion filled with lighter fluid. Timmy’s dad’s connections finally pay off.” She beamed. “While the adults are dealing with a flaming, shrieking clown, I slip in through the side gate, which will be unlatched because the caterer is secretly my cousin Cheryl, load the presents into a gift bag-lined wheeled cooler, and exfiltrate via the neighbor’s hedge.” Bill massaged his temples. “A wheeled cooler.” “Disguised as a beverage cart! Genius, right?” “What about the kids? They’re tiny, but they’re observant. And vicious.” “We have counter-measures.” She held up a bag of glitter. “Stage two: the glitter bomb. Deployed at the mouth of the bouncy castle. It’s the herpes of the craft world. They’ll be finding it until they’re thirty. Total sensory overload. They won’t notice a thing.” “And the exit strategy?” “Cheryl drops us at the park. We switch vehicles. The presents get unwrapped at the secondary location—my grandma’s garage. She thinks we’re running a charity toy drive. We fence the big-ticket items online. The rest, we unload at a flea market two towns over. Clean. Elegant.” Bill studied the blueprint again, a slow, grudging respect dawning on his face. “The hamster thing is non-negotiable?” “The Binky suit is the linchpin.” He sighed, a long, suffering sound. “Fine. But I want a bigger cut. Hazard pay for rodent-based espionage and potential arson charges.” “Sixty-forty.” “Deal.” He leaned back. “What’s the contingency if the hamster head falls off?” Molly’s smile didn’t waver. “Improvise. Squeak loudly and run. But it won’t. I used industrial-grade Velcro. This is going to be smooth, Bill. Our biggest score yet.” Outside the van, a bird chirped. Somewhere, a six-year-old’s future happiness was being meticulously plotted, one absurd detail at a time.
created_at
"2026-02-24T22:19:49.058Z"
2/24/2026, 10:19:49 PM
model
"aion-labs/aion-2.0-20260223"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1941
moderation_latency
(null)
generation_time
27922
tokens_prompt
77
tokens_completion
1134
native_tokens_prompt
170
native_tokens_completion
1029
native_tokens_completion_images
(null)
native_tokens_reasoning
242
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0017824
router
(null)
provider_responses
0
endpoint_id
"1e93b0bb-eb0f-49bc-8fda-947105413b86"
id
"9cf672bd-05b8-4f9c-98bb-69020f962e5e"
is_byok
false
latency
132
model_permaslug
"aion-labs/aion-2.0-20260223"
provider_name
"AionLabs"
status
200
api_type
"completions"
id
"gen-1771971589-cxYnyOGgT919KiMoYfFi"
upstream_id
"9cf672bd-05b8-4f9c-98bb-69020f962e5e"
total_cost
0.0017824
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AionLabs"
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
503
22.29%
Dialogue to Total Word Ratio
Ratio: 75.48%, Deviation: 5.48%
neededClean
false
wordsTotal
522
wordsDialogue
394
61.1056%