NC Bench
Run: 2026-02-24-01KJ8TKBSPT7SVMTT0XHJ5J99A
Run Details
Model:
Aion 2.0
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
26.6s
Cost:
$0.001555
Provider:
AionLabs
Evaluation summary
Score     Evaluator
0.00%     Matches word count
0.00%     Dialogue to Total Word Ratio
0.0000%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse loft smelled of failure and old pizza. Molly traced a pudgy finger across the blueprints, which were, in fact, a brightly colored map of “Sunny Meadows Estate” drawn in crayon. Bill, perched on a milk crate, polished a grappling hook with the intensity of a neurosurgeon. “The primary ingress is here,” Molly said, pointing to a wobbly rectangle labeled ‘FRONT DOOR.’ “During the peak distraction window: the magician’s act. Specifically, the moment he pulls the doves from his sleeve. Universal awe pattern in the six-to-eight demographic is ninety-seven percent.” Bill nodded, squinting. “Security?” “One adult male, ‘Dad.’ Likely preoccupied with grill flare-ups. One adult female, ‘Mom.’ Threat level elevated if the paper plates run out. The real variable is the Labrador retriever. Code name: Buster. He’s a licker, not a biter, but his enthusiasm could cause a noise complaint.” “Understood. Exfil?” Molly flipped the page to a diagram of the backyard. “We go over the fence into the Andersons’ yard. They’re at their lake house. Their gnome population is dense, but passive.” Bill leaned back, a grin spreading. “The loot?” “Standard haul. Plastic construction kits, age-inappropriate glitter-based arts and crafts, several units of ‘educational’ wooden toys. The jackpot is the oversized stuffed unicorn. It’s a limited edition ‘Glitterhoof.’ Secondary market value is insane on mommy forums.” “Beautiful.” Bill’s eyes gleamed. “What’s the catch?” Molly sighed, pushing her glasses up. “The cake. It’s a multi-tiered fondant fortress. The birthday boy, ‘Timmy,’ has a documented history of violent meltdowns if his baked goods are disturbed. The presents are stacked in a makeshift strongroom adjacent to the cake table. One errant elbow, and we have a screaming, frosting-covered liability.” Bill considered this, tapping his chin. “We need a cake-based diversion. Could we trigger a pre-emptive collapse? Remote-controlled rodent?” “Too messy. 
I propose we exploit the magician. I’ve studied his online reviews. He’s a hack. His ‘disappearing coin’ trick fails sixty percent of the time, causing toddler incredulity. We simply ensure failure during his finale. The ensuing chaos will be our curtain.” Bill stood, stretching. “I’ll prep the get-away wagon.” He gestured to a battered ice cream truck parked in the shadows, its cheerful jingle box rewired to play the theme from *Mission: Impossible* at a low hum. Molly rolled up the crayon blueprint. “Remember, Bill. This isn’t just about the loot. It’s about sending a message.” “Yeah? What message?” “That no one’s birthday is safe.” She allowed herself a small, grim smile. “And that we really need the money for rent.”
created_at
"2026-02-24T21:59:03.082Z"
2/24/2026, 9:59:03 PM
model
"aion-labs/aion-2.0-20260223"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2788
moderation_latency
(null)
generation_time
26118
tokens_prompt
77
tokens_completion
1034
native_tokens_prompt
170
native_tokens_completion
887
native_tokens_completion_images
(null)
native_tokens_reasoning
282
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0015552
router
(null)
provider_responses
0
endpoint_id
"1e93b0bb-eb0f-49bc-8fda-947105413b86"
id
"dd429026-3778-403b-b8b6-b840c30eadb0"
is_byok
false
latency
467
model_permaslug
"aion-labs/aion-2.0-20260223"
provider_name
"AionLabs"
status
200
api_type
"completions"
id
"gen-1771970343-1mDdIMkVRaWy6pIl8Uv7"
upstream_id
"dd429026-3778-403b-b8b6-b840c30eadb0"
total_cost
0.0015552
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AionLabs"
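The Meta fields above are enough to estimate generation throughput. The sketch below is illustrative only: it assumes generation_time is in milliseconds and that native_tokens_completion is the right token count to divide by. Neither is stated on this page, and the helper name is hypothetical.

```python
# Values copied from the Meta fields above.
# ASSUMPTION: generation_time is reported in milliseconds.
GENERATION_TIME_MS = 26118
NATIVE_TOKENS_COMPLETION = 887


def tokens_per_second(tokens: int, elapsed_ms: float) -> float:
    """Return completion throughput in tokens per second (hypothetical helper)."""
    if elapsed_ms <= 0:
        raise ValueError("elapsed_ms must be positive")
    return tokens / (elapsed_ms / 1000.0)


throughput = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)
print(f"{throughput:.2f} tokens/s")
```

Under those assumptions the run comes out at roughly 34 completion tokens per second.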
Evaluation details
Result    Evaluator                      Details                            Meta Data
0.00%     Matches word count             n/a                                neededClean: false, words: 420
0.00%     Dialogue to Total Word Ratio   Ratio: 62.79%, Deviation: 32.79%   neededClean: false, wordsTotal: 430, wordsDialogue: 270
0.0000%
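The reported Ratio and Deviation can be reproduced from wordsDialogue, wordsTotal, and the 30% target in the scenario. The sketch below is a minimal illustration, not the benchmark's evaluator: the function names are hypothetical, and counting words inside curly double quotes as dialogue is an assumption about how dialogue might be detected. How the deviation maps to the 0.00% score is not shown on this page, so it is not modelled.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: "500 words with 30% dialogue"


def count_dialogue_words(text: str) -> int:
    """Count words inside curly double quotes (ASSUMED dialogue rule)."""
    spans = re.findall(r"“([^”]*)”", text)
    return sum(len(span.split()) for span in spans)


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that are dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


def deviation(ratio: float, target: float = TARGET_RATIO) -> float:
    """Absolute distance between the observed ratio and the target."""
    return abs(ratio - target)


# Values copied from the Meta Data fields above.
ratio = dialogue_ratio(270, 430)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation(ratio):.2%}")

# Small usage example for the quote-based counter.
sample = "“Security?” Bill asked. “One adult male.”"
print(count_dialogue_words(sample), len(sample.split()))
```

With 270 dialogue words out of 430, the ratio is about 62.79%, which is 32.79 percentage points above the 30% target, matching the Details column.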