NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WKMHFY95Q7FQD1Q03CFQ2
Run Details
Model:
Aion 2.0
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
23.7s
Cost:
$0.001550
Provider:
AionLabs
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
10.0948%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse loft smelled of dust and discarded ambition. Molly, peering through binoculars from their makeshift observation post, lowered them with a sigh. “Target is a six-year-old named Timmy. Birthday party, Saturday, three PM. Backyard of the Henderson residence.” Bill, meticulously sketching on a whiteboard, didn’t look up. “Security?” “Formidable. A hyper-vigilant mother, one sleep-deprived father, and twelve sugar-fueled minors. The cake is a chocolate volcano.” “Weapons?” “Squeaky hammers. Possibly a foam sword.” Molly tossed a grainy photo onto the table. “Primary obstacle: a golden retriever named Biscuit. Affectionate, not aggressive. Likely to compromise our position with licks.” Bill finally turned, his face grave. “Licks are a wild card. Unpredictable. We’ll need a distraction.” He tapped the whiteboard, which displayed a detailed schematic of a suburban backyard. “Phase One: Infiltration. We pose as clowns from ‘Happy Hank’s Party Pandemonium.’ Hank is currently… detained.” “You shoved him in a locker, didn’t you?” “A utility closet. He’ll be fine after the helium wears off.” Bill pointed to a drawing of a present stack. “Phase Two: Acquisition. During the peak chaos of the piñata assault, we switch the real presents with these.” He gestured to a duffel bag. Molly unzipped it to reveal identically wrapped boxes. She lifted one. It was suspiciously light. “What’s in these?” “Coal. Well, painted rocks. It’s symbolic. The switch will be clean. They won’t notice until we’re exfiltrating via the bounce house.” Molly pinched the bridge of her nose. “The bounce house. Your escape vector is an inflatable castle.” “It’s adjacent to the fence line! A well-placed bounce provides the vertical lift needed to clear the six-foot barrier. It’s elegant.” “It’s deranged. What about the payload? What are we even looking at? Action figures? Building blocks?” Bill consulted his notes. “Intel suggests a high-end tablet, a junior archaeology kit, and a disturbing number of gift cards. The big score is a limited-edition, talking ‘Captain Starfighter’ doll. It’s the crown jewel of this entire operation.” “A doll. We’re risking a felony for a doll that says ‘To infinity and slightly beyond!’” “It’s the principle, Molly! The take is clean, untraceable. Who’s going to suspect two clowns? We melt into the suburban tapestry, vanish before the first tearful child realizes their new soccer ball is a painted rock.” Molly stared at the photo of little Timmy, mid-giggle, oblivious to the impending criminal enterprise. “I just… I have standards. We stole the Crown Jewels of Liechtenstein.” “And we almost got pecked to death by the ceremonial swans,” Bill reminded her. “This is a softer target. Lower stakes.” “Higher chance of being vomited on by a child who ate too much cake.” “A risk we mitigate with the clown costume’s non-porous polymer coating.” Bill offered her a red foam nose. “So? Are you in?” Molly took the nose, giving it a melancholy squeeze. It honked softly. “The things we do for retirement.”
created_at
"2026-02-24T22:34:08.823Z"
2/24/2026, 10:34:08 PM
model
"aion-labs/aion-2.0-20260223"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2191
moderation_latency
(null)
generation_time
23680
tokens_prompt
77
tokens_completion
1024
native_tokens_prompt
170
native_tokens_completion
884
native_tokens_completion_images
(null)
native_tokens_reasoning
169
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0015504
router
(null)
provider_responses
0
endpoint_id
"1e93b0bb-eb0f-49bc-8fda-947105413b86"
id
"f5fdbc9f-06d2-4e90-a934-a5795a604c4d"
is_byok
false
latency
124
model_permaslug
"aion-labs/aion-2.0-20260223"
provider_name
"AionLabs"
status
200
api_type
"completions"
id
"gen-1771972448-iTj4p2rEPw8De7gJziYh"
upstream_id
"f5fdbc9f-06d2-4e90-a934-a5795a604c4d"
total_cost
0.0015504
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AionLabs"
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
480
0.00%
Dialogue to Total Word Ratio
Ratio: 71.89%, Deviation: 21.89%
neededClean
false
wordsTotal
491
wordsDialogue
353
10.0948%