NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8VFM4DT3347GZ89W3XV7WC
Run Details
Model:
Aion 2.0
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
31.6s
Cost:
$0.002027
Provider:
AionLabs
Evaluation summary
Score
Evaluator
Details Link
2.01%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
1.0058%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse loft smelled of dust and discarded dreams, but on the rickety card table, Molly had laid out the blueprint for perfection. Or, at least, a detailed crayon diagram of the Henderson backyard. Bill, a mountain of a man who could crack a safe but not a smile, squinted at the drawing. “This is the primary ingress,” Molly said, tapping a point marked ‘GATE.’ Her finger, adorned with a plastic spider ring from a previous job, moved with precision. “During the peak distraction event.” “The what now?” “The piñata strike, Bill. The moment the donkey’s head is obliterated, twelve sugar-fueled assets will descend into a frenzy. That’s our window. You’ll be here,” she pointed to a shaded area labeled ‘SHRUBBERY – BILL,’ “providing lookout for the primary guardian.” “The dad with the grill spatula?” “The very same. He’s a wild card. Last year, he successfully applied sunscreen to a moving toddler. The man has focus.” Molly adjusted her glasses. “I will proceed to the target zone.” Her finger landed on a picnic table draped with a red-and-white checkered cloth. “The loot will be here. Initial reconnaissance suggests a mix of soft goods and hard plastics. No electronics. It’s a lean year.” Bill scratched his stubble. “What about the secondary guardian? The one with the… vocal projections.” He mimicked a high-pitched voice. “‘Timothy, don’t put sand in the punch!’” “Aunt Cheryl. She’s perimeter security, but her attention is divided. She’s also in charge of the ‘goodie bags,’ which creates a predictable pattern of movement to the gift table for replenishment. We use her.” Bill leaned in, his bulk causing the table to creak. “Extraction?” Molly produced a second, smaller drawing. “The Henderson’s garden shed. It backs onto the alley. We load the goods into these.” She pointed to two objects she’d sketched beside it. “Are those… red wagons?” “High-capacity, all-terrain transport modules. Common backyard asset. Invisible in plain sight. We pull them right out the back gate during the cake-and-singing ritual. Maximum emotional distraction. The mark will be weeping if he doesn’t get the first slice with the blue tractor on it.” Bill was silent for a long moment, studying the crayon topography. “The big pink thing in the corner. That the target?” “That’s the bouncy castle, Bill. It’s a non-factor unless there’s a catastrophic deflation. Then, it becomes a smothering hazard. We’ll have to abort.” He nodded slowly, the gravity of the operation settling on him. “So, we’re stealing toys from a six-year-old.” Molly fixed him with a steely gaze. “We’re acquiring unsecured assets from a low-risk, high-yield environment. The wrapping paper alone is a nightmare of crinkly audio signatures. This is a professional challenge.” She rolled up the diagrams with a swift, decisive motion. “We go in at 3:15. Wear something festive but non-descript. And for God’s sake, don’t step on the squeaky turtle.”
created_at
"2026-02-24T22:14:28.758Z"
2/24/2026, 10:14:28 PM
model
"aion-labs/aion-2.0-20260223"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2392
moderation_latency
(null)
generation_time
31479
tokens_prompt
77
tokens_completion
1374
native_tokens_prompt
170
native_tokens_completion
1182
native_tokens_completion_images
(null)
native_tokens_reasoning
560
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0020272
router
(null)
provider_responses
0
endpoint_id
"1e93b0bb-eb0f-49bc-8fda-947105413b86"
id
"88fb8be6-b5a0-4efe-8dfb-e5518fc8e9d0"
is_byok
false
latency
186
model_permaslug
"aion-labs/aion-2.0-20260223"
provider_name
"AionLabs"
status
200
api_type
"completions"
id
"gen-1771971268-YildrDTVVuxeIBHx3YL7"
upstream_id
"88fb8be6-b5a0-4efe-8dfb-e5518fc8e9d0"
total_cost
0.0020272
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AionLabs"
Evaluation details
Result
Evaluator
Details
Meta Data
2.01%
Matches word count
n/a
neededClean
false
words
475
0.00%
Dialogue to Total Word Ratio
Ratio: 66.60%, Deviation: 36.60%
neededClean
false
wordsTotal
488
wordsDialogue
325
1.0058%