NC Bench
Run: 2026-02-24-01KJ8V64GJX6KN33MHNVEQ47P9
Run Details
Model: Aion 2.0
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 22.8s
Cost: $0.001518
Provider: AionLabs
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The rusted van smelled of stale fries and regret. Molly pointed a laser pointer at a hand-drawn map taped to the wall. “This is the operational area. The ‘target repository’ is here, under the papier-mâché volcano.” Bill squinted, scratching his stubble. “The living room?” “The living room. Primary ingress is via the sliding patio door, here. It’s reported to have a faulty lock.” “Reported by who?” “My niece, Chloe. She’s seven. Her intel on snack placement is impeccable, so we trust the lock assessment.” Molly switched to a blueprint of the house. “The main variable is the patrol pattern of the adults. They tend to cluster around the guacamole. We’ll have a ninety-second window during the ‘Piñata Protocol’ when all visual attention is directed at a suspended donkey.” Bill nodded, taking notes on a napkin. “What’s the extraction point?” “We load the goods into the empty gift bags we’ll be carrying, exit via the front door, and blend into the departing parent demographic. Our cover is we’re from the catering company, ‘Clown Adjacent.’” She handed him a rainbow wig. Bill stared at it. “I’m not wearing the wig.” “It’s non-negotiable. You’re ‘Bubbles.’ I’m ‘Sparkles.’ We’re assessing party satisfaction.” “The loot, Mol. What’s the estimated haul?” Molly sighed, flipping to a spreadsheet. “Based on the registry and demographic, we’re looking at a mix of plastic novelties, educational puzzles, and at least four copies of the same bestselling graphic novel. We fence it all to that guy with the kiosk in the mall. He gives us twenty cents on the dollar.” “Twenty cents?!” Bill threw his hands up. “We’re risking a B&E for some used action figures and a gift card to ‘Tiny Tumblers’?” “It’s a volume business, Bill! The Smythe birthday next week is a double-header. Twins. That’s twice the gifts.” “What’s our contingency if the small, screaming witnesses get suspicious?” “We deploy the distraction.” Molly held up a sack. “Confetti-filled eggs. Creates chaos, covers our retreat. 
But only as a last resort. The goal is silent, smooth, and merry.” Bill slumped against the van wall, the rainbow wig in his lap. “I just… I dreamed of bigger scores. The Bellagio. The Crown Jewels.” “We all start somewhere,” Molly said, patting his shoulder. “Consider this your training wheels. Now, let’s run the timings again. At ‘happy birthday,’ you’re on guacamole surveillance. I’ll be verifying the lock…”
created_at
"2026-02-24T22:09:17.853Z"
2/24/2026, 10:09:17 PM
model
"aion-labs/aion-2.0-20260223"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1828
moderation_latency
(null)
generation_time
22730
tokens_prompt
77
tokens_completion
997
native_tokens_prompt
170
native_tokens_completion
864
native_tokens_completion_images
(null)
native_tokens_reasoning
304
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0015184
router
(null)
provider_responses
0
endpoint_id
"1e93b0bb-eb0f-49bc-8fda-947105413b86"
id
"c7bada33-779c-456e-9c8f-b7f38c3f47cf"
is_byok
false
latency
96
model_permaslug
"aion-labs/aion-2.0-20260223"
provider_name
"AionLabs"
status
200
api_type
"completions"
id
"gen-1771970957-qQwMR5g1dZzNoOtHFZ23"
upstream_id
"c7bada33-779c-456e-9c8f-b7f38c3f47cf"
total_cost
0.0015184
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AionLabs"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 395
0.00% | Dialogue to Total Word Ratio | Ratio: 78.50%, Deviation: 28.50% | neededClean: false, wordsTotal: 400, wordsDialogue: 314
0.0000% | Overall
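The figures reported by the Dialogue to Total Word Ratio evaluator can be reproduced from the counts in its metadata: 314 dialogue words out of 400 total gives 78.50%, which is 28.50 percentage points away from the 50% target. The Python sketch below shows that arithmetic, plus one hypothetical way to count dialogue words by treating everything inside double quotation marks as dialogue. The function names, the quote-matching rule, and the whitespace tokenization are assumptions for illustration, not NC Bench's actual evaluator; the two evaluators above report 395 and 400 words for the same response, so their real counting rules evidently differ. How the deviation maps to the 0.00% score is not shown on this page and is not modeled here.

```python
import re

# Hypothetical helper: counts words inside straight or curly double quotes.
# This is an assumed rule, not the benchmark's real implementation.
def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) using whitespace tokenization."""
    total = len(text.split())
    spans = re.findall(r'[“"]([^“”"]*)[”"]', text)
    dialogue = sum(len(span.split()) for span in spans)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


def deviation_from_target(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(ratio - target)


# Counts taken from the evaluator metadata on this page.
words_dialogue = 314
words_total = 400

reported_ratio = words_dialogue / words_total
print(f"Ratio: {reported_ratio:.2%}")                             # Ratio: 78.50%
print(f"Deviation: {deviation_from_target(reported_ratio):.2%}")  # Deviation: 28.50%
```

Running `dialogue_ratio` on the model response itself may not return exactly 314 and 400, since the result depends on how quotes and punctuation are tokenized.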