Run: 2026-02-24-01KJ8V4N55YZDJJ6Q0E8NK1KPH

Run Details

Model: Aion 2.0
Duration: 25.0s
Cost: $0.001763
Provider: AionLabs

Evaluation summary

Score      Evaluator
99.74%     Matches word count
17.39%     Dialogue to Total Word Ratio
58.5695%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The back booth of the diner was shrouded in coffee steam. Molly stirred her cup, staring at the napkin where Bill had drawn a crude map of a backyard. “Okay,” Bill said, tapping the napkin. “The target is a six-year-old named Timmy. His perimeter is soft, but the guards are hyper-vigilant.” “You mean his mom drinks a lot of iced tea and makes frequent trips inside,” Molly said flatly. “Precisely. That’s our window. We go in during the cake chaos. Everyone’s sugared up, distracted.” “To steal what? A pile of poorly wrapped Lego sets and a gift card to Chuck E. Cheese?” “It’s the principle, Moll. The score is the score. Now, extraction is key. We blend into the party, then exit via the side gate.” “Dressed as clowns? You said no clowns.” “Magicians. Higher tip potential on the way out. Look, the real prize is the giant stuffed panda from the grandparents. That’s the centerpiece. We need a vehicle with a large trunk.” Molly sipped her coffee. “I can’t believe I’m doing a five-year stretch for a panda that smells like grandma.” “Think of the secondary market,” Bill whispered. “The piñata alone is a goldmine.”
created_at: "2026-02-24T22:08:29.359Z" (2/24/2026, 10:08:29 PM)
model: "aion-labs/aion-2.0-20260223"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2094
moderation_latency: (null)
generation_time: 24979
tokens_prompt: 77
tokens_completion: 1106
native_tokens_prompt: 170
native_tokens_completion: 1017
native_tokens_completion_images: (null)
native_tokens_reasoning: 788
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0017632
router: (null)
provider_responses:
  0:
    endpoint_id: "1e93b0bb-eb0f-49bc-8fda-947105413b86"
    id: "a6bb3891-655c-4e9c-ae53-4a3f8493d9c0"
    is_byok: false
    latency: 126
    model_permaslug: "aion-labs/aion-2.0-20260223"
    provider_name: "AionLabs"
    status: 200
api_type: "completions"
id: "gen-1771970909-WBVB0BaEVTw2RoK97bMj"
upstream_id: "a6bb3891-655c-4e9c-ae53-4a3f8493d9c0"
total_cost: 0.0017632
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "AionLabs"
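
The token and timing fields above can be combined into a rough throughput figure. The following is a minimal sketch, not part of the benchmark itself. It assumes that generation_time is in milliseconds (which would agree with the 25.0s duration reported under Run Details) and that native_tokens_completion includes the reasoning tokens. Neither assumption is confirmed by the page, and the variable names are illustrative only.

```python
# Values copied from the run metadata.
generation_time_ms = 24979          # assumed to be milliseconds
native_tokens_completion = 1017
native_tokens_reasoning = 788

# Assumption: reasoning tokens are counted inside the completion total.
visible_tokens = native_tokens_completion - native_tokens_reasoning
reasoning_share = native_tokens_reasoning / native_tokens_completion

# Completion tokens generated per second of generation time.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"visible tokens: {visible_tokens}")
print(f"reasoning share: {reasoning_share:.1%}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions, most of the completion budget went to reasoning rather than to the 200-word scene itself.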

Evaluation details

Result     Evaluator                      Details                            Meta Data
99.74%     Matches word count             n/a                                neededClean: false, words: 196
17.39%     Dialogue to Total Word Ratio   Ratio: 78.50%, Deviation: 11.50%   neededClean: false, wordsTotal: 200, wordsDialogue: 157
58.5695%   Overall
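
The ratio and deviation reported for the dialogue evaluator can be reproduced from the metadata counts. The sketch below assumes the deviation is the absolute gap between the measured ratio and the 90% target stated in the prompt. It also checks whether the overall score is roughly the unweighted mean of the two evaluator scores. That averaging rule is an assumption, as is the mapping from an 11.50% deviation to the 17.39% score, which the page does not explain and which is not modeled here.

```python
# Counts copied from the evaluator metadata.
words_total = 200
words_dialogue = 157
target_ratio = 0.90  # "90% words of dialogue" from the prompt

ratio = words_dialogue / words_total
deviation = abs(target_ratio - ratio)

# Assumption: the overall score is an unweighted mean of the evaluator scores.
scores = [99.74, 17.39]  # rounded values as displayed on the page
overall_estimate = sum(scores) / len(scores)

print(f"ratio: {ratio:.2%}, deviation: {deviation:.2%}")
print(f"overall estimate from rounded scores: {overall_estimate:.4f}")
```

The estimate from the rounded scores lands close to the reported 58.5695%. A small gap is expected if the page averages unrounded scores.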