Run: 2026-02-24-01KJ8TRJW01NKDFBTXHN8T307P

Run Details

Model: Aion 2.0
Duration: 18.2s
Cost: $0.001213
Provider: AionLabs

Evaluation summary

Score      Evaluator
20.19%     Matches word count
0.00%      Dialogue to Total Word Ratio
10.0948%   Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The rusty van smelled of old pizza and regret. Molly spread a crayon-drawn map across the steering wheel. "The mark is Timothy Henderson, age six. Party starts at two. Loot is centralized in the living room, under the My Little Pony piñata." Bill peered at the map. "The intel's from your niece? It's in glitter glue." "It's solid. The adults will be distracted by 'Mystical Marvin' during the cake ceremony. That’s our window." "A magician?" Bill paled. "My uncle was a magician. Those guys see everything." "Not if we're part of the entertainment." Molly produced two rainbow wigs and oversized shoes. "We infiltrate as the party clowns, 'Binky and Bonk.'" "I hate clowns," Bill muttered, pulling on a polka-dot sleeve. "What's the exfil strategy?" "Through the side gate, loaded into the giant balloon animal bag." She tossed him a red nose. "Remember, no taking candy from the baby. It's unprofessional." Bill sighed, the nose squeaking pathetically. "Just once, I'd like to rob a bank." "Where's the challenge in that? Now, let's run the timings again. Cake smash is at 2:45..."
created_at: "2026-02-24T22:01:53.802Z" (2/24/2026, 10:01:53 PM)
model: "aion-labs/aion-2.0-20260223"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2185
moderation_latency: (null)
generation_time: 18105
tokens_prompt: 77
tokens_completion: 723
native_tokens_prompt: 170
native_tokens_completion: 673
native_tokens_completion_images: (null)
native_tokens_reasoning: 435
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0012128
router: (null)
provider_responses:
  0:
    endpoint_id: "1e93b0bb-eb0f-49bc-8fda-947105413b86"
    id: "323b8fc0-c096-4af5-9eb5-cf5615081ece"
    is_byok: false
    latency: 115
    model_permaslug: "aion-labs/aion-2.0-20260223"
    provider_name: "AionLabs"
    status: 200
api_type: "completions"
id: "gen-1771970513-p6GyRzC7HVDh8uqZ5xIt"
upstream_id: "323b8fc0-c096-4af5-9eb5-cf5615081ece"
total_cost: 0.0012128
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "AionLabs"

Evaluation details

Result: 20.19%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false, words: 180

Result: 0.00%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 70.33%, Deviation: 20.33%
Meta Data: neededClean: false, wordsTotal: 182, wordsDialogue: 128

Overall: 10.0948%
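
The dialogue figures above can be re-derived from the reported counts. The Python sketch below is a sanity check, not the benchmark's actual evaluator: `dialogue_ratio` is a hypothetical helper that treats anything inside straight double quotes as dialogue, and the 50% target is taken from the prompt. Averaging the two evaluator scores to get the overall figure is an assumption that happens to be consistent with the reported 10.0948%; how the evaluator maps a 20.33% deviation to a 0.00% score is not shown in the report and is not modelled here.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue words, total words, ratio) for a scene.

    Hypothetical helper: words are whitespace-separated tokens and dialogue
    is whatever sits between straight double quotes. The benchmark's real
    tokenization rules are not shown in the report.
    """
    total = len(text.split())
    spoken = sum(len(chunk.split()) for chunk in re.findall(r'"([^"]*)"', text))
    return spoken, total, (spoken / total if total else 0.0)


# Counts reported in the evaluation meta data.
words_dialogue = 128
words_total = 182
target_ratio = 0.50  # from the prompt: "50% words of dialogue"

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 70.33%, Deviation: 20.33%"

# Assumption: the overall score is the unweighted mean of the evaluator scores.
scores = [20.19, 0.00]
overall = sum(scores) / len(scores)
print(f"Overall (mean of rounded scores): {overall:.3f}%")
```

The recomputed mean uses the rounded 20.19% score, so it can differ from the reported overall value in the last digits.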