Run: 2026-02-24-01KJ8W39VEQZA6292GANMDPT7H

Run Details

Model:
Aion 2.0
Duration:
38.6s
Cost:
$0.001907
Provider:
AionLabs

Evaluation summary

Score Evaluator
Details Link
93.65% Matches word count (details)
0.00% Dialogue to Total Word Ratio (details)
46.8248% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly peered through high-powered binoculars from their idling van. The target house was a symphony of pastel balloons and a squealing inflatable castle. "You're absolutely sure the parents are vacated?" "Positive," Bill said, adjusting his ridiculous rainbow wig and rubber nose. "The invitation said 'Parents Night Out Party.' It's just a dozen sugared-up six-year-olds and a teenage sitter named Kaitlyn who will be on her phone." "So the extraction point is the bounce house? Seriously?" "Think about it," Bill said, tapping the crude map. "It's the perfect blind spot. I go in as Clown Bill. I'll cause a sensory overload with my 'amazing' disappearing rabbit act—" "The one where you just drop it and pretend to look under your hat?" "Technical difficulties are part of the magic! You, as Magician Molly, will be 'making presents disappear' into the giant trick bag. We're in and out before they finish the cake." Molly sighed, staring at her sequined cape. "The Frankfurt diamond job was less degrading." "Think of the haul! Plastic dinosaurs, endless gift cards, those little overpriced building bricks. It's an untapped, unguarded goldmine." "Fine," Molly muttered, pulling on her top hat. "But if one of them calls me 'mommy' or starts to cry, I'm leaving you with the bag."
created_at"2026-02-24T22:25:13.592Z" 2/24/2026, 10:25:13 PM
model"aion-labs/aion-2.0-20260223"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency2126
moderation_latency(null)
generation_time38495
tokens_prompt77
tokens_completion1258
native_tokens_prompt170
native_tokens_completion1107
native_tokens_completion_images(null)
native_tokens_reasoning922
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0019072
router(null)
provider_responses
0
endpoint_id"1e93b0bb-eb0f-49bc-8fda-947105413b86"
id"a7e280b3-838f-436e-a370-8ebb2b501f31"
is_byokfalse
latency169
model_permaslug"aion-labs/aion-2.0-20260223"
provider_name"AionLabs"
status200
api_type"completions"
id"gen-1771971913-w4Xs1SYPHnvD73IvE00j"
upstream_id"a7e280b3-838f-436e-a370-8ebb2b501f31"
total_cost0.0019072
cache_discount(null)
upstream_inference_cost0
provider_name"AionLabs"

Evaluation details

Result Evaluator Details Meta Data
93.65% Matches word count n/a
neededCleanfalse
words209
0.00% Dialogue to Total Word Ratio
Ratio: 74.65%, Deviation: 24.65%
neededCleanfalse
wordsTotal213
wordsDialogue159
46.8248%