NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8V2PZ2KNETQQD8J5SGXA4M
Run Details
Model:
Aion 2.0
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
26.9s
Cost:
$0.001747
Provider:
AionLabs
Evaluation summary
Score
Evaluator
Details Link
99.38%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.6885%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse was cold, but the glow from the laptop illuminated Molly’s face with a zealous light. Blueprints were taped to a rolling whiteboard, not of a bank vault, but of a suburban backyard. Bill stared, a half-eaten donut suspended in his hand. “This,” Molly announced, tapping the screen, “is the target. The Henderson residence. Saturday, 2 PM. A six-year-old’s birthday party.” Bill slowly finished his donut. “You’ve lost it. The last job was a jewelry store.” “And it had lasers, Bill! Lasers! This has a bounce house. It’s a lateral move. Now, the primary objective is the gift table, here, under the oak tree. Secondary objective is the cash from the card pile, usually on the kitchen counter near the gluten-free cupcakes.” “What’s our exit strategy? Fleeing on a Big Wheel?” “We blend in,” Molly said, pulling two garish costumes from a duffel. “I’ll be ‘Sunny the Clown.’ You’ll be my assistant, ‘Bubbles.’” Bill held up the tiny, sparkly vest. “I’m not ‘Bubbles.’ And I don’t do balloon animals.” “You won’t have to. Your job is to create a diversion at the piñata strike. I’ve calculated the sugar-high crash window will be around 3:15 PM. That’s when you trigger the incident.” “What incident?” “The great ‘Clown Nose Mishap.’” She held up a large, red foam nose. “I’ve rigged it with a small smoke pellet. You ‘sneeze,’ it pops, releases smoke and a confetti burst. While the little monsters are confused, I sweep the gifts into my oversized clown car, which will be parked by the side gate.” Bill rubbed his temples. “The loot. It’s gonna be Lego sets and stuffed unicorns.” “And cash, Bill! Modern parents give cash. Also, high-end educational toys. We fence the electronics to that guy in the alley behind the comic shop. He’s got a nephew.” “What about the parents? The dads in polo shirts talking about grills?” “Distracted. Always distracted. One will be manning the grill, arguing with propane. Another will be trying to assemble a ‘some assembly required’ rocket ship. The mothers will be in a circle discussing school districts. They are not a threat. The only real variable is the birthday boy himself, one ‘Liam.’ He’s described as ‘spirited.’” “A six-year-old?” “Intel suggests he bites. That’s why you get the nose. He goes for the shiny thing.” Molly pointed to a detailed schedule. “We go in at 2:30, perform for twenty minutes, execute the diversion at 3:15, exfiltrate by 3:30. We’re back here counting small, unmarked bills by four.” Bill stared at the blueprint, then at the clown wig. “The plan is airtight. It’s also the most depressing thing I’ve ever heard.” “It’s a heist, Bill. The stakes are lower, but the percentage is higher. No alarms, no guards, just cake and chaos. What do you say?” A long silence followed, broken only by the drip of a distant pipe. Bill sighed, reaching for the sparkly vest. “Do I have to be Bubbles? Can I be like, ‘Blasto’ or something?” “Bubbles is non-negotiable. It’s on the waiver.”
created_at
"2026-02-24T22:07:25.676Z"
2/24/2026, 10:07:25 PM
model
"aion-labs/aion-2.0-20260223"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2131
moderation_latency
(null)
generation_time
26811
tokens_prompt
77
tokens_completion
1111
native_tokens_prompt
170
native_tokens_completion
1007
native_tokens_completion_images
(null)
native_tokens_reasoning
272
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0017472
router
(null)
provider_responses
0
endpoint_id
"1e93b0bb-eb0f-49bc-8fda-947105413b86"
id
"be7f3207-f398-4d75-b0d7-11f1d5a5bada"
is_byok
false
latency
169
model_permaslug
"aion-labs/aion-2.0-20260223"
provider_name
"AionLabs"
status
200
api_type
"completions"
id
"gen-1771970845-sLejj8ILLiLbSNlk699D"
upstream_id
"be7f3207-f398-4d75-b0d7-11f1d5a5bada"
total_cost
0.0017472
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AionLabs"
Evaluation details
Result
Evaluator
Details
Meta Data
99.38%
Matches word count
n/a
neededClean
false
words
505
0.00%
Dialogue to Total Word Ratio
Ratio: 77.04%, Deviation: 27.04%
neededClean
false
wordsTotal
514
wordsDialogue
396
49.6885%