NC Bench
Run: 2026-02-24-01KJ8VQF1TESX8B1GWTKYK3BCV
Run Details
Model: Aion 2.0
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 31.2s
Cost: $0.002056
Provider: AionLabs
Evaluation summary
Score      Evaluator
1.04%      Matches word count
0.00%      Dialogue to Total Word Ratio
0.5180%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The "war room" was, in fact, Bill's kitchenette. A map of the suburban neighborhood was taped to the fridge, obscured slightly by a magnet shaped like a piece of pizza. Molly, her eyes gleaming with the fervor of a general before Waterloo, pointed a laser pen at a printout of a backyard layout. The red dot trembled over a sketch of a bouncy castle. “This,” she declared, “is the primary ingress point. We go over the fence at the rear, here, during the peak chaos of the piñata ritual. Parental attention will be fragmented.” Bill stirred his cold coffee with a grimace. “Peak chaos. Right. You mean when twenty six-year-olds are screaming because a boy named Aiden got the big Snickers bar.” “Precisely. A perfect storm of avarice and tears.” Molly moved the dot to a small square labeled ‘SHED’. “The presents will be here, temporarily stored before the cake ceremony. We hit fast, we hit quiet. In and out in ninety seconds.” “And the K-9 unit?” Bill asked, nodding toward a circled area on the map marked ‘BUSTER’. “The Yorkshire Terrier.” Molly slid a second sheet across the laminate table. It was a detailed dossier on the dog, complete with a grainy photo of a creature that could fit in a teacup. “Profile: food-motivated, easily distracted. Countermeasure: a high-velocity launch of mini-sausages over the eastern fence. He’ll be occupied for the duration.” Bill massaged his temples. “What about the hired entertainment? Intel says a ‘Jolly Jangles the Clown’ is confirmed.” “A known variable.” Molly produced a small, garish flower from her pocket and pinned it to her dark sweater. “This is a modified squirting flower. Contains a concentrated solution of glitter and temporary hair dye. I’ll bump into him near the punch bowl, activate the device. His subsequent meltdown will draw all adult focus. He’ll be a sparkly, pink-haired distraction.” “We’re gonna steal toys from a bunch of kids, Molly.” Bill’s voice was flat. 
“We’re gonna tranq a terrier and terrorize a clown to steal a pile of action figures and plush unicorns.” Molly’s expression was one of profound disappointment. “Bill, this isn’t about the merchandise. This is about the principle. The sheer audacity. The flawless execution. Besides,” she added, consulting a clipboard, “the birthday boy, Oliver, is getting a new tablet. That’s the score. The rest is window dressing.” “And the exit strategy?” “Through the side gate, into the waiting ice cream truck. It’s the perfect camouflage. No one suspects the ice cream man.” “We don’t have an ice cream truck.” “I rented one.” Molly beamed. “It’s vanilla. Very inconspicuous.” Bill stared at the map, at the dossier on the terrier, at the glitter flower. He took a long, slow sip of his terrible coffee. “What time does the piñata ritual commence?” Molly’s smile was triumphant. “Three-fifteen. Wear something festive. But dark.”
created_at: "2026-02-24T22:18:45.698Z" (2/24/2026, 10:18:45 PM)
model: "aion-labs/aion-2.0-20260223"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2246
moderation_latency: (null)
generation_time: 31146
tokens_prompt: 77
tokens_completion: 1367
native_tokens_prompt: 170
native_tokens_completion: 1200
native_tokens_completion_images: (null)
native_tokens_reasoning: 576
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.002056
router: (null)
provider_responses:
  0:
    endpoint_id: "1e93b0bb-eb0f-49bc-8fda-947105413b86"
    id: "17f34bb9-cbc7-4789-8980-a971f6d6fcb5"
    is_byok: false
    latency: 97
    model_permaslug: "aion-labs/aion-2.0-20260223"
    provider_name: "AionLabs"
    status: 200
api_type: "completions"
id: "gen-1771971525-tSuYvWX8YAjtQCJAbMyV"
upstream_id: "17f34bb9-cbc7-4789-8980-a971f6d6fcb5"
total_cost: 0.002056
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "AionLabs"
Evaluation details
Matches word count
  Result: 1.04%
  Details: n/a
  Meta Data: neededClean: false, words: 474
Dialogue to Total Word Ratio
  Result: 0.00%
  Details: Ratio: 60.42%, Deviation: 30.42%
  Meta Data: neededClean: false, wordsTotal: 480, wordsDialogue: 290
Overall: 0.5180%
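The reported ratio and deviation can be reproduced from the wordsDialogue and wordsTotal counts above. The following is a minimal sketch, not NC Bench's actual evaluator: the function name is hypothetical, and it assumes the deviation is the absolute difference, in percentage points, between the measured ratio and the 30% target from the scenario. It does not attempt to reproduce the 0.00% score, since the scoring formula is not shown on this page.

```python
def dialogue_ratio_report(words_dialogue: int, words_total: int, target_pct: float) -> tuple[float, float]:
    """Return (ratio, deviation) as percentages rounded to two decimals.

    Assumption: deviation is |ratio - target| in percentage points.
    """
    ratio_pct = 100.0 * words_dialogue / words_total
    deviation_pct = abs(ratio_pct - target_pct)
    return round(ratio_pct, 2), round(deviation_pct, 2)


# Counts taken from the "Dialogue to Total Word Ratio" metadata; 30% is the scenario target.
ratio, deviation = dialogue_ratio_report(290, 480, 30.0)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
```

With 290 dialogue words out of 480, the response is roughly twice as dialogue-heavy as the 30% the prompt asked for.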