NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFT1PN40P5YR03P3C1K1BP
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
6.7s
Cost:
$0.000160
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
81.27%
Matches word count
(details)
8.34%
Dialogue to Total Word Ratio
(details)
44.8045%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slammed her fist on the table, sending a half-eaten cupcake flying. "Bill, you're missing the point entirely! It's a CHILDREN'S party! Think of the chaos potential!" Bill adjusted his glasses, peering over the rim. "Molly, I appreciate the... enthusiasm, but we're professional thieves. Stealing from a nursery is... ethically questionable. And logistically insane." "Logistically insane?" Molly scoffed, leaning in conspiratorially. "That's the beauty of it! Kids are everywhere, parents are distracted, the security is a drunk uncle with a whistle. It's a gold mine!" Bill sighed, rubbing his temples. "A gold mine guarded by sugar-fueled tornadoes. And what about the presents? They're wrapped in shiny paper, probably booby-trapped with glitter bombs." "Glitter bombs are just... extra," Molly declared, her eyes gleaming. "Think of the sparkle! And the sheer audacity! We walk in, grab the pile, and vanish before anyone realizes their Elmo doll is missing!" Bill pushed his chair back. "I'm not sure 'vanishing' is the right word. I picture us tripping over a bouncy castle and being tackled by a horde of angry toddlers." Molly grinned, unfazed. "Perfect! Distract them with the chaos, we make our move. Simple." "Simple?" Bill repeated, incredulous. "Simple is stealing a wallet from an unlocked locker. This involves... this involves..." "Planning!" Molly finished for him, bouncing on her heels. "Yes! Planning! We need a distraction. Something big. A... a giant balloon animal? No, too obvious. A... a sudden burst of confetti cannons?" Bill pinched the bridge of his nose. "Confetti cannons? Molly, we're not robbing a bank. We're robbing a birthday party. The noise alone would wake the dead... and the parents." Molly's face lit up. "Exactly! Wake the dead! And the parents! They'll be so busy trying to calm the kids down, they won't notice us loading the presents into the van!" Bill stared at her, a slow, horrified realization dawning. "You're suggesting we... we create a scene so chaotic, so utterly ridiculous, that the parents are too busy dealing with it to notice their children's birthday loot being carted off?" Molly nodded vigorously. "Bingo! It's foolproof! We just need the right... catalyst." Bill rubbed his temples again, this time with more force. "A catalyst? Like what? A fire alarm? That might be... excessive." Molly tapped her chin thoughtfully. "What about... a piƱata? We break it early, spill candy everywhere, kids go wild. Perfect distraction!" Bill groaned. "Molly, we're not going to... we can't... that's... that's child endangerment!" Molly leaned back, triumphant. "See? I knew you'd come around! It's a win-win! We get the presents, the kids get candy, parents get a story for therapy. What's not to love?" Bill stared at the ceiling, his mind racing. "The sheer... the sheer... the absolute, mind-boggling stupidity of it all. But... the presents. The shiny, wrapped presents. The potential... the sheer... potential." Molly clapped her hands together. "That's the spirit, Bill! Now, who's bringing the glitter bombs?"
created_at
"2026-02-17T09:45:00.742195+00:00"
2/17/2026, 9:45:00 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
68
moderation_latency
(null)
generation_time
6549
tokens_prompt
77
tokens_completion
1266
native_tokens_prompt
70
native_tokens_completion
1049
native_tokens_completion_images
(null)
native_tokens_reasoning
463
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001605
router
(null)
provider_responses
0
id
"180db5745b92452a9669ab9764f65a36"
status
200
is_byok
false
latency
66
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321493-IFCd4ZkUlBu1R0nUYNkL"
upstream_id
"180db5745b92452a9669ab9764f65a36"
total_cost
0.0001605
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
81.27%
Matches word count
n/a
neededClean
false
words
488
8.34%
Dialogue to Total Word Ratio
Ratio: 77.05%, Deviation: 7.05%
neededClean
false
wordsTotal
488
wordsDialogue
376
44.8045%