NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNEYXKSZK3G4D4WFVTQT6XH
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
7.6s
Cost:
$0.000180
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.26%
Dialogue to Total Word Ratio
(details)
0.1291%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his gloves, peering through the scope of his modified laser pointer. "Okay, Molly. The plan is solid. We hit the bouncy castle at precisely 2:15 PM. The kids will be exhausted, the parents distracted by the cake. Perfect cover." Molly leaned against the fence, arms crossed. "Bouncy castle? Bill, that's like robbing a daycare with a bazooka. What's next? A security detail of toddlers?" "Security detail of toddlers is exactly why we need the distraction," Bill insisted, lowering the laser. "They're unpredictable. One might trip you, another might cry for mommy. Chaos is our ally." "Chaos is also why we need a getaway plan that doesn't involve a tricycle," Molly countered, watching a group of kids tumble out. "Look at that cake. It's practically begging to be stolen. But how? We can't just waltz in there." "Walzing in is the plan," Bill said, tapping his earpiece. "Disguised as... what? The clown? No, the clown is creepy. How about... the ice cream truck driver? Everyone loves ice cream." Molly snorted. "Ice cream truck driver? Bill, we don't have a van. And even if we did, the parents would mob us for 'free samples'." "Disguise is key," Bill murmured, pulling a ridiculous oversized sombrero from his bag. "This hat makes me look... vaguely Hispanic? Or maybe a pirate? Either way, it's not a clown." Molly shook her head. "You look like a rejected sombrero salesman. And the laser pointer? That's not a disguise, that's a weapon. Or a very expensive toy." "Focus, Molly. The laser is for... signaling. And the hat is for... blending in. The parents won't question a hat. They'll just think I'm eccentric." "Blending in with what? A group of parents who are all wearing sensible shoes and looking for their lost children?" Molly pointed at the crowd. "They'll notice a man in a sombrero carrying a grappling hook." Bill sighed dramatically. "The grappling hook is for the fence. The fence is... problematic. It's high, it's spiked, it's got 'Keep Out' signs. But we need the presents, Molly. The presents are the goal." "The presents are in a house, Bill. A house with a door. Why are we using a grappling hook on a fence?" Molly demanded. "Because the door is guarded by a pit bull named Sparky who looks like he's seen one too many Scooby-Doo episodes," Bill explained patiently. "The fence is the easy part. The grappling hook gets us over. Then we're in." Molly stared at the fence, then at the house, then back at Bill. "You're seriously considering this? Stealing presents from a birthday party? What's next? Robbing a bank during story time?" "Story time is when the guards are most vulnerable," Bill countered. "They're lulled by the sound of 'The Very Hungry Caterpillar'. Perfect for a quiet heist." Molly threw her hands up. "Fine. Grappling hook it is. But if Sparky bites me, I'm blaming you. And the hat. And the laser pointer. And the fact that we're wearing gloves in July." Bill adjusted his sombrero, looking determined. "Trust the plan, Molly. Chaos, disguise, distraction. It's foolproof. Now, 2:15. Ready?" Molly checked her watch. "Ready. But if we get caught, I'm blaming the hat." She gave a small, reluctant smile. "Let's go steal some birthday presents."
created_at
"2026-02-17T09:30:12.529732+00:00"
2/17/2026, 9:30:12 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
88
moderation_latency
(null)
generation_time
7514
tokens_prompt
77
tokens_completion
1391
native_tokens_prompt
70
native_tokens_completion
1181
native_tokens_completion_images
(null)
native_tokens_reasoning
558
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001803
router
(null)
provider_responses
0
id
"bc42d7cddcf34cac847e808ec0686818"
status
200
is_byok
false
latency
88
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771320604-LqdIuKNKUh3DV7tsoiwK"
upstream_id
"bc42d7cddcf34cac847e808ec0686818"
total_cost
0.0001803
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
545
0.26%
Dialogue to Total Word Ratio
Ratio: 80.92%, Deviation: 10.92%
neededClean
false
wordsTotal
545
wordsDialogue
441
0.1291%