NC Bench
Run: 2026-02-17-01KHNFXZB5E8RZSNKS3Y013EXY
Run Details
Model: Arcee AI: Trinity Mini
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 9.1s
Cost: $0.000167
Provider: Clarifai
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted at the brightly coloreddecorations. "Look at that, Bill. A whole tree full of presents. Like a giant, edible-looking target." Bill adjusted his glasses, peering through his binoculars. "Molly, we've discussed this. Stealing from a children's party is... ethically questionable. And logistically insane. Those kids are like human tripwires." "Logistically insane?" Molly scoffed. "It's a cakewalk! They're all hopped up on sugar and chaos. Who's gonna notice a couple of slightly taller, slightly more suspicious-looking folks in the corner? We just need a ladder." "A ladder?" Bill groaned. "Molly, the last time we used a ladder at a kids' party, we ended up with a piñata full of confetti down our pants. And the parents thought we were part of the entertainment." "Different ladder," Molly insisted, pointing. "That one by the cake table. It's practically begging to be used. And look, no one's guarding it. Just a sleepy clown in a tiny car." Bill sighed, lowering the binoculars. "The clown. Yes. That's the problem. That clown is a former professional thief. He's been casing this party since the cake was cut. He knows every blind spot, every hiding place. He's probably got a miniature grappling hook hidden in his oversized shoe." Molly rolled her eyes. "Bill, he's a clown. He probably just has a tiny rubber chicken. Or maybe a whoopee cushion. That's not a threat." "Watch him," Bill warned, his voice low. "He's just finished adjusting the piñata. He's looking... calculating. That's not the look of a man who finds whoopee cushions amusing." Molly shrugged. "Okay, okay. Plan B. We sneak in when the parents are all outside smoking. They'll be distracted. We grab the presents, we're out before the first 'thank you' is even said." Bill pinched the bridge of his nose. "Molly, the parents are outside because they're trying to get the kids to stop bouncing on the bouncy castle. They're not going to notice us. They'll be too busy shouting 'Sit down!' 
and 'Don't eat the grass!'" Molly grinned, already imagining the loot. "Exactly. They won't see a thing. We'll be in, we'll be out, we'll be rich. And the kids? They'll just think the presents disappeared because Santa got lost. Classic misdirection." Bill stared at her, a slow, horrified smile spreading across his face. "Santa got lost... Molly. That's the most ridiculous thing I've ever heard. And you're suggesting we use it as a cover story? That's... that's brilliant. In the worst possible way." He shook his head, a chuckle escaping. "Alright. Plan C. We dress up as the parents. We blend in. We steal the presents while they're all busy arguing about whose turn it is on the slide." Molly clapped her hands together. "Perfect! And if anyone asks, we're just the Smiths from down the street. We brought extra cupcakes." She winked. "They'll never suspect a thing."
created_at: "2026-02-17T09:47:11.228026+00:00" (2/17/2026, 9:47:11 AM)
model: "arcee-ai/trinity-mini-20251201"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 66
moderation_latency: (null)
generation_time: 9086
tokens_prompt: 77
tokens_completion: 1297
native_tokens_prompt: 70
native_tokens_completion: 1094
native_tokens_completion_images: (null)
native_tokens_reasoning: 558
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00016725
router: (null)
provider_responses: 0
id: "ef2938d7b89a4b90aa536515fbb12fc2"
status: 200
is_byok: false
latency: 66
endpoint_id: "73d82c49-90ae-4223-a860-83ed0810821f"
provider_name: "Clarifai"
model_permaslug: "arcee-ai/trinity-mini-20251201"
api_type: "completions"
id: "gen-1771321621-xG2GbtZ27mvgmQi59gxl"
upstream_id: "ef2938d7b89a4b90aa536515fbb12fc2"
total_cost: 0.00016725
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Clarifai"
Evaluation details
Matches word count: 0.00%
Details: n/a
Meta data: neededClean: false, words: 477

Dialogue to Total Word Ratio: 0.00%
Details: Ratio: 84.07%, Deviation: 34.07%
Meta data: neededClean: false, wordsTotal: 477, wordsDialogue: 401

Overall: 0.0000%
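The ratio and deviation reported above follow from the two word counts: 401 dialogue words out of 477 total is 84.07%, which is 34.07 points away from the 50% the scenario asks for. The Python sketch below shows one way such figures could be computed. It is illustrative only: the function names are hypothetical, and it assumes words are split on whitespace and that dialogue is any text inside straight double quotes. The benchmark's actual tokenization and its rule for turning a deviation into a score are not shown on this page, so neither is reproduced here.

```python
import re


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words) for a passage.

    Assumption: words are whitespace-separated tokens, and dialogue is
    everything between pairs of straight double quotes.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    return dialogue_words, total_words


def ratio_and_deviation(dialogue_words: int, total_words: int,
                        target: float = 0.5) -> tuple[float, float]:
    """Return (ratio %, absolute deviation from target in points)."""
    ratio = dialogue_words / total_words
    return round(ratio * 100, 2), round(abs(ratio - target) * 100, 2)


# Figures taken from the evaluation details of this run.
print(ratio_and_deviation(401, 477))  # (84.07, 34.07)

# Small hand-checkable sample: 7 quoted words out of 11.
sample = 'Molly grinned. "We just need a ladder." Bill groaned. "A ladder?"'
print(count_dialogue_words(sample))  # (7, 11)
```

With the counts from this run the sketch gives the same 84.07% ratio and 34.07-point deviation listed in the details. The response also ran to 477 words against a requested 200, which is consistent with the 0.00% on the word count check.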