NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFTG5R7MYHQV549839SNRT
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
5.2s
Cost:
$0.000133
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
22.66%
Dialogue to Total Word Ratio
(details)
11.3300%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the fence, eyes fixed on the brightly colored balloons bobbing above the backyard. "Bill, look at that. Pure loot. All those presents under the tree, just waiting for us." Bill adjusted his glasses, peering through the gap in the fence. "Molly, it's a children's party. The presents are... well, presents. Kids don't have safes. They have... wrapping paper and hope." "Hope is a commodity, Bill. And these kids are full of it. Think of the haul! Action figures, board games, maybe even that limited edition unicorn plushie everyone's talking about." Molly grinned, already imagining the cash. Bill sighed, rubbing his temples. "The security is... lax. But the noise? The chaos? The sheer volume of tiny humans? It's like trying to steal from a beehive with a butterfly net." "Perfect! We blend in. We're just another set of parents, right? We'll wear these ridiculous clown wigs we found in the attic." Molly held up a fuzzy orange wig. "They'll never suspect a thing." Bill stared at the wig. "Molly, we're not clowns. Clowns are terrifying. Kids cry at clowns. We'll be the villains in their nightmares." "Exactly! We're the villains! We'll be the scary clowns who steal their birthday presents! It's genius!" Molly bounced on her heels. "We hit the tree, grab the loot, and vanish before the cake is even cut. Easy peasy." "Easy peasy," Bill repeated, deadpan. "And what about the parents? They're the real security detail. They'll be hovering like hawks. And the cake. We can't just... walk off with the cake. That's... that's cannibalism, Molly." Molly waved a dismissive hand. "Cake is just sugar and frosting. We'll grab the presents, leave the cake. They'll never notice. It's a heist, Bill. It's about the presents. The cake is... collateral damage." Bill pinched the bridge of his nose. "Molly, this is insane. Stealing from children? It's... it's morally reprehensible. And logistically... the noise. The crying. The sheer... innocence. It's overwhelming." Molly grinned, already moving towards the gate. "Morally reprehensible? Bill, it's a children's party! They have more presents than sense! It's practically a public service. Now, come on. The loot isn't going to steal itself. And I really want that unicorn plushie."
created_at
"2026-02-17T09:45:13.703559+00:00"
2/17/2026, 9:45:13 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
64
moderation_latency
(null)
generation_time
5151
tokens_prompt
77
tokens_completion
1021
native_tokens_prompt
70
native_tokens_completion
866
native_tokens_completion_images
(null)
native_tokens_reasoning
437
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00013305
router
(null)
provider_responses
0
id
"84d3475b740f4f09bc542c1b768886bc"
status
200
is_byok
false
latency
64
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321508-cGMtCuPlYs1wueLADSP9"
upstream_id
"84d3475b740f4f09bc542c1b768886bc"
total_cost
0.00013305
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
366
22.66%
Dialogue to Total Word Ratio
Ratio: 78.96%, Deviation: 11.04%
neededClean
false
wordsTotal
366
wordsDialogue
289
11.3300%