NC Bench
Run: 2026-02-17-01KHNF6KG0B7W3CMNRVR25V3ZA
Run Details
Model: Arcee AI: Trinity Mini
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 3.7s
Cost: $0.000121
Provider: Clarifai
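The Duration and Cost figures can be cross-checked against the raw Meta fields recorded for this response (latency, generation_time, native_tokens_completion, total_cost). The sketch below is a rough illustration only. It assumes that latency and generation_time are in milliseconds and that Duration is their sum, neither of which the page states. The function names are hypothetical.

```python
# Values copied from the Meta fields of this run.
LATENCY_MS = 90                  # assumed to be milliseconds
GENERATION_TIME_MS = 3602        # assumed to be milliseconds
NATIVE_TOKENS_COMPLETION = 786
TOTAL_COST_USD = 0.00012105


def total_seconds(latency_ms: float, generation_ms: float) -> float:
    """Wall-clock estimate: time to first token plus generation time."""
    return (latency_ms + generation_ms) / 1000


def tokens_per_second(tokens: int, generation_ms: float) -> float:
    """Completion throughput over the generation window only."""
    return tokens / (generation_ms / 1000)


if __name__ == "__main__":
    duration = total_seconds(LATENCY_MS, GENERATION_TIME_MS)
    throughput = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)
    print(f"Duration: {duration:.1f}s")
    print(f"Throughput: {throughput:.1f} tokens/s")
    print(f"Cost: ${TOTAL_COST_USD:.6f}")
```

Under these assumptions the summed time rounds to the 3.7s shown under Duration, and total_cost rounded to six decimals gives the displayed Cost.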
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leanedagainst the wall, eyes wide. "Bill, this is it. The ultimate score. A room full of kids, all the presents under that tree, just... waiting." Bill adjusted his glasses, peering through the scope. "Molly, we're breaking into a birthday party. The presents are wrapped in shiny paper. Kids are going to be... *running around*." "Exactly! Chaos is our friend. We blend in. We grab the loot and vanish before anyone realizes the clown lost his nose." Bill sighed. "The clown losing his nose is the least of our worries. What about the cake? It's a three-tier monstrosity. How do we carry that?" Molly grinned, already imagining it. "We don't. We take the presents. The cake? We leave it. It's just sugar and frosting. The real treasure is the loot." "Real treasure? Molly, it's a *child's* party. The presents are toys. Maybe a few video games, but that's it. This isn't a museum heist." "Video games are treasure to a kid! And think of the resale value! Kids outgrow things fast. We hit it, sell it, retire in the Bahamas." Bill lowered the scope, rubbing his temples. "The Bahamas? Molly, we're breaking into a house. The parents are probably just in the next room. What if little Timmy sees us?" Molly waved a dismissive hand. "Timmy's too busy trying to eat the cake. We move fast. We grab the presents, we're out. Simple." Bill pointed the scope back. "Simple? What about the security? The parents. The neighbors. The... *dog*?" Molly rolled her eyes. "Bill, it's a backyard party. The dog's probably asleep. Or chasing its tail. Focus. The presents are the target. The chaos is our cover." Bill finally nodded, though reluctantly. "Fine. But if we get caught, it's your idea of 'chaos' that gets us arrested." Molly beamed. "Deal. Now, let's make this the heist of the century. Or at least the most ridiculous one."
created_at
"2026-02-17T09:34:20.103466+00:00"
2/17/2026, 9:34:20 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
90
moderation_latency
(null)
generation_time
3602
tokens_prompt
77
tokens_completion
917
native_tokens_prompt
70
native_tokens_completion
786
native_tokens_completion_images
(null)
native_tokens_reasoning
437
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00012105
router
(null)
provider_responses
0
id
"a2203e153da843d09498b29aac25d504"
status
200
is_byok
false
latency
90
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771320856-cN5QM3f92Zx33p3Sjqof"
upstream_id
"a2203e153da843d09498b29aac25d504"
total_cost
0.00012105
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean: false, words: 317
Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 84.54%, Deviation: 34.54%
Meta Data: neededClean: false, wordsTotal: 317, wordsDialogue: 268
Overall: 0.0000%
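The reported ratio and deviation follow from the two word counts and the 50% target in the scenario. The sketch below reproduces that arithmetic and shows one plausible way to obtain the counts. The counting rule (whitespace-separated words, with dialogue being anything between straight double quotes) is an assumption, not the benchmark's documented method, and both function names are hypothetical. How the 0.00% scores are derived from the deviation is not shown on the page, so it is not modelled here.

```python
import re


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Return (dialogue_words, total_words).

    Assumed rule: words are whitespace-separated tokens, and dialogue is
    any text enclosed in straight double quotes.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    return dialogue, total


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.5) -> tuple[float, float]:
    """Dialogue share of all words and its absolute distance from target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


if __name__ == "__main__":
    # Counts copied from the evaluation metadata of this run.
    ratio, deviation = ratio_and_deviation(268, 317)
    print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With the recorded counts (268 dialogue words out of 317), this gives the same ratio and deviation as the Details field. The response also runs to 317 words against a 200-word target.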