NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFB05HJ9GFMQTDHXX1VE4V
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
4.2s
Cost:
$0.000140
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against thefence, squinting at the brightly colored balloons bobbing above the backyard. "Look at that, Bill. Pure loot. Kids' birthday party. Presents everywhere. Like a candy store for thieves." Bill adjusted his glasses, peering through his binoculars. "Molly, it's a children's party. The presents are wrapped in shiny paper, sure, but they're... what? Stuffed animals? Cheap plastic toys? And the security? A bunch of toddlers. They'd probably just cry if we took their stuff." Molly grinned, tapping her finger against her chin. "Exactly! They're defenseless. No security guards, no alarms, just a bunch of sugar-hyped rugrats. We waltz in, grab the loot, and vanish before anyone realizes their 'precious' stuffed dinosaur is missing. It's a cakewalk." Bill sighed, lowering the binoculars. "A cakewalk? Molly, it's a backyard. The 'security' is a three-year-old with a juice box. The 'looters' are the parents who'll notice their kid's new action figure is gone and start a neighborhood watch. Plus, the cake is probably gluten-free. We can't even enjoy the spoils." Molly shrugged, already moving towards the gate. "Details, Bill. We're not here for the cake. We're here for the presents. And the loot is... substantial. Look at that pile near the table. That's probably a gaming console someone's kid got. We could fence that for real money." Bill followed, shaking his head. "A gaming console? Molly, that's a toddler's birthday party! It's a Fisher-Price tablet, not a PlayStation. And even if it was, how do we fence it? We don't have a fence. We're standing in a suburban backyard planning a heist on Elmo's birthday." Molly paused at the gate, looking back at Bill. "Fine. We take the cheap toys. The stuffed animals, the plastic cars. We dump them in the dumpster behind the pizza place. No one will miss a few dollar-store trinkets. It's a low-risk, high-reward operation. What's the worst that happens? We get chased by a crying kid? That's comedy gold." Bill hesitated, then nodded slowly. "Alright. Low risk. High reward. Comedy gold. But Molly... when we get back to the van, I'm driving. You're navigating. And we're wearing gloves. And maybe a disguise. Like... giant clown noses? That might confuse the toddlers." Molly laughed, pulling the gate open. "Clown noses? Bill, you're thinking too hard. Just go with it. This is going to be brilliant." She slipped through the gate, ready to begin the most ridiculous heist of their careers.
created_at
"2026-02-17T09:36:44.583269+00:00"
2/17/2026, 9:36:44 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
78
moderation_latency
(null)
generation_time
4152
tokens_prompt
77
tokens_completion
1054
native_tokens_prompt
70
native_tokens_completion
910
native_tokens_completion_images
(null)
native_tokens_reasoning
420
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00013965
router
(null)
provider_responses
0
id
"43719b1261c646d4a965029daabf0b17"
status
200
is_byok
false
latency
78
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321000-1Of0A4QCi4zPuTuvrrbA"
upstream_id
"43719b1261c646d4a965029daabf0b17"
total_cost
0.00013965
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
411
0.00%
Dialogue to Total Word Ratio
Ratio: 80.05%, Deviation: 30.05%
neededClean
false
wordsTotal
411
wordsDialogue
329
0.0000%