NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFBYA1294XPZAVDSH87NHQ
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4.1s
Cost:
$0.000135
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the fence,her eyes tracking the kids swarming around the giant inflatable castle. "Look at 'em, Bill. Pure chaos. Like a swarm of sugar-crazed locusts. Perfect cover." Bill adjusted his glasses, peering through his binoculars. "Chaos is our friend, Molly. But we need a point of entry. The back gate looks unsecured. That's our window." "Window? More like a gaping maw," Molly muttered, scanning the yard. "But what about the presents? They're the real prize. How do we get past the parents? They're like human watchdogs." Bill tapped his chin. "Simple. We blend in. We're the 'Uncle Bob' and 'Aunt Sally' act. Parents love handing out candy to 'nice uncles.' We'll position ourselves near the snack table. Easy in, easy out." Molly snorted. "Blending in? With our getups? You look like you raided a thrift store's '80s section, Bill. And I'm wearing this tacky clown wig. We'll stick out like a sore thumb." "Comfort over style, Molly. We need to be forgettable. Parents won't remember us if we're uncomfortable." Bill adjusted his ill-fitting polyester suit. "Focus. The presents are under that blue tarp near the cake table. We grab 'em, we're out before the first tantrum starts." Molly grinned, a mischievous glint in her eye. "Tantrums? That's our cue. We hit it when the parents are distracted by a meltdown. Perfect." Bill sighed, but a reluctant smile tugged at his lips. "Fine. Operation 'Steal the Sugar Rush' is a go. Remember the plan: Snack table, grab the tarp, vanish. No heroics. No talking to kids. They bite." "Got it. No heroics. No talking to kids." Molly mimicked him, then winked. "Unless they offer us candy. Then we negotiate." Bill shook his head, but he couldn't help chuckling. "Just stick to the plan, Molly. And for the love of all that's shiny, don't trip over the clown shoes." Molly adjusted her oversized, floppy shoes. "Don't worry. My grace is legendary. Now, let's go make some parents' lives a living nightmare." She took a step towards the gate, then paused. "Bill? What if the kids... you know... like the presents?" Bill stared at her, then burst out laughing. "Molly! They're five! They'll like *anything*! Now move!" He pushed her towards the gate, the plan, however ridiculous, finally set in motion.
created_at
"2026-02-17T09:37:15.370591+00:00"
2/17/2026, 9:37:15 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
58
moderation_latency
(null)
generation_time
4009
tokens_prompt
77
tokens_completion
1002
native_tokens_prompt
70
native_tokens_completion
881
native_tokens_completion_images
(null)
native_tokens_reasoning
414
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001353
router
(null)
provider_responses
0
id
"70e077a1e4994fe1a18cd60cd01d5271"
status
200
is_byok
false
latency
58
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321030-bVV3UTGIFTFvckklqgP5"
upstream_id
"70e077a1e4994fe1a18cd60cd01d5271"
total_cost
0.0001353
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
382
0.00%
Dialogue to Total Word Ratio
Ratio: 71.20%, Deviation: 41.20%
neededClean
false
wordsTotal
382
wordsDialogue
272
0.0000%