NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFW7ZC0VMXKG33MF1DJDZK
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
3.4s
Cost:
$0.000113
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
77.94%
Dialogue to Total Word Ratio
(details)
38.9687%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the wall,arms crossed, eyeing the brightly colored balloons bobbing above the backyard. "So, Bill, you're telling me we're gonna sneak into a room full of screaming toddlers to swipe their birthday loot?" Bill adjusted his fake mustache, looking serious. "Precision, Molly. Precision. We hit the presents before the cake is cut. The parents are distracted by the clown. It's a classic play." Molly snorted. "A classic play? More like a disaster waiting to happen. What if one of them recognizes us? What if they lock the door? What if... what if they have guard dogs? Or worse... what if they have actual security?" "Guard dogs? Molly, it's a suburban backyard. The worst security is probably a grumpy grandma with a rolling pin." Bill tapped his temple. "Focus. The plan is simple. I create a diversion with the piƱata. You slip in, grab the pile near the cake, and we're out before the first 'Happy Birthday' is finished." Molly grimaced. "And how exactly do you plan to create a diversion that doesn't involve setting the whole place on fire? Last time you tried the 'loud noise' method, we ended up with a swarm of angry bees." Bill held up a small, shiny object. "This. The sonic disruptor. It'll just sound like... a really enthusiastic party horn. Guaranteed to scatter the little terrors." Molly eyed the device skeptically. "And you're sure it won't just make the dog bark? Or the neighbors call the cops?" Bill grinned. "That's the beauty of it. The neighbors *will* call the cops. But by then, we'll be long gone, sipping champagne on a yacht. Now, are you in, or are you gonna stand there looking like you swallowed a lemon?" Molly sighed, pushing off the wall. "Fine. But if we get caught, it's your fault. And if we get stung by bees, it's definitely your fault." She adjusted her own ridiculous clown wig. "Let's make this the heist that goes down in history... as the most ridiculous one."
created_at
"2026-02-17T09:46:09.275388+00:00"
2/17/2026, 9:46:09 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
58
moderation_latency
(null)
generation_time
3312
tokens_prompt
77
tokens_completion
884
native_tokens_prompt
70
native_tokens_completion
730
native_tokens_completion_images
(null)
native_tokens_reasoning
385
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00011265
router
(null)
provider_responses
0
id
"09d5cee4869549babaa5259d11689fd3"
status
200
is_byok
false
latency
58
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321565-EDvVRCZcrRCuePVdjmus"
upstream_id
"09d5cee4869549babaa5259d11689fd3"
total_cost
0.00011265
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
334
77.94%
Dialogue to Total Word Ratio
Ratio: 82.93%, Deviation: 7.07%
neededClean
false
wordsTotal
334
wordsDialogue
277
38.9687%