NC Bench
Run: 2026-02-17-01KHNEXF8W9P5YQTC1KS1B3RR7
Run Details
Model: Arcee AI: Trinity Mini
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 4.6s
Cost: $0.000140
Provider: Clarifai
Evaluation summary
Score      Evaluator                      Details Link
0.00%      Matches word count             (details)
74.63%     Dialogue to Total Word Ratio   (details)
37.3173%   Overall
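The Overall figure appears to be the unweighted mean of the two evaluator scores. The page does not state this, so the sketch below treats it as an assumption and checks it against the displayed values. The scores are copied from the summary as shown (rounded to two decimals), so the result agrees with 37.3173% only to within rounding.

```python
# Evaluator scores as displayed in the summary (percent, rounded to 2 decimals).
scores = {
    "Matches word count": 0.00,
    "Dialogue to Total Word Ratio": 74.63,
}

# Assumption: Overall is the plain average of the evaluator scores.
overall = sum(scores.values()) / len(scores)

displayed_overall = 37.3173  # value shown on the page
difference = abs(overall - displayed_overall)

print(f"mean of scores: {overall:.4f}%, displayed: {displayed_overall}%")
```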
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned againstthe wall, arms crossed, watching Bill pace like a caged tiger. "So, you're telling me we're breaking into a birthday party? For presents?" Bill stopped, hands on hips, eyes blazing with conviction. "Exactly! It's the perfect target, Molly. Kids? They don't lock their presents away. They just... leave them out. Like an open invitation." Molly snorted. "An open invitation to a sugar crash and a tantrum. What's the plan, Bill? Sneak in while the clown's juggling and grab the Wii? How do you even get past the parents?" Bill pulled out a crumpled map, smoothing it out on the floor. "Simple. The back door. The parents are all in the living room, glued to the cake. The kids are in the backyard, chasing bubbles. Perfect blind spot." Molly raised an eyebrow. "And the presents? They're not just sitting in a pile guarded by a three-year-old with a plastic sword?" Bill grinned, a dangerous glint in his eye. "That's the beauty of it. They're not guarded. They're... displayed. Like trophies. We just need to... relocate them. Discreetly." "Discreetly?" Molly mimicked. "In a backyard full of screaming kids? How?" Bill pointed at the map. "The shed. It's unlocked. We grab the loot, toss it in the shed, and walk out the front door when the parents finally notice the kids are missing." Molly shook her head, a slow smile spreading. "You're insane. And you know what? I love it. But what about the kid? The birthday kid? He's probably the only one who could stop us." Bill waved a dismissive hand. "He's busy being the center of attention. He won't even notice us. We're just... background noise. Like the boring uncle." Molly chuckled, the absurdity finally hitting her. "Okay, Bill. Let's do it. But if we get caught, you're explaining to a three-year-old why his Lego castle is missing." Bill clapped her on the shoulder, a manic energy surging through him. "Fear not, Molly! This is going to be legendary! The greatest heist since... 
well, since the last time we stole a vending machine from a dentist's office. But bigger! Better! And with cake!" Molly followed him out the back door, the scent of balloons and cake already thick in the air, her heart pounding with a mix of dread and excitement. This was going to be a disaster. And she couldn't wait to see it unfold.
created_at: "2026-02-17T09:29:21.739285+00:00" (2/17/2026, 9:29:21 AM)
model: "arcee-ai/trinity-mini-20251201"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 271
moderation_latency: (null)
generation_time: 4477
tokens_prompt: 77
tokens_completion: 1071
native_tokens_prompt: 70
native_tokens_completion: 910
native_tokens_completion_images: (null)
native_tokens_reasoning: 474
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00013965
router: (null)
provider_responses:
  0:
    id: "84241fa6e5424b178bfb6e90d2753d89"
    status: 200
    is_byok: false
    latency: 271
    endpoint_id: "73d82c49-90ae-4223-a860-83ed0810821f"
    provider_name: "Clarifai"
    model_permaslug: "arcee-ai/trinity-mini-20251201"
api_type: "completions"
id: "gen-1771320556-MG48isQ6mG2KRo8gYoGy"
upstream_id: "84241fa6e5424b178bfb6e90d2753d89"
total_cost: 0.00013965
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Clarifai"
Evaluation details
Result     Evaluator                      Details                           Meta Data
0.00%      Matches word count             n/a                               neededClean: false, words: 401
74.63%     Dialogue to Total Word Ratio   Ratio: 67.58%, Deviation: 2.42%   neededClean: false, wordsTotal: 401, wordsDialogue: 271
37.3173%   Overall
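The Ratio and Deviation values follow from the recorded counts: 271 dialogue words out of 401 total, measured against the 70% target in the scenario. Below is a minimal sketch of how such a ratio could be computed. It assumes that dialogue means text inside straight double quotes and that words are whitespace-separated. The evaluator's actual tokenization is not shown on this page, and `dialogue_stats` is a hypothetical helper, not part of NC Bench. How the 74.63% score is derived from the deviation is not shown either, so it is not reproduced here.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.70) -> dict:
    """Count words inside straight double quotes versus all words (illustrative only)."""
    total = len(text.split())
    # Each match is the content between one pair of double quotes.
    spoken = sum(len(chunk.split()) for chunk in re.findall(r'"([^"]*)"', text))
    ratio = spoken / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": spoken,
        "ratio": ratio,
        "deviation": abs(target_ratio - ratio),
    }


# Small made-up sample in the style of the scene, not the recorded response.
sample = 'Molly snorted. "What is the plan, Bill?" Bill grinned. "Simple. The back door."'
stats = dialogue_stats(sample)

# Reproduce the recorded figures directly from the counts in the Meta Data column.
recorded_ratio = 271 / 401 * 100                # percent of words that are dialogue
recorded_deviation = abs(70 - recorded_ratio)   # distance from the 70% target

print(stats)
print(f"Ratio: {recorded_ratio:.2f}%, Deviation: {recorded_deviation:.2f}%")
```

The word count evaluator scored 0.00% with 401 words recorded against the requested 500; the page does not state the tolerance it applies.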