NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNF5ZN5YG8ZGKEB277DQEQR
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
3.4s
Cost:
$0.000115
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
13.08%
Dialogue to Total Word Ratio
(details)
6.5399%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly bounced onher toes, eyes wide with manic glee. "Bill, this is genius! A children's party! All those shiny presents under the tree, just... sitting there! Like a candy store for grown-ups!" Bill adjusted his ill-fitting clown shoes, a look of profound skepticism on his face. "Molly, we've discussed this. Stealing from kids? It's... morally questionable. Plus, they're tiny. We'd need a stepladder just to reach the presents." "Stepladder? Pfft! We're professionals! We'll blend in!" Molly twirled, a cheap party hat askew. "We'll be the funniest clowns at the party! Everyone loves clowns! We'll hand out balloon animals and steal the presents right under their noses!" Bill rubbed his temples. "Blending in? Molly, we look like escaped convicts who lost a bet. And 'hand out balloon animals'? That's not a cover, that's a job description. What if a kid asks for their balloon back?" Molly grinned, already imagining the loot. "Then we say, 'Oops! Silly us! Here's a shiny new toy instead!' Problem solved! And the cake! Bill, think of the cake! We could eat half of it before anyone notices!" Bill sighed, the sound heavy with resignation. "Molly, the cake is for the kids. They'll cry. And the parents? They'll call the police. And the presents... Molly, what if one of those presents is a live puppy?" Molly's grin faltered, replaced by a look of dawning horror. "A... a puppy? Oh, Bill! That's... that's terrible! We can't steal a puppy! That's... that's just wrong!" Bill nodded solemnly. "Exactly. We're thieves, Molly. Not puppy-nappers. This plan has more holes than Swiss cheese. Let's go back to robbing banks. At least the targets are less... fluffy."
created_at
"2026-02-17T09:33:59.408129+00:00"
2/17/2026, 9:33:59 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
61
moderation_latency
(null)
generation_time
3384
tokens_prompt
77
tokens_completion
871
native_tokens_prompt
70
native_tokens_completion
744
native_tokens_completion_images
(null)
native_tokens_reasoning
439
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00011475
router
(null)
provider_responses
0
id
"28a000b8e5ca436cb0134bd2fca4eaab"
status
200
is_byok
false
latency
61
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771320835-6iLejK3Upaqo7jHGtrLi"
upstream_id
"28a000b8e5ca436cb0134bd2fca4eaab"
total_cost
0.00011475
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
278
13.08%
Dialogue to Total Word Ratio
Ratio: 78.06%, Deviation: 11.94%
neededClean
false
wordsTotal
278
wordsDialogue
217
6.5399%