NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNEWF4SZHQV5XQXTJ60C71J
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
4.1s
Cost:
$0.000107
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
65.78%
Dialogue to Total Word Ratio
(details)
32.8879%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Billstared at the crumpled clown suit. "Molly, this is insane. A children's party? We're breaking into a birthday bash? What's next? Stealing candy from a baby?" Molly adjusted her oversized red nose. "Bill, think of the loot! Toys, games, maybe even a pony if we're lucky. Kids don't guard presents like security at Fort Knox. It's practically a gift buffet!" Bill rubbed his temples. "A buffet? More like a trap. What if the birthday boy is a ninja? Or the cake is booby-trapped? And clowns? Seriously? You know I have a phobia, Molly." Molly grinned, her eyes sparkling. "Phobia? That's just your way of saying you're not man enough for a little fun. Besides, the suit's on me. It's camouflage. Kids see clowns and think 'free hugs,' not 'armed robbers.' We blend right in." Bill sighed, looking at the suit again. "Blending in is one thing. Getting out with a sack full of Legos and a talking teddy bear is another. What's the exit strategy? Hide under the cake table?" Molly clapped her hands together. "Perfect! The cake table is the best spot. Kids will be too busy eating frosting to notice us stuffing their loot into the clown car. We hit it, we grab it, we bounce. Easy as pie... if pie wasn't made of sugar and disappointment." Bill looked at the suit, then at Molly's determined face. "Okay. But if we get caught, it's your nose. I'm blaming the clown makeup." Molly saluted with a finger. "Deal. Operation 'Steal the Sugar Rush' is a go. Let's make this birthday unforgettable... for everyone else."
created_at
"2026-02-17T09:28:48.310259+00:00"
2/17/2026, 9:28:48 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
64
moderation_latency
(null)
generation_time
4014
tokens_prompt
77
tokens_completion
817
native_tokens_prompt
70
native_tokens_completion
689
native_tokens_completion_images
(null)
native_tokens_reasoning
424
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001065
router
(null)
provider_responses
0
id
"86f49c65716d450583ca14116133082c"
status
200
is_byok
false
latency
64
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771320523-AyFWhjRcSiTZuihFBRaa"
upstream_id
"86f49c65716d450583ca14116133082c"
total_cost
0.0001065
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
266
65.78%
Dialogue to Total Word Ratio
Ratio: 81.95%, Deviation: 8.05%
neededClean
false
wordsTotal
266
wordsDialogue
218
32.8879%