NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFV8W5MQBSS447X9XDAG4X
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
5.6s
Cost:
$0.000137
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squintedat the blinking red light on the security camera. "Perfect," she whispered, nudging Bill. "That's the only one watching the back door. Bill, you're our distraction. Go make some noise." Bill groaned, adjusting his oversized clown nose. "Molly, we're breaking into a birthday party. The 'distraction' is a three-year-old with a juice box." "Exactly!" Molly grinned. "He'll be too busy trying to pour it on his head to notice us. You go, Bill. Be the clown. I'll be the magician. Poof! Presents gone." Bill shuffled towards the living room, his clown shoes squeaking. "I'm not a magician, Molly. I'm a... a... professional party crasher with a questionable wig." Molly rolled her eyes. "Just do it. And for the love of all that's shiny, don't trip over the piƱata." Bill disappeared into the festive chaos. Molly checked her watch. "Ten minutes. That's all we need. Kids are predictable. Nap time starts... now." She slipped through the slightly ajar back door. Inside, the living room was a battlefield of streamers, half-eaten cake, and squealing children. Molly spotted the presents, stacked neatly under a tree. "Bingo," she breathed, reaching for the largest box. A small voice piped up, "Who's that? Are you a monster?" Molly froze, heart pounding. "No! I'm... the birthday fairy! Delivering extra presents!" The child frowned. "But the fairy brings presents at night. You're too loud." Molly winced. "Right. Too loud. Bad fairy. Run along, little one." She tried to back away, but the child grabbed her ankle. "Help! Monster!" Bill burst in, tripping over a discarded balloon. "Molly! I got the juice box! It's... it's... wet!" He held up a soggy, red-stained clown nose. Molly sighed, looking at the child clinging to her leg and Bill looking like a drowned rat. "This was supposed to be a cakewalk," she muttered. "Now we've got a witness and a soggy clown. Maybe we should just... leave the presents?" She gestured helplessly at the tree. Bill nodded vigorously, his eyes wide. They retreated as quietly as possible, leaving the party intact, the presents untouched, and Bill muttering about needing a new nose.
created_at
"2026-02-17T09:45:39.704987+00:00"
2/17/2026, 9:45:39 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
22
moderation_latency
(null)
generation_time
5515
tokens_prompt
77
tokens_completion
1058
native_tokens_prompt
70
native_tokens_completion
891
native_tokens_completion_images
(null)
native_tokens_reasoning
501
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001368
router
(null)
provider_responses
0
id
"f74fba410b1d4da79c510d95dda8db1b"
status
200
is_byok
false
latency
22
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321533-j3D74Af8DgVtCToaDJgD"
upstream_id
"f74fba410b1d4da79c510d95dda8db1b"
total_cost
0.0001368
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
355
0.00%
Dialogue to Total Word Ratio
Ratio: 49.30%, Deviation: 39.30%
neededClean
false
wordsTotal
355
wordsDialogue
175
0.0000%