NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFAJ0SM1SGA53SC6144NZK
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
5.4s
Cost:
$0.000164
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
90.48%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
45.2419%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leanedagainst the brick wall, her eyes fixed on the brightly colored balloons bobbing above the backyard fence. Bill adjusted his gloves, his brow furrowed. "Okay, Molly, let's run through this again. We hit the party at 7:30 sharp. The parents are all inside, probably arguing over who gets the last slice of cake. The kids are distracted by the clown. Perfect." Molly nodded, a grin spreading across her face. "Yeah, yeah. The clown's the best part. He's got those giant shoes that make that *whoosh* sound when he walks. Kids love it. We just need to... distract him. Maybe make him think he dropped his red nose? That always gets a laugh." Bill sighed, pulling out a small, shiny object. "Molly, it's a heist, not improv. We need a solid plan. Distraction first. I'll create a diversion near the cake table. You slip in, grab the presents, and meet me at the back gate. Simple." "Simple?" Molly scoffed. "You make it sound like stealing candy from a baby. Which, technically, is what we're doing. Kids' presents. Baby presents." She winked. "Easy money." Bill adjusted his glasses, his voice low. "Easy money that belongs to five-year-olds who cry when their teddy gets a scratch. That's the moral quandary, Molly. Do we really want to be the villains who steal the birthday presents?" Molly shrugged, her enthusiasm undimmed. "Look, Bill, it's not like they're going to miss them. They'll get new ones next year. Besides, we're not hurting anyone. Just... borrowing. For a little while." She gestured towards the fence. "Now, how do we get over this thing without setting off the motion sensor?" Bill pointed to a large, bushy rhododendron. "See that? We climb over, hide in the bushes until the clown's busy. Then we move fast. The presents are in that big blue bin near the swings. We grab the wrapped ones, leave the cheap plastic toys. That's the plan." Molly's eyes sparkled. "The blue bin. Got it. And the clown? How do we... distract him?" Bill produced a small, battery-powered toy duck. "This. I'll make it quack loudly near him. He'll think it's a lost toy and go chasing it. Perfect cover for you to slip in." Molly clapped her hands. "Perfect! Quack, duck! That's brilliant, Bill! You're a genius!" She bounced on her heels. "So, 7:30. Motion sensor off. Rhododendron cover. Quack duck. Blue bin. Presents gone. Easy peasy lemon squeezy!" Bill checked his watch. "7:30. Don't be late. And Molly?" He paused, a rare serious look crossing his face. "Try not to laugh too loud when you see the birthday boy cry. It's... unsettling." Molly just grinned, already scanning the fence line. "Don't worry, Bill. I'll be professional. Just... very, very professional." She gave him a thumbs-up. "Let's make this the easiest heist of our careers." She turned and melted into the shadows of the rhododendron bushes, ready for the easy score.
created_at
"2026-02-17T09:36:31.394122+00:00"
2/17/2026, 9:36:31 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
91
moderation_latency
(null)
generation_time
5327
tokens_prompt
77
tokens_completion
1229
native_tokens_prompt
70
native_tokens_completion
1075
native_tokens_completion_images
(null)
native_tokens_reasoning
494
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001644
router
(null)
provider_responses
0
id
"35bf6b5f5381472eb431283741a46cc3"
status
200
is_byok
false
latency
91
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771320985-L1XlwHvYSBoW9IdPMQYz"
upstream_id
"35bf6b5f5381472eb431283741a46cc3"
total_cost
0.0001644
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
90.48%
Matches word count
n/a
neededClean
false
words
490
0.00%
Dialogue to Total Word Ratio
Ratio: 73.06%, Deviation: 43.06%
neededClean
false
wordsTotal
490
wordsDialogue
358
45.2419%