NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNF31FK8P7XWT0HA82QN1FH
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
3.6s
Cost:
$0.000118
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted at the brightly colored banner flapping in the breeze. "Bill, look! The Johnson's backyard. Kids everywhere, balloons, cake... and presents under that tree. Like a treasure map drawn by a five-year-old." Bill adjusted his glasses, peering through his binoculars. "Molly, we agreed no residential targets. Kids' parties are... messy. And noisy. And the parents are usually armed with cameras, not baseball bats." "Armed with cameras? That's our distraction!" Molly grinned, already calculating angles. "We climb the fence, grab the presents before the cake hits the floor, and vanish. Ten minutes tops. We're in, we're out, we're rich." Bill sighed, the picture of reluctant genius. "Ten minutes? Molly, the clown just arrived. He's juggling chainsaws. And the dog is loose. And the cake is... *decorated* with sprinkles. It's a logistical nightmare wrapped in a sugar rush." Molly ignored the objections, her mind racing. "Perfect! We use the clown's red nose as a decoy. Bill, you're the distraction. You trip over the piƱata rope. I scale the fence. We grab the giant teddy bear and the noisy robot. Easy." Bill rubbed his temples. "The giant teddy bear weighs fifty pounds. The robot requires assembly. And the clown... he's got a very long stick. And he's *very* good at spotting suspicious activity near the cake table." Molly's eyes sparkled with mischief. "That's the beauty, Bill! We're suspicious activity! We'll be the most interesting thing since the clown's chainsaws. Now, are you in or are you going to stand there looking like a deflated party balloon?" Bill finally nodded, a small, determined smile playing on his lips. "Alright. But if we get caught, it's your idea. And you're buying the therapy. And the cake. All of it." He adjusted his backpack. "Let's make this the heist that ends with a very confused toddler and a very angry clown."
created_at
"2026-02-17T09:32:23.707949+00:00"
2/17/2026, 9:32:23 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
58
moderation_latency
(null)
generation_time
3499
tokens_prompt
77
tokens_completion
900
native_tokens_prompt
70
native_tokens_completion
769
native_tokens_completion_images
(null)
native_tokens_reasoning
416
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001185
router
(null)
provider_responses
0
id
"1443d1e9ae89453f936a1d46f4daf7eb"
status
200
is_byok
false
latency
58
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771320739-PO2GgLQ4hSbClSQiEl3n"
upstream_id
"1443d1e9ae89453f936a1d46f4daf7eb"
total_cost
0.0001185
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
308
0.00%
Dialogue to Total Word Ratio
Ratio: 79.87%, Deviation: 69.87%
neededClean
false
wordsTotal
308
wordsDialogue
246
0.0000%