NC Bench
Run: 2026-02-17-01KHNFWVNW9YP4S54WSPQW8PRN
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
4.5s
Cost:
$0.000152
Provider:
Clarifai
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly bounced on hertoes, eyes wide with glee. "Bill, this is perfect! A whole room full of shiny boxes just waiting for us. Think of the loot!" She gestured wildly at the party decorations. "We just need to get in, grab the presents before the kids notice, and vanish into the night." Bill adjusted his glasses, looking skeptical. "Molly, we're talking about a children's birthday party. There's probably a dozen adults within shouting distance. And the presents are under a tree guarded by a hyperactive golden retriever." He pointed at the dog, who was currently attempting to hump Bill's leg. "Not to mention the parents. They're like human watchdogs." Molly waved a dismissive hand. "Pfft. We're professionals. We'll use the garden hose to create a diversion. You distract the dog with a squeaky toy, I scale the fence, grab the loot, and we're out before anyone can say 'cookie'." She grinned, imagining the haul. "Imagine the cash from those designer toys!" Bill sighed, rubbing his temples. "A garden hose? Molly, that's not a diversion, that's a garden tool. And the dog? You think a squeaky toy will work when he's got a bone? This is madness. We're not thieves, we're... enthusiastic party crashers with poor life choices." Molly's grin faltered slightly, replaced by a determined glint. "Fine. Plan B. You sneak in disguised as a clown. Kids love clowns. They'll think you're part of the entertainment. I'll create a distraction with... glitter bombs!" She pulled a small, glittery pouch from her pocket. "Sparkle everywhere! Parents will be too busy cleaning up to notice us." Bill stared at the glitter bomb. "Molly, we're not going to a rave. We're going to a children's party. Glitter will get everywhere. On the cake. In the cake. On the kids. On the parents. It's a disaster waiting to happen." He shook his head. "This is the worst plan I've ever heard. And I've heard some bad plans." Molly puffed out her chest. "Then we'll do it the old-fashioned way. 
You sneak in, I sneak in, we grab the presents, and we run. Simple. Quick. No fancy gadgets." She looked at him, her eyes pleading. "Please, Bill? For the loot? For the... adventure?" Bill looked at her, then at the sparkling, chaotic scene of the party. He sighed, a long, defeated sound. "Alright. But if we get caught, it's your glitter bomb that gets us arrested. And I'm blaming you when the dog bites me." He started towards the fence. "Let's go get those presents. Before common sense catches up to us."
created_at
"2026-02-17T09:46:30.379884+00:00"
2/17/2026, 9:46:30 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
60
moderation_latency
(null)
generation_time
4499
tokens_prompt
77
tokens_completion
1145
native_tokens_prompt
70
native_tokens_completion
990
native_tokens_completion_images
(null)
native_tokens_reasoning
509
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00015165
router
(null)
provider_responses
0
id
"9f1ed1633f674d07a6a9a6fa91a236e9"
status
200
is_byok
false
latency
60
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321585-o2xfJpWp1tjHiS7l0Hoe"
upstream_id
"9f1ed1633f674d07a6a9a6fa91a236e9"
total_cost
0.00015165
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 428
0.00% | Dialogue to Total Word Ratio | Ratio: 73.60%, Deviation: 63.60% | neededClean: false, wordsTotal: 428, wordsDialogue: 315
0.0000% | Overall
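The reported ratio and deviation can be reproduced from the counts in the evaluation details (wordsTotal 428, wordsDialogue 315) and the 10% target in the scenario. The sketch below is only an illustration: the function names are hypothetical, and the quote-matching rule and the use of an absolute difference for the deviation are assumptions, not the benchmark's documented method.

```python
import re


def ratio_from_counts(words_dialogue: int, words_total: int,
                      target_ratio: float = 0.10) -> tuple[float, float]:
    """Return (dialogue ratio, deviation from target) from raw word counts.

    Assumption: deviation is the absolute difference between the measured
    ratio and the target ratio.
    """
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target_ratio)


def dialogue_stats(text: str, target_ratio: float = 0.10) -> dict:
    """Count whitespace-separated words inside straight double quotes.

    Assumption: dialogue is any span between a pair of straight double
    quotes; the real evaluator may tokenize or match quotes differently.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


# Counts taken from the evaluation details of this run.
ratio, deviation = ratio_from_counts(315, 428)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With the counts from this run, 315 / 428 rounds to 73.60%, which is 63.60 percentage points above the 10% target. The response is also 428 words against the requested 200, which is consistent with the 0.00% score on the word-count evaluator.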