NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNG1WQXZ85S6RTM32HH34DN
Run Details
Model:
Arcee AI: Trinity Mini
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
5.2s
Cost:
$0.000173
Provider:
Clarifai
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the brick wall, hereyes fixed on the brightly colored balloons bobbing above the backyard fence. "Bill, look at that. Pure loot. A whole room full of shiny things just waiting for us." Bill adjusted his glasses, peering through the scope of his improvised periscope made from a cardboard tube and a magnifying glass. "Molly, we agreed no residential targets. Kids' parties are a minefield of parental supervision, loud noises, and... well, kids. They scream. A lot." "Parental supervision is just another obstacle," Molly countered, her voice low but excited. "And loud noises? Perfect cover. No one will hear us over the screaming. Kids scream anyway, right? It's like white noise." Bill sighed, rubbing his temples. "The noise is the problem, Molly. It's chaotic. How do we even get in? The gate's locked. The back door's reinforced. And what about the... the cake? The cake is a trap. Everyone stops to watch the cake. It's a distraction field." Molly grinned, already imagining the haul. "Exactly! We hit it during the cake phase. Everyone's eyes are glued to the frosting. We grab the presents, make our exit before the 'Happy Birthday' song finishes. Easy." "Easy for you to say," Bill muttered. "You're not the one who has to carry a giant teddy bear through a crowd of sugar-fueled toddlers. What if one of them... what if one of them recognizes us? Or worse, tries to hug us?" Molly rolled her eyes. "They won't recognize us. We're wearing masks. And hugs? We dodge. Simple. Focus, Bill. The presents. The shiny, expensive-looking presents. That Lego set alone is worth a month's rent." Bill finally lowered the periscope, his expression grim. "Molly, that Lego set is probably just a box of plastic bricks. And the 'shiny' presents? Likely wrapped in cheap paper. This isn't a bank, it's a birthday. The real prize is the cake. The cake is the real target. It's enormous, covered in fondant, and impossible to carry without attracting attention." Molly's enthusiasm faltered slightly. "The cake? But... but we need presents! For the... for the... the getaway car? Or something." Bill nodded solemnly. "Exactly. The cake is the perfect decoy. We steal the presents, create a diversion with the cake, and vanish. The parents will be too busy trying to cut the cake to notice us leaving with the loot." Molly hesitated, then shrugged. "Alright. Cake it is. But we need a plan for the cake. How do we... extract it?" Bill produced a small, sleek device from his pocket. "This. A high-grade cake cutter. Discreet, efficient. And a dolly. We'll need a dolly." Molly's eyes lit up again. "A dolly! Perfect. We can load the cake onto the dolly, wheel it out like we belong here. Parents love helping with the cake. They'll think we're staff." Bill adjusted his glasses, a rare, almost hopeful smile touching his lips. "Staff. Yes. Perfect. Now, remember the plan. We move at the peak of the cake phase. No lingering. Grab, cake, go. No heroics. No hugs. Understood?" Molly saluted with a gloved hand. "Understood. Cake heist. Operation Sweet Surprise. Let's go steal some birthday joy." She started towards the gate, already plotting the dolly's route. Bill followed, muttering about the impracticality of carrying a giant cake while avoiding toddlers, but the spark of a challenge was in his eyes. The backyard party, with its balloons and laughter, suddenly looked like the most complicated, yet potentially rewarding, target they'd ever faced.
created_at
"2026-02-17T09:49:15.740174+00:00"
2/17/2026, 9:49:15 AM
model
"arcee-ai/trinity-mini-20251201"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
60
moderation_latency
(null)
generation_time
5127
tokens_prompt
77
tokens_completion
1337
native_tokens_prompt
70
native_tokens_completion
1130
native_tokens_completion_images
(null)
native_tokens_reasoning
443
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00017265
router
(null)
provider_responses
0
id
"a2f54df4a85e49ab88ec5889edea0776"
status
200
is_byok
false
latency
60
endpoint_id
"73d82c49-90ae-4223-a860-83ed0810821f"
provider_name
"Clarifai"
model_permaslug
"arcee-ai/trinity-mini-20251201"
api_type
"completions"
id
"gen-1771321750-cRbOZHW9sl9MQCTl8Nlp"
upstream_id
"a2f54df4a85e49ab88ec5889edea0776"
total_cost
0.00017265
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Clarifai"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
582
0.00%
Dialogue to Total Word Ratio
Ratio: 70.96%, Deviation: 20.96%
neededClean
false
wordsTotal
582
wordsDialogue
413
0.0000%