NC Bench
Run: 2026-02-17-01KHNFQ6BQA6SRP3T0CG0DCFCW
Run Details
Model: Arcee AI: Trinity Large (Preview)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 16.7s
Cost: $0.000000
Provider: Arcee AI
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill crouched behind a hedge, peering through the leaves at the brightly colored bouncy castle in the backyard. Laughter and shrieks of joy drifted over to them on the warm summer breeze. "Alright, here's the plan," Molly whispered, pulling out a notepad. "Bill, you'll create a diversion by pretending to be a clown. While everyone's distracted, I'll sneak in and grab the presents." Bill raised an eyebrow. "A clown? Are you serious?" Molly nodded. "It's perfect. Who would suspect a clown of anything nefarious?" Bill sighed. "Fine, but I'm not wearing a red nose. Those things itch like crazy." "Deal," Molly said, scribbling on her notepad. "Now, once you've got their attention, I'll make my move. The presents should be piled up near the cake table." Bill glanced at the table, loaded with brightly wrapped packages. "That's a lot of presents. We may need a getaway vehicle." Molly grinned. "Leave that to me. I've got Rusty the Clown Car parked around the corner. It may look small, but trust me, it can hold a ton of loot." Bill shook his head in disbelief. "Rusty the Clown Car? Really? That thing is a death trap on wheels." "Hey, it's all about misdirection," Molly said defensively. "No one would ever suspect a harmless little clown car of being a getaway vehicle." Bill sighed again. "Alright, alright. I'll do my best clown impression. But if I end up with a pie in the face, I'm blaming you." Molly clapped him on the shoulder. "That's the spirit! Now let's get into position. Showtime in five minutes." They split up, Molly slinking towards the house while Bill made his way to the bouncy castle. He pulled a rainbow wig and oversized shoes out of his bag and quickly changed. Taking a deep breath, Bill stepped out from behind the hedge and started bouncing towards the party. "Hey kids, who wants to see a magic trick?" he called out in a high-pitched voice. The children turned, eyes wide with excitement. 
Molly watched from the shadows, waiting for her moment to strike. The plan was in motion, and soon they would be swimming in a sea of stolen presents. It was going to be the greatest heist of their careers - as long as Bill didn't embarrass them too much in the process.
created_at: "2026-02-17T09:43:36.779119+00:00" (2/17/2026, 9:43:36 AM)
model: "arcee-ai/trinity-large-preview:free"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 334
moderation_latency: (null)
generation_time: 16648
tokens_prompt: 77
tokens_completion: 577
native_tokens_prompt: 69
native_tokens_completion: 492
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0
router: (null)
provider_responses:
  0:
    id: "chatcmpl-8c75e09d0cfe62a4"
    status: 200
    is_byok: false
    latency: 166
    endpoint_id: "b9d81865-ce9a-432a-9cd5-0d9be55e7cb5"
    provider_name: "Arcee AI"
    model_permaslug: "arcee-ai/trinity-large-preview"
api_type: "completions"
id: "gen-1771321399-pVWfr4GBhwOkOMfeqAKb"
upstream_id: "chatcmpl-8c75e09d0cfe62a4"
total_cost: 0
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Arcee AI"
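The token counts and timing in the metadata can be combined into a rough throughput figure. The sketch below is only an illustration: it assumes generation_time is reported in milliseconds (which is consistent with the 16.7s duration shown for the run), and the helper name tokens_per_second is hypothetical, not part of any benchmark API.

```python
# Values copied from the run metadata.
GENERATION_TIME_MS = 16648        # assumption: milliseconds (Duration is shown as 16.7s)
TOKENS_COMPLETION = 577
NATIVE_TOKENS_COMPLETION = 492


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Average completion throughput over the whole generation."""
    if elapsed_ms <= 0:
        raise ValueError("elapsed_ms must be positive")
    return tokens / (elapsed_ms / 1000)


tps = tokens_per_second(TOKENS_COMPLETION, GENERATION_TIME_MS)
native_tps = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)

print(f"{tps:.1f} tok/s (tokens_completion)")
print(f"{native_tps:.1f} tok/s (native_tokens_completion)")
```

The two figures differ only because the metadata reports two different completion token counts for the same response.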
Evaluation details
Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean = false, words = 388

Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 48.71%, Deviation: 18.71%
Meta Data: neededClean = false, wordsTotal = 388, wordsDialogue = 189

Overall: 0.0000%
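The reported ratio and deviation can be reproduced from the word counts in the evaluation details. This is a minimal sketch of the arithmetic only: it assumes the deviation is the absolute difference, in percentage points, between the measured ratio and the 30% target from the scenario. How the evaluator counts dialogue words, or turns the deviation into a score, is not shown on this page and is not modeled here; the function name is hypothetical.

```python
TARGET_WORDS = 500            # from the scenario
TARGET_DIALOGUE_PCT = 30.0    # from the scenario

WORDS_TOTAL = 388             # wordsTotal in the evaluation details
WORDS_DIALOGUE = 189          # wordsDialogue in the evaluation details


def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of dialogue words in the response, as a percentage."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


ratio = dialogue_ratio_pct(WORDS_DIALOGUE, WORDS_TOTAL)
# Assumption: deviation is measured in percentage points from the target.
deviation = abs(ratio - TARGET_DIALOGUE_PCT)
shortfall = TARGET_WORDS - WORDS_TOTAL

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
print(f"Word count shortfall: {shortfall} words")
```

Under these assumptions the response is 112 words short of the 500-word target and carries well over the requested share of dialogue, which is consistent with both evaluators reporting 0.00%.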