NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-17-01KHNFCVR528AE1YSBW6FFP3N9
Run Details
Model:
Arcee AI: Trinity Large (Preview)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
13.0s
Cost:
$0.000000
Provider:
Arcee AI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the wall, arms crossed, a mischievous glint in her eye. "So, Bill, you sure about this? Breaking into a kid's party to steal presents? Seems a bit low, even for us." Bill chuckled, adjusting his cap. "Come on, Moll. It's not about the kids. It's about the thrill. Besides, think of all the shiny new toys we could sell." "Shiny new toys, huh?" Molly smirked. "And what about the parents? They're not going to be too thrilled when they find out their little Timmy's new bike is missing." Bill waved a dismissive hand. "We'll be long gone by then. Besides, who doesn't love a good mystery? The kids will have a story to tell for years." Molly sighed, shaking her head. "You're incorrigible. Alright, fine. But if we're doing this, we need a plan. How do we even get in without being seen?" "Easy," Bill said, pulling out a crumpled piece of paper from his pocket. "I've got the layout of the house. There's a back window that's always left open for the cat." Molly raised an eyebrow. "A cat? Really? You're basing our entire heist on a cat?" "Hey, it's not my fault Whiskers here is a reliable accomplice," Bill retorted. "Besides, the window's big enough for us to squeeze through." Molly rolled her eyes but couldn't help a small smile. "Alright, genius. Let's say we get in. How do we know which presents are the good ones?" Bill grinned, tapping his temple. "I've been doing some recon. The good stuff is always under the tree. We just grab everything and sort it out later." "And if the kids are still awake?" Molly asked, her tone serious. Bill paused, considering. "We'll have to improvise. Maybe distract them with some candy from my pocket." Molly laughed, shaking her head. "You and your candy. Alright, Bill. Let's do this. But remember, no heroics. We grab the presents and get out. No lingering." "Wouldn't dream of it," Bill said with a wink. "Just a quick in and out. Easy peasy." Molly nodded, a determined look on her face. "Alright, partner. Let's go make some kids' dreams come true. Or at least, let's make sure they have a story to tell." Bill clapped her on the shoulder. "That's the spirit. Let's roll." Together, they slipped into the shadows, ready to embark on their most unconventional heist yet.
created_at
"2026-02-17T09:37:54.798109+00:00"
2/17/2026, 9:37:54 AM
model
"arcee-ai/trinity-large-preview:free"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
401
moderation_latency
(null)
generation_time
12977
tokens_prompt
77
tokens_completion
589
native_tokens_prompt
69
native_tokens_completion
537
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
0
id
"9cf44eb84557f4e2-IAD"
status
200
is_byok
false
latency
178
endpoint_id
"239c00d5-c404-4d17-b243-6a645212dc95"
provider_name
"Arcee AI"
model_permaslug
"arcee-ai/trinity-large-preview"
api_type
"completions"
id
"gen-1771321061-fQnDSfxxxR8asRHTzmzW"
upstream_id
"9cf44eb84557f4e2-IAD"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Arcee AI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
397
0.00%
Dialogue to Total Word Ratio
Ratio: 72.29%, Deviation: 42.29%
neededClean
false
wordsTotal
397
wordsDialogue
287
0.0000%