Run: 2026-02-17-01KHNFJ6PG6H3Y5B76ZHJHY8R9

Run Details

Duration:
10.0s
Cost:
$0.000000
Provider:
Arcee AI

Evaluation summary

Score Evaluator
Details Link
20.19% Matches word count (details)
90.91% Dialogue to Total Word Ratio (details)
55.5517% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly leaned against the wall, arms crossed. "So, you're sure this is the best plan? Breaking into a kid's birthday party to steal the presents?" Bill nodded confidently. "Absolutely. Parents are too distracted with the chaos to notice a couple of extra guests. Plus, the loot is prime." "And you're certain we can pull this off without getting caught?" Molly raised an eyebrow. "Trust me, Moll. I've cased this joint. The parents are clueless, and the kids are too busy with cake and games to pay us any mind." Molly sighed. "Alright, but if this goes sideways, I'm blaming you." Bill grinned. "It won't. We'll blend right in. I'll distract the parents with some fake small talk, and you make a beeline for the presents." "What about security? Won't they have cameras or something?" "Nah, it's a kid's party. They're not expecting a heist. Besides, I've got a few tricks up my sleeve." Molly shook her head, half-amused, half-exasperated. "You're insane, you know that?" Bill winked. "That's why you love me. Now, let's go make some kids cry."
created_at"2026-02-17T09:40:47.035326+00:00" 2/17/2026, 9:40:47 AM
model"arcee-ai/trinity-large-preview:free"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency776
moderation_latency(null)
generation_time9934
tokens_prompt77
tokens_completion277
native_tokens_prompt69
native_tokens_completion251
native_tokens_completion_images(null)
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0
router(null)
provider_responses
0
id"chatcmpl-9393e96f98b48abf"
status200
is_byokfalse
latency379
endpoint_id"b9d81865-ce9a-432a-9cd5-0d9be55e7cb5"
provider_name"Arcee AI"
model_permaslug"arcee-ai/trinity-large-preview"
api_type"completions"
id"gen-1771321236-lZbpQONkO8mMELQrPr0b"
upstream_id"chatcmpl-9393e96f98b48abf"
total_cost0
cache_discount(null)
upstream_inference_cost0
provider_name"Arcee AI"

Evaluation details

Result Evaluator Details Meta Data
20.19% Matches word count n/a
neededCleanfalse
words180
90.91% Dialogue to Total Word Ratio
Ratio: 84.44%, Deviation: 5.56%
neededCleanfalse
wordsTotal180
wordsDialogue152
55.5517%