NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-27-01KSMEY8HHRD128ZV8BH2N5RNB
Run Details
Model:
Cydonia 24B V4.1
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
14.7s
Cost:
$0.000388
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.8722%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced back and forth across the rundown apartment, her sneakers squeaking against the linoleum floor. Bill sat at the small kitchen table, calmly sipping coffee as if planning a breakfast menu rather than a major theft. "kid-friendly neighborhood, right?" Molly asked, stopping suddenly. "No, really ask me again about this kid-friendly neighborhood detail. I just need to confirm..." "For the fifth time, yes, Miller Acres. The 'Happiest Place to Grow Up' according to the welcome sign we saw on the way over," Bill replied, not looking up from his newspaper. "Operation Birthday Bash," Molly mused, a slight grin spreading across her face. "It has a certain ring to it, isn't it? Some romantic rogue thieves swindling gift-wrapped cash from just-born bankers." Bill finally lowered his paper. "They're three-year-olds, Moll. The 'bankers' you're referring to can't even tell the difference between a dollar bill and a liquorice wrapper." "Optimistic! I like that in a partner." Molly grabbed a water bottle from the fridge and took a drink. "The Party Shop is having their end-of-year clearance. I was thinking about getting a balloon animal for later, you know? For morale. Boost our spirits if the snatch goes south." Bill sighed and rubbed his temples. "No balloon animals." "I hardly see how a little balloon animal would compromise the operation's integrity," Molly protested. "Besides, we might need a diversion. I can make balloon animals do some pretty convincing dance moves, I'll have you know." "We don't need diversions. We need in and out in under two minutes before the birthday girl finishes her nap and discovers the presents are gone," Bill explained patiently. "This isn't Mission: Impossible, Molly. It's interfering with toddler playtime." "We're professionals!" Molly spun around, nearly knocking over a chair. "Professionals who are about to score gift bags that average eighty bucks a pop, I might add. And haven't opened presents since we were kids." She paused, her expression softening. "Have the people who work at the toy store always been so suspiciously cheerful, or is that a new security measure?" "I think it's just their job," Bill said dryly. "Look, just stick to the plan. I'll create a small fireworks distraction at the backyard of the house next door. By the time the responding police cars arrive, you'll be in and out of the party house. The presents go into the bag, and we meet at the van around the corner." "I like it. Except maybe change the 'small fireworks distraction' to a 'serious fireworks display of awesome'," Molly suggested. "Nothing cements an escape quite like a spectacular fiery explosion." "Focus, Molly." Bill returned to his newspaper as if ending the conversation. "Remember, this is an in-and-out job. Quick, clean, and we're done before the latest batch of juice boxes gets delivered." "Suffering pでしょう冰kSprite," Molly muttered before grinning. "Don't worry, partner. I know exactly what I'm doing." She began limbering up, stretching her arms dramatically. "You focus on your fireworks. I'll work on my disappointment face if I'm not back in time."
created_at
"2026-05-27T10:16:02.879Z"
5/27/2026, 10:16:02 AM
model
"thedrummer/cydonia-24b-v4.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
161
moderation_latency
(null)
generation_time
14637
tokens_prompt
77
tokens_completion
806
native_tokens_prompt
243
native_tokens_completion
703
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
240
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0003884
router
(null)
provider_responses
0
endpoint_id
"4d46db86-63fe-47eb-85f1-6a7567af190c"
id
"chatcmpl-e585aef4080546e8aca7b94b910b0087"
is_byok
false
latency
161
model_permaslug
"thedrummer/cydonia-24b-v4.1"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876962-4ho95y6KbvinGCk5rQTi"
session_id
(null)
api_type
"completions"
id
"gen-1779876962-z83uPygwoOVe9RFohEC8"
upstream_id
"chatcmpl-e585aef4080546e8aca7b94b910b0087"
total_cost
0.0003884
cache_discount
0.000036
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
504
0.00%
Dialogue to Total Word Ratio
Ratio: 75.49%, Deviation: 25.49%
neededClean
false
wordsTotal
514
wordsDialogue
388
49.8722%