NC Bench
Run: 2026-05-27-01KSMETQDSSXCX5A9RVE2X9A8G
Run Details
Model: Cydonia 24B V4.1
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 10.6s
Cost: $0.000249
Provider: Parasail
Evaluation summary
Score    Evaluator                     Details Link
0.00%    Matches word count            (details)
3.19%    Dialogue to Total Word Ratio  (details)
1.5939%  Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Okay, so we casually show up to this birthday party," Molly said, biting into her stolen donut as blue icing smeared her chin. "I take out the chubby kid, you disable the security system." The security consisted of a more-breathing-than-working digital camera and a lock that Bill could open with a rubber band. "I'm stealing the presents from under the tree," Molly continued. "You keep the kids distracted. Maybe do some balloon animals? You know, the thing we talked about for weeks?" "I was a magician's assistant once," Bill insisted, fingers already moving in the air. "This kid will love me. Kids love me. Also, we've only talked about this plan for twenty minutes." "You said you did it with your magic kit as a kid. That's different," Molly rolled her eyes. "Just make sure you find a twisted balloon dog. The piñata's already filled with cash, but those toys are pure gold." Bill frowned. "We're divisioning the loot? Again? Your division of the sequin-covered princess crown last time left me without matching socks." " Lucy bought you those mismatched socks for Christmas. They're Christmas presents, Bill. Stop suggesting I made you have slate gray on the right and canary yellow on the left." "She was six," William said, losing the balloon dog slightly in his focus. "She gave me the blank look of a gifted child creating mildly uncomfortable art for her elders." Molly sighed and wiped her icing-laced fingers on her velour pants. "We're in, we're out, before cupcakes are served. The incendiary device is a timed cake that releases bright yellow smoke. No mess, no fire. It just looks like the world's worst birthday candle malfunctions." "Are we sure the kid won't notice all her presents are missing while we're still in the room?" "Distraction is key, Bill. Just keep inflating things."
created_at
"2026-05-27T10:14:07.045Z"
5/27/2026, 10:14:07 AM
model
"thedrummer/cydonia-24b-v4.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
282
moderation_latency
(null)
generation_time
10549
tokens_prompt
77
tokens_completion
470
native_tokens_prompt
243
native_tokens_completion
410
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0002491
router
(null)
provider_responses
0
endpoint_id
"4d46db86-63fe-47eb-85f1-6a7567af190c"
id
"chatcmpl-14306f702c1c4eecbbe09fa90b6532a6"
is_byok
false
latency
282
model_permaslug
"thedrummer/cydonia-24b-v4.1"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876847-wFuyeqBNpGJWT2AfJTpY"
session_id
(null)
api_type
"completions"
id
"gen-1779876847-IFRI8AykbeWJvl3I6sgA"
upstream_id
"chatcmpl-14306f702c1c4eecbbe09fa90b6532a6"
total_cost
0.0002491
cache_discount
0.0000288
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
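The timing and token fields above can be combined into a rough generation throughput figure. The sketch below is only an illustration: it assumes throughput is defined as native_tokens_completion divided by generation_time (taken to be in milliseconds), which the page itself does not state, and the function name is hypothetical.

```python
# Values copied from the run metadata above.
GENERATION_TIME_MS = 10549        # generation_time (assumed to be milliseconds)
NATIVE_TOKENS_COMPLETION = 410    # native_tokens_completion


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Hypothetical helper: completion tokens generated per second."""
    return tokens / (elapsed_ms / 1000)


throughput = tokens_per_second(NATIVE_TOKENS_COMPLETION, GENERATION_TIME_MS)
print(f"{throughput:.1f} tokens/s")
```

Under these assumptions the run works out to a little under 39 native completion tokens per second.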
Evaluation details
Result   Evaluator                     Details                           Meta Data
0.00%    Matches word count            n/a                               neededClean: false, words: 304
3.19%    Dialogue to Total Word Ratio  Ratio: 76.38%, Deviation: 13.62%  neededClean: false, wordsTotal: 309, wordsDialogue: 236
1.5939%  Overall
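The ratio and deviation reported for the dialogue evaluator can be reproduced from the word counts in its meta data. The sketch below is an approximation under stated assumptions, not NC Bench's actual evaluator: the function names are hypothetical, the quote-matching counter assumes dialogue is delimited by straight double quotes, and treating the overall score as the mean of the two evaluator scores is a guess that happens to fit the displayed numbers. How the 3.19% score is derived from the 13.62% deviation is not shown on the page and is not modeled here.

```python
import re

TARGET_RATIO = 0.90  # from the scenario: 90% of words should be dialogue


def count_words(text: str) -> int:
    """Count whitespace-separated words in the whole text."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue marker)."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in quoted_spans)


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that are dialogue."""
    return words_dialogue / words_total


# Counts reported in the evaluation details above.
ratio = dialogue_ratio(words_dialogue=236, words_total=309)
deviation = TARGET_RATIO - ratio
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Assumption: the overall score is the mean of the evaluator scores.
# With the displayed (rounded) 0.00% and 3.19% this gives about 1.595%,
# close to the reported 1.5939%.
overall = (0.00 + 3.19) / 2

# The counters applied to a tiny made-up sample, not the model response.
sample = '"Hi there," she said. "Go now."'
sample_ratio = dialogue_ratio(count_dialogue_words(sample), count_words(sample))
```

The first print statement yields the same ratio and deviation shown in the Details column, which suggests the deviation is simply the 90% target minus the measured ratio.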