NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-27-01KSMEGFB1FCZ76VADNBHA7NRH
Run Details
Model:
Cydonia 24B V4.1
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
20.7s
Cost:
$0.000523
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her fanny pack, her eyes scanning the row of الرغم brightly colored houses on Main Street. "Look," she whispered, gesturing with her coffee cup, "right there. The one with the uninflated balloons tied to the white picket fence." Bill followed her gaze, pushing thin glasses up his nose. "That's it? Seems rather... cheerful for a high-value target." "That's the genius part," Molly said, her grin widening. "Nobody suspects the children's party. All eyes are on us, the parents, making small talk. We're invisible." She took a sip of her coffee. "That birthday treasure chest? Filled with cash gifts from well-meaning aunts and uncles. I'm talking stacks of twenties and fifties. Easy pickings while the kids are too busy with cake to notice." Bill scratched his head. "Possible, but still a risk. Security presence?" "Not a gated compound, if that's what you're worried about." Molly waved dismissively. "Just some bored teenager they hired to watch the back door. Piece of cake." She caught Bill's skeptical expression and amended, "Easy to slice, I mean. Get it? Slice the pie?" "Focus, Molly." Bill sighed. "We need to approach this with professionalism. My intelligence suggests these events are often monitored." "Monitors?" Molly's eyes darted around. "Who told you that?" "Common sense! Parents today," Bill said, waving an envelope presumably holding blueprints. "They track everything. I've found schematics of the party venue. Multiple breach points." "Hate to break it to you, genius, but we're breaking into a pumpkin cake decoration, not Fort Knox," Molly muttered, rolling her eyes. She lowered her voice. "Listen, here's the plan. You'sh distraction at the front—make a big fuss about allergies or something. I'll slip in through the bow window in the sunroom. The parents will be too busy knowing you to notice me." "This is preposterous," Bill said, adjusting his tie. "Operation 'Birthday Blowout' has been meticulously planned for weeks. I was specifically looking for second-floor access near the promised land of presents." "Look, Bill, kids live on the ground floor. Presents live on the ground floor. You can stop your James Bond nonsense," Molly whispered angrily. "We get in, grab the loot, and be gone before anyone even knows the birthday kid got more than one Michael Jackson Halloween costume." "That's cold, Molly." "The gifts are harder money, Bill. Easier to fence than adrenaline pills." "Where'd you get that wisdom from?" Bill asked, suddenly suspicious. "Watched too many movies," Molly admitted with a shrug. "My point is, we're not stealing from orphans here. Those kids may not even like Michael Jackson." Bill sighed, deeply. "Your lack of professional ethics continues to astound me. Nevertheless, we proceed. Try not to trip over the party clown." "The party clown?" Molly asked, eyes wide. "Scheduled in his act. Arrives at 2:45 PM," Bill said, flipping through his notes. "Apparently, he's a 'surprise your child distracting special element.' Parents will be occupied with laughter and tears." "Just like that?" Molly stared at him, impressed in spite of herself. "You've really got this planned out, haven't you?" "Detailed preparation prevents catastrophic mistakes," Bill said sternly. "Now stop chatting and review the distraction timing. Three minutes tops, then you need to be at the sunroom window. Don't screw this up." "Got it," Molly nodded, a mischievous grin forming. "Three minutes. Window's marked." Bill shot her a look as they approached the house. The sound of children's laughter and adult chatter grew louder as they rounded the corner, passing the balloon-adorned fence just as a agency in a red car pulled up outside. "Group of clowns is earlier than expected," Bill muttered. "Change of plans. You go with the window, I'll circle for secondary access points." Molly patted the fanny pack one last time and straightened her polka-dotted apron. "Operation Baby Boomer Bonanza is a go." Before Bill could stop her, she marched up the walkway and disappeared into the bustling children's party, leaving him to mutter about professionalism and the distinct possibility that she'd already replaced her apron with a clown nose.
created_at
"2026-05-27T10:08:31.087Z"
5/27/2026, 10:08:31 AM
model
"thedrummer/cydonia-24b-v4.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
201
moderation_latency
(null)
generation_time
20586
tokens_prompt
77
tokens_completion
1097
native_tokens_prompt
243
native_tokens_completion
957
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0005226
router
(null)
provider_responses
0
endpoint_id
"4d46db86-63fe-47eb-85f1-6a7567af190c"
id
"chatcmpl-8d43bfac0b2241138e80ffe3e55490fb"
is_byok
false
latency
201
model_permaslug
"thedrummer/cydonia-24b-v4.1"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876511-Mt5CZrnVtCFA1O09ihBa"
session_id
(null)
api_type
"completions"
id
"gen-1779876511-eUFUw1zL1osdYqxVcYJZ"
upstream_id
"chatcmpl-8d43bfac0b2241138e80ffe3e55490fb"
total_cost
0.0005226
cache_discount
0.0000288
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
671
0.00%
Dialogue to Total Word Ratio
Ratio: 66.57%, Deviation: 36.57%
neededClean
false
wordsTotal
676
wordsDialogue
450
0.0000%