NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-27-01KSMEG4B67MXRREZZNB1W08K2
Run Details
Model:
Cydonia 24B V4.1
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
15.5s
Cost:
$0.000438
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced the length of thecrumbs-strewn apartment, her silver fur coat swishing against faded beige wallpaper. "Bill, are you listening to me? This could be our biggest score yet." Bill glanced up from the untidy folding of his "disguise"—a cocktail dress covered in sequins that probably once glittered in a better light. "Anything for a five-year-old's princess party, darling. This is art. What was that about our biggest score?" "You heard me. The Brinkley-Hartford tycoons have been brag箐 giving their daughter Maisie everything she wants. I've seen the kid on social media—the crown jewels look more modest than her birthday cake. Top shelf stuff, Bill." "Jewels? You're sure? At a children's party?" Bill adjusted his glasses, carefully inspecting a seam on the dress. "Not jewelry, you dope. The latest tech. My sources say they're giving her that new robotics kit that cost more than your car." "Why does that sound exactly plausible?" Bill muttered. "So we're breaking into a children's party to steal children's toys? A new low, even for us." Molly paused, spinning on her heel with practiced precision. "Bill, focus. This isn't about some plastic junk. This is about black market industrial design schematics hidden inside those shiny boxes. We tap into the maker community, they reverse-engineer those plans, and we're looking at quarter million distribution rights." "Or," Bill said, standing up with the dress held half-mast, "we get caught by Miss Fusspot Security Guard and spend Christmas morning in a cell." "We don't get caught," Molly corrected, pulling a small device from her coat pocket. "We infiltrate. You go in as Aunt Mildred's plus-one—" "I look nothing like Mildred's plus-one, Molly. For one, I'm a man." "Depends on how you accessorize." She tossed him a rainbow-colored bowtie. "You greet them with enough noisy enthusiasm to make their teeth hurt, and nobody questions Aunt Mildred bringing her outrageous nephew to the party." "And you?" Bill sighed, catching the tie. "I'm the horse. Literally," Molly said, gesturing toward the corner where a pink plush costume with velcro mouth lay in a heap. "Parental Sabotage had me booked for two hours, I already told them there was a family emergency and I had to leave. The costume's theirs for the afternoon." "But what—want?" "Three words: unstable supply chain. Your job is to make Small Maisie Brinkley-Hartford so miserable she flees to the safety of her owner-less mountain of presents. I'll be waiting nearby, and when she's nestled in with her pile of toys, I make the grab-and-go." Molly snapped her fingers. "Thirty seconds maximum. In and out before the piñata even hits the floor." "The piñata is the signal?" Bill asked, horror creeping into his voice. "The distraction," Molly corrected. "Get the kid away from the party, and I do the job. Sixty forty split, I get the larger percentage since I'm the one taking all the risk in the stupid horse suit." "House policies are sixty forty psychotic criminal all over," Bill muttered. Molly's eyes gleamed. "Rum and Coke too? Magic markers, sticky shoes, and enough sugar to float a battleship? It's like Christmas in spring, only cushier and with better gifts. Don't you see, Bill? It's perfect. We get rich, some company loses twenty dollars in pilfered parts, and the world keeps turning exactly the same way. What could possibly go wrong?"
created_at
"2026-05-27T10:08:19.827Z"
5/27/2026, 10:08:19 AM
model
"thedrummer/cydonia-24b-v4.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
184
moderation_latency
(null)
generation_time
15415
tokens_prompt
77
tokens_completion
883
native_tokens_prompt
243
native_tokens_completion
788
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0004381
router
(null)
provider_responses
0
endpoint_id
"4d46db86-63fe-47eb-85f1-6a7567af190c"
id
"chatcmpl-dcd275c6d4f842f5ad3141e61cb679e0"
is_byok
false
latency
184
model_permaslug
"thedrummer/cydonia-24b-v4.1"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876499-CAgkQg2Yba51QtEVIKkc"
session_id
(null)
api_type
"completions"
id
"gen-1779876499-x2oMsXZWWBGI1fK6gAm3"
upstream_id
"chatcmpl-dcd275c6d4f842f5ad3141e61cb679e0"
total_cost
0.0004381
cache_discount
0.0000288
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
557
0.00%
Dialogue to Total Word Ratio
Ratio: 77.15%, Deviation: 27.15%
neededClean
false
wordsTotal
569
wordsDialogue
439
0.0000%