NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-27-01KSMEKKDPRZM2BAEKN3YVZ8NQ
Run Details
Model:
Cydonia 24B V4.1
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
9.9s
Cost:
$0.000283
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned against the brick wall of the community center, chewing on the end of a lollipop she'd swiped from the gas station five blocks away. Bill stood beside her, fidgeting with the ill-fitting collar of his neon striped shirt. "I don't know, Bill," Molly said, her voice dropping to a conspiratorial whisper. "A children's party? That's just lowest of the low." "Lowest?" Bill sniffed indignantly. "This is prime real estate, Molly. These parents are loaded. I counted seven Teslas in the parking lot when I did my recon yesterday. Their kids are basically gift-reception centers on stilts." "Still feels cheap," Molly muttered, eyeing a stroller being pushed past them. "We're jewel thieves, Bill. Sophisticated. The last thing I want is my mug shot next to 'suspected piƱata bandit.'" Bill sighed, setting down his backpack of lock-picking tools. "Look, the party starts in twenty minutes. The parents will be distracted, pre-occupied with small talk and baby puree. This is our in. It's quiet, fast, and we walk away with thousands in unwrapped booty." "Unwrapped booty," Molly repeated, rolling her eyes. "You've been watching too many cartoons." "Fine," Bill said, straightening his ridiculous party hat he'd found in the dumpster earlier. "My condensed points: Children are noisy distractions. Parents are clueless. Presents are valuable. Window's cracked. We go in, grab, we go out. What could possibly go wrong?" "Everything," Molly said honestly. "What if there're security guards? What if the kid with the bow tie and attitude decides to test your 'youthful appearance' cover story by asking you to identify simple machines?" Bill adjusted his glasses, blinking rapidly. "Does knowing that a lever is a rigid bar that pivots around a fixed point make me look sufficiently childlike?" Molly stared at him for a long moment before tossing her lollipop into a nearby bush. "This is a disaster," she whispered. "Agreed," Bill nodded, already reaching for his lock picks. "But probably not a boring one."
created_at
"2026-05-27T10:10:13.57Z"
5/27/2026, 10:10:13 AM
model
"thedrummer/cydonia-24b-v4.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
211
moderation_latency
(null)
generation_time
9854
tokens_prompt
77
tokens_completion
523
native_tokens_prompt
243
native_tokens_completion
473
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
176
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.000283
router
(null)
provider_responses
0
endpoint_id
"4d46db86-63fe-47eb-85f1-6a7567af190c"
id
"chatcmpl-d305f1a5ff9544f897e59282e4974406"
is_byok
false
latency
211
model_permaslug
"thedrummer/cydonia-24b-v4.1"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876613-hOUpNmWps6pZocFyoVks"
session_id
(null)
api_type
"completions"
id
"gen-1779876613-nmT8MXkDT5aGskZPlNDA"
upstream_id
"chatcmpl-d305f1a5ff9544f897e59282e4974406"
total_cost
0.000283
cache_discount
0.0000264
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
324
0.00%
Dialogue to Total Word Ratio
Ratio: 61.77%, Deviation: 31.77%
neededClean
false
wordsTotal
327
wordsDialogue
202
0.0000%