NC Bench
Run: 2026-05-27-01KSMEMA9XD8YYRAAMD7H1XGFY
Run Details
Model:
Cydonia 24B V4.1
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
9.4s
Cost:
$0.000257
Provider:
Parasail
Evaluation summary
Score      Evaluator
0.00%      Matches word count
0.00%      Dialogue to Total Word Ratio
0.0000%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her sunglasses, though she wore them under a wig of ineptly dyed blue hair that looked suspiciously like a cleaning rag. "We have a timeline problem," she whispered, studying the targeting map on her phone. The sunny suburban street was ominously quiet, the only movement coming from a lawn sprinkler that seemed to have a personal vendetta against a particularly stubborn dandelion. Bill wiped sweat from his brow, his bow-tie hanging slightly askew despite multiple attempts to tighten it. "It's just a kids' party, Mol. They'll be too busy stuffing their faces to notice we're stuffing our pockets." He jingled a puzzlingly large number of keys that weren't his. "Which is exactly why we're operating on borrowed time," Molly hissed, crumpling the invitation she'd nabbed from the school's lost and found. "Patty birthday girl. Seven years old. That means attention span of a gnat, but also parent vigilance of a énormously bored hawk." The sound of a balloon popping in the distance made both of them jump. "We need a diversion," Bill said, the pragmatism finally kicking in as he produced a small fire whistle from his pocket. "I was thinking... smoke?" Molly rolled her eyes so forcefully she was pretty sure she could see the future. "No, you oaf. We need children's entertainment value. Remember that time at the jewelry store when you tried to use a whoopee cushion?" "You said it was affable," Bill sulked. "It was adjacent to fun being inadequate," Molly corrected, already mentally calculating how long it would take to assemble a costumed distraction from the supplies in Bill's van of questionable origin. "We go in, we get the 'gifts' for the disenfranchised, we leave before the looters lose their minds over the demolition man cake." Bill nodded professionally, carefully not thinking about the previous week's incident involving a security system and what they'd thought was a "perfectly safe" amount of nitrogen triiodide.
created_at
"2026-05-27T10:10:37.001Z"
5/27/2026, 10:10:37 AM
model
"thedrummer/cydonia-24b-v4.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
303
moderation_latency
(null)
generation_time
9256
tokens_prompt
77
tokens_completion
502
native_tokens_prompt
243
native_tokens_completion
440
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
240
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0002569
router
(null)
provider_responses
0
endpoint_id
"4d46db86-63fe-47eb-85f1-6a7567af190c"
id
"chatcmpl-e856fd443eb74090abc5e949d9744c24"
is_byok
false
latency
303
model_permaslug
"thedrummer/cydonia-24b-v4.1"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876637-oO1eRNcgmfHcwyxi8RZu"
session_id
(null)
api_type
"completions"
id
"gen-1779876637-zffPdhGVAHG93goP8dZC"
upstream_id
"chatcmpl-e856fd443eb74090abc5e949d9744c24"
total_cost
0.0002569
cache_discount
0.000036
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Evaluator: Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean = false, words = 320

Evaluator: Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 38.32%, Deviation: 28.32%
Meta Data: neededClean = false, wordsTotal = 321, wordsDialogue = 123

Overall: 0.0000%
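The reported ratio and deviation can be reproduced from the counts above: 123 dialogue words out of 321 total is about 38.32%, which is 28.32 percentage points away from the 10% target in the scenario. The sketch below shows that arithmetic and one plausible way to obtain the counts from the response text. It is not the benchmark's actual evaluator: the function names, the whitespace tokenization, and the rule that dialogue is whatever sits between straight double quotes are all assumptions made for illustration. The two evaluators report slightly different totals (320 and 321), which suggests their real tokenization differs from this one in some detail.

```python
import re


def ratio_from_counts(words_dialogue: int, words_total: int, target: float = 0.10) -> dict:
    """Dialogue share and its absolute distance from the target share.

    Hypothetical helper: the 10% default comes from the scenario text,
    not from any documented evaluator setting.
    """
    ratio = words_dialogue / words_total if words_total else 0.0
    return {"ratio": ratio, "deviation": abs(ratio - target)}


def count_words(text: str) -> dict:
    """Count total and quoted words.

    Assumptions: words are whitespace-separated tokens, and dialogue is
    any text enclosed in straight double quotes.
    """
    words_total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    return {"wordsTotal": words_total, "wordsDialogue": words_dialogue}


def word_count_deviation(words: int, target: int = 200) -> float:
    """Relative overshoot or undershoot against the requested length."""
    return abs(words - target) / target


if __name__ == "__main__":
    result = ratio_from_counts(123, 321)
    print(f"Ratio: {result['ratio']:.2%}, Deviation: {result['deviation']:.2%}")
    print(f"Word count deviation: {word_count_deviation(320):.0%}")
```

Run against the counts in this report, the word-count helper gives a 60% overshoot (320 words against a requested 200). How either deviation maps to the 0.00% scores is not stated in the run details, so the sketch stops at the deviations.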