NC Bench
Run: 2026-05-27-01KSMEF7RG1P1MRXDHG1Q0HH13
Run Details
Model: Cydonia 24B V4.1
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 7.7s
Cost: $0.000212
Provider: Parasail
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
96.50% | Dialogue to Total Word Ratio | (details)
48.2505% | Overall |
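The overall figure looks consistent with an unweighted mean of the two evaluator scores, although this page does not state the aggregation rule. A minimal sketch under that assumption (the function name `overall_score` is hypothetical, not part of NC Bench):

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores given as percentages.

    Assumption: the benchmark averages its evaluators equally;
    the page itself does not document the aggregation rule.
    """
    if not scores:
        raise ValueError("at least one evaluator score is required")
    return sum(scores) / len(scores)


# Scores as displayed in the summary (rounded to two decimals).
print(overall_score([0.00, 96.50]))
```

With the rounded scores shown in the table this gives 48.25, close to the reported 48.2505%. The small difference would be explained if the displayed 96.50% is itself a rounded value.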
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Molly, we are not stealing from a children's party," I said vehemently, pacing the cramped apartment. "Think about it, Bill. That place is an all-you-can-eat buffet of electronic toys and words that need batteries. It's the perfect score. The parents are too busy playing hide-and-seek with their little darlings to notice trouble." "Some enterprising thief stole all the good presents years ago. It's a trap," I muttered, running a hand through my thinning hair. " hysteria, is brilliant misdirection. No one would ever rob a kid's birthday. That's our advantage! Plus, the cake has jelly filling—when we trip the silent alarm, they'll think it's just a sugar-fueled chaos explosion." I sighed, defeated. "Fine. What's the plan? Assault from the window? Tunnel through the backyard?" "Much simpler. Under the guise of being party clowns, we'll case the joint." She pulled out a red nose and a pair of giant shoes. "Clowns? We haven't worked a gig in twenty years. I can't do balloon animals." "Relax. You just have to stand around looking menacingly cheerful. I'll do the heavy lifting." Molly adjusted her wig. "Besides, children are terrified of our faces. They'll give us a wide berth." "What if they have actual clowns? This stinks, Molly. Why can't we rob a casino like normal thieves?" "Because casinos are boring! They have cameras and math. This is art. Now inflate that balloon, we've got a diamond-encrusted toy train to liberate."
created_at: "2026-05-27T10:07:50.557Z" (5/27/2026, 10:07:50 AM)
model: "thedrummer/cydonia-24b-v4.1"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 160
moderation_latency: (null)
generation_time: 7613
tokens_prompt: 77
tokens_completion: 378
native_tokens_prompt: 243
native_tokens_completion: 351
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 240
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.0002124
router: (null)
provider_responses:
  0:
    endpoint_id: "4d46db86-63fe-47eb-85f1-6a7567af190c"
    id: "chatcmpl-6edc2f3b199f4997807bcd753819d1fd"
    is_byok: false
    latency: 160
    model_permaslug: "thedrummer/cydonia-24b-v4.1"
    provider_name: "Parasail"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779876470-vv5SupBxSb0WsJmiBQfs"
session_id: (null)
api_type: "completions"
id: "gen-1779876470-g0ZxZRS111hnPkLJPayV"
upstream_id: "chatcmpl-6edc2f3b199f4997807bcd753819d1fd"
total_cost: 0.0002124
cache_discount: 0.000036
upstream_inference_cost: 0
provider_name: "Parasail"
response_cache_source_id: (null)
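The metadata above is enough to estimate generation throughput. The sketch below assumes that `generation_time` is reported in milliseconds (7613 is in line with the 7.7s duration shown in the run details) and that `native_tokens_completion` is the relevant token count. Neither assumption is confirmed by this page, and the helper name `tokens_per_second` is hypothetical.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: float) -> float:
    """Completion tokens divided by generation time in seconds.

    Assumption: generation_time_ms is in milliseconds.
    """
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values taken from the metadata: native_tokens_completion and generation_time.
rate = tokens_per_second(351, 7613)
print(f"{rate:.1f} tokens/s")  # prints "46.1 tokens/s"
```

Using `tokens_completion` (378) instead of the native count would give a slightly higher figure. The two counts come from different tokenizers.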
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 237
96.50% | Dialogue to Total Word Ratio | Ratio: 85.66%, Deviation: 4.34% | neededClean: false, wordsTotal: 244, wordsDialogue: 209
48.2505%
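The ratio and deviation reported for the dialogue evaluator can be reproduced from the `wordsDialogue` and `wordsTotal` counts. The sketch below assumes the deviation is the absolute difference from the 90% target named in the scenario. The function names are hypothetical. How the evaluator turns a 4.34% deviation into a 96.50% score is not stated here, so that step is left out.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that fall inside dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


def deviation_from_target(ratio: float, target: float = 0.90) -> float:
    """Absolute distance from the target ratio (assumed to be 90%)."""
    return abs(ratio - target)


# Counts taken from the evaluator's metadata.
ratio = dialogue_ratio(209, 244)
dev = deviation_from_target(ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {dev:.2%}")
# prints "Ratio: 85.66%, Deviation: 4.34%"
```

The word-count evaluator reports 237 words while this evaluator counts 244 in total, so the two evaluators evidently tokenize the response differently.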