NC Bench
Run: 2026-05-27-01KSMEKHKXGTM1RAJ12X643AKT
Run Details
Model: Cydonia 24B V4.1
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 7.3s
Cost: $0.000198
Provider: Parasail
Evaluation summary
Score | Evaluator
86.38% | Matches word count
0.00% | Dialogue to Total Word Ratio
43.1902% | Overall
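The Overall figure is consistent with an unweighted mean of the two evaluator scores. The sketch below checks that arithmetic under that assumption; the function name `overall_score` is hypothetical, and the benchmark's actual aggregation rule is not shown on this page.

```python
def overall_score(scores):
    """Unweighted mean of evaluator scores in percent (assumed aggregation)."""
    if not scores:
        raise ValueError("at least one score is required")
    return sum(scores) / len(scores)


# Evaluator scores as displayed in the summary (rounded to two decimals).
displayed_scores = [86.38, 0.00]
print(f"{overall_score(displayed_scores):.2f}%")  # prints 43.19%
```

The result agrees with the displayed 43.1902% to two decimal places; the extra digits on the page presumably come from the unrounded word-count score.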
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly studied the blueprints for the third time, her glasses perched precariously on her nose. "It's ridiculous to think we'd need to scale a five-foot fence," she muttered, folding the paper. Bill scratched his forehead, watching a goblin bounce outside. "Concussions are messy, Molly," Bill whispered, peering through binoculars. "Besides, who puts a climbing wall next to a fence?" "Parents who don't want to worry about insurance," Molly replied, tapping her pen. "According to my surveillance, they've got a classic treasure chest lock on that toy box. Piece of cake." "That's what you said about the bank security system." "Recalibrated that in two minutes," Molly snapped. "And stopped that screaming alarm. You just got startled by the flashing lights." A barrage of confetti exploded from a nearby window. Bill flinched. "Maybe we reconsider. There's at least twenty kids running around, and they're hyper-stimulated on sugar." Molly sighed, straightening her sleeves. "Child labor liquidates presents faster than we can steal them. Grab the bolt cutters." "Should we... bring party favors?" Bill asked nervously, watching a piñata break open. "For, you know, blending in?" Molly rolled her eyes. "You did not just suggest we crash a children's party as adults dressed like unicorns." "I was thinking clowns," Bill said softly. "They're less... frightening."
created_at
"2026-05-27T10:10:11.72Z"
5/27/2026, 10:10:11 AM
model
"thedrummer/cydonia-24b-v4.1"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
254
moderation_latency
(null)
generation_time
7189
tokens_prompt
77
tokens_completion
359
native_tokens_prompt
243
native_tokens_completion
308
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
192
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.0001981
router
(null)
provider_responses
0
endpoint_id
"4d46db86-63fe-47eb-85f1-6a7567af190c"
id
"chatcmpl-52baa2df9d3a47528ca68576324384e4"
is_byok
false
latency
254
model_permaslug
"thedrummer/cydonia-24b-v4.1"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876611-M5EiPNIpCC02I7jkGgJf"
session_id
(null)
api_type
"completions"
id
"gen-1779876611-tQhKB7uw5MAhQLzj4Xbh"
upstream_id
"chatcmpl-52baa2df9d3a47528ca68576324384e4"
total_cost
0.0001981
cache_discount
0.0000288
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
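The metadata above is enough to estimate generation throughput. The sketch below assumes `generation_time` is reported in milliseconds (which is roughly consistent with the 7.3s duration shown in the run details) and uses `native_tokens_completion` as the token count; the helper name `tokens_per_second` is hypothetical.

```python
def tokens_per_second(completion_tokens, generation_time_ms):
    """Estimate throughput from a token count and a duration in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the metadata: native_tokens_completion and generation_time.
rate = tokens_per_second(308, 7189)
print(f"{rate:.1f} tokens/s")
```

Under these assumptions the run works out to roughly 42.8 tokens per second. Using `tokens_completion` (359) instead would give a higher figure, so the choice of token count should be stated alongside any comparison.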
Evaluation details
Result | Evaluator | Details | Meta Data
86.38% | Matches word count | n/a | neededClean: false, words: 211
0.00% | Dialogue to Total Word Ratio | Ratio: 66.20%, Deviation: 56.20% | neededClean: false, wordsTotal: 213, wordsDialogue: 141
43.1902% | Overall
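The ratio and deviation reported by the dialogue evaluator can be reproduced from its own word counts. The sketch below assumes the deviation is the absolute difference, in percentage points, between the measured ratio and the 10% target from the scenario; the function name `dialogue_ratio_report` is hypothetical, and how the deviation maps to the 0.00% score is not shown on this page.

```python
def dialogue_ratio_report(words_dialogue, words_total, target_pct=10.0):
    """Return (ratio, deviation) in percent, rounded to two decimals.

    Deviation is taken as the absolute distance from target_pct,
    which is an assumption about the evaluator's definition.
    """
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    ratio = 100.0 * words_dialogue / words_total
    deviation = abs(ratio - target_pct)
    return round(ratio, 2), round(deviation, 2)


# wordsDialogue and wordsTotal as reported in the evaluator metadata.
ratio, deviation = dialogue_ratio_report(141, 213)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# prints: Ratio: 66.20%, Deviation: 56.20%
```

Note that this evaluator reports 213 total words while the word-count evaluator reports 211, so the two appear to tokenize the response slightly differently.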