NC Bench
Run: 2026-05-27-01KSMEWS4ZDQPJ2AYBR7QPJNH8
Run Details
Model: Skyfall 36B V2
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 9.1s
Cost: $0.000486
Provider: Parasail
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
0.0000% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly peeked through the fence at the birthday party in full swing. Balloons bobbed merrily, and the delighted squeals of children filled the air. "Right, here's the plan," she whispered to Bill. "We need to blend in, grab the gifts, and cacaow!" Bill adjusted his wrinkled "Happy Birthday" tiara. "This looks ridiculous," he muttered. "Lengthening to blend at a party is bulletproof," Molly assured him, twirling a rainbow twist in her ponytail. "Now, grappling hooks on three. One... Two..." As the countdown clock screeched to a halt, the pair swung into action. Molly's hook caught the fence post, and she zipped up effortlessly. Bill's landed on a sandbox; he tumbled in with a gravely crack of palmetto sandals. Meanwhile, Molly had reached the event tent roof. She sprinted across, hugging the sides. Below, the children were entranced by a clown. The perfect diversion. Molly grabbed a stack of expertly wrapped gifts and prepared her getaway. But Bill was having trouble. He'd escaped the sandbox, but his tiara caught on a low-hanging branch, leading to an awkward stumble. A group of preschoolers snickered. "Ooh, Bill, you trip like you're overeasy!" John sang. Even when he reached Molly, he seemed disheveled and dreadful. "Let's blow this Popsicle stand," Bill whispered hoarsely, brushing sand from his tousle. As they tumbled off, Molly turned to Bill and sneaked an insincere smile. "Y'know, they say crime is a young person's game. Maybe we should retire, pick up fishing or knitting or morbid taxidermy..." "Aww, you knit, Mol? For whom, fellow schnozzes?" Bill was fishing through their illicit rainbow trove, pulling out an uninviting My Little Iguana doll. The pair dispelled as the wail of a police siren approached. "We did it, Bill... Ran off with some seriously soiled loot and a barrel full of banana splits!" Molly said gleefully as they jumped onto their sugary-ride-away. "I don't even like Koko hacerlo," Bill grumbled, kicking off his sandals. 
"That kid's the real bitonti now."
created_at: "2026-05-27T10:15:14.346Z" (5/27/2026, 10:15:14 AM)
model: "thedrummer/skyfall-36b-v2"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 183
moderation_latency: (null)
generation_time: 8762
tokens_prompt: 77
tokens_completion: 521
native_tokens_prompt: 243
native_tokens_completion: 507
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 176
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.00048645
router: (null)
provider_responses:
  0:
    endpoint_id: "1eb01ded-ae11-49e6-8aa6-3067584070bd"
    id: "chatcmpl-bb271ebd9fa68aa1"
    is_byok: false
    latency: 183
    model_permaslug: "thedrummer/skyfall-36b-v2"
    provider_name: "Parasail"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779876914-R3kZ5pegvQjmlRdLP3sA"
session_id: (null)
api_type: "completions"
id: "gen-1779876914-CgXvgALJD2dOZn2smiPg"
upstream_id: "chatcmpl-bb271ebd9fa68aa1"
total_cost: 0.00048645
cache_discount: 0.0000528
upstream_inference_cost: 0
provider_name: "Parasail"
response_cache_source_id: (null)
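The metadata above can be turned into a rough generation throughput figure. The sketch below assumes that generation_time is reported in milliseconds and that native_tokens_completion is the right token count to use; neither is stated on this page, and the helper name tokens_per_second is hypothetical.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: float) -> float:
    """Approximate decode throughput, assuming the time is in milliseconds."""
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the metadata above:
# native_tokens_completion = 507, generation_time = 8762
throughput = tokens_per_second(507, 8762)
print(f"{throughput:.2f} tokens/s")
```

Under those assumptions the run works out to a little under 58 tokens per second.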
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 328
0.00% | Dialogue to Total Word Ratio | Ratio: 31.72%, Deviation: 18.28% | neededClean: false, wordsTotal: 331, wordsDialogue: 105
0.0000%
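The reported ratio and deviation follow from the word counts in the Meta Data column: 105 dialogue words out of 331 total words, measured against the 50% target in the scenario. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. It is not NC Bench's evaluator: the function names, the whitespace tokenization, and the rule that dialogue is any text between straight double quotes are all assumptions.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    Assumption: dialogue is any text between straight double quotes,
    and words are whitespace-separated tokens.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def percent_report(dialogue_words: int, total_words: int,
                   target: float = 0.5) -> tuple[float, float]:
    """Return (ratio %, absolute deviation from target %), rounded to 2 places."""
    ratio = dialogue_words / total_words
    return round(ratio * 100, 2), round(abs(ratio - target) * 100, 2)


# Counts copied from the Meta Data column above.
print(percent_report(105, 331))  # (31.72, 18.28)
```

A quote-based counter like this also treats quoted labels (such as the "Happy Birthday" tiara in the response) as dialogue, so its counts may differ from the evaluator's.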