NC Bench
Run: 2026-05-27-01KSMEP51QBB2AK3HANP310CKW
Run Details
Model: Skyfall 36B V2
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 10.1s
Cost: $0.000514
Provider: Parasail
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.31% | Dialogue to Total Word Ratio
0.1533% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced back and forth, her emerald eyes sparkling with excitement and a touch of nervousness. "Alright, Bill, we've got everything planned. Perimeter check, photo op, gift grab, and ghost!" Bill looked up from his daguerreotype, his bushy mustache twitching. "You sure about this, Moll? There's bound to be a pack of little ankle-biters running around and squealing." "I'm spot on for this one, Bill. The bigwig's son is turning a whole six! Imagine the haul - costly gifts stacked high. And with all the cradle-ticklers not noticing, we can sweep in, snatch, and go!" Bill scratched his chin. "They might have a nanny guard or such. Won't hurt to make sure." "No need. I've been casing the joint. The only folks are the rug-rats and families. No strapping railbirds on guard or any bog-trotters lolling about." Bill adjusted his monocle. "If you're certain... The bonanza would be prime. But if things unfurl awry, the bigwig's folks ain't to be trifled with. Left a mark on the cell at the cop shop." "Bill, you worry too much! It'll be easy as taking a tickle from a babe. I've got the treads and rigs for us to bluff and scheme past those brats and their mommies. No roadblocks!" Bill nodded reluctantly. "Right enough, Moll. Now what about loot-lining and cuddle-cloaking? Can't have them spotting us and raising a ruckus." Molly broke into a grin. "Bill, you'd be brilliant playing팅 with those totters for a tad! Give 'em a peep here and there while I get our mittens on the mounds of wrapped loot. Then we'll be gone before their tin horns realize, leaving the sort of chaos that makes folks dizzy!" "For my next act, I'll be running rings around a rakeful of hobnail whippersnappers," Bill said dryly. "Well, Moll - looks like another sortie together, eh? Set your clocks-ubiquitous and let's be off to weave our web of holiday! Molly clapped Bill on the back as they stepped out for their mission. "Joking aside, Bill, just this once, thanks for the backup. 
Wouldn't want to miss you and your unique talents!" Bill hunched his shoulders sheepishly. "Wouldn't miss it fer the world, me heartie!"
created_at: "2026-05-27T10:11:37.155Z" (5/27/2026, 10:11:37 AM)
model: "thedrummer/skyfall-36b-v2"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 174
moderation_latency: (null)
generation_time: 10008
tokens_prompt: 77
tokens_completion: 550
native_tokens_prompt: 243
native_tokens_completion: 565
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 240
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.00051365
router: (null)
provider_responses
0
endpoint_id: "1eb01ded-ae11-49e6-8aa6-3067584070bd"
id: "chatcmpl-b7fef4c64d007c4e"
is_byok: false
latency: 174
model_permaslug: "thedrummer/skyfall-36b-v2"
provider_name: "Parasail"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779876697-3DXdtUxKMXJbtHd0AE8p"
session_id: (null)
api_type: "completions"
id: "gen-1779876697-Bhe1UUOQGtSVgRrHoTVU"
upstream_id: "chatcmpl-b7fef4c64d007c4e"
total_cost: 0.00051365
cache_discount: 0.000072
upstream_inference_cost: 0
provider_name: "Parasail"
response_cache_source_id: (null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 362
0.31% | Dialogue to Total Word Ratio | Ratio: 80.76%, Deviation: 10.76% | neededClean: false, wordsTotal: 369, wordsDialogue: 298
0.1533% | Overall
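
The Ratio and Deviation figures reported for the Dialogue to Total Word Ratio evaluator can be reproduced from the wordsDialogue and wordsTotal counts. The sketch below is a minimal illustration, assuming the target is the 70% named in the scenario and that Deviation is the absolute difference between the measured ratio and that target. The function names are hypothetical, and `count_dialogue_words` is only one plausible way to count quoted words; it is not NC Bench's actual counter, and how the 0.31% score is derived from the deviation is not shown on this page.

```python
import re

# Assumption: the target comes from the scenario text ("70% dialogue").
TARGET_RATIO = 0.70


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that belong to dialogue."""
    return words_dialogue / words_total


def deviation(ratio: float, target: float = TARGET_RATIO) -> float:
    """Assumed definition: absolute distance between ratio and target."""
    return abs(ratio - target)


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Hypothetical counter: (words inside straight double quotes, all words).

    Words are whitespace-separated tokens. An unmatched opening quote is
    ignored, so real evaluators may count differently.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    total = len(text.split())
    return dialogue, total


# Counts taken from the evaluation details above.
ratio = dialogue_ratio(298, 369)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation(ratio):.2%}")
```

With the counts from this run, the printed line matches the reported "Ratio: 80.76%, Deviation: 10.76%". The response overshoots the dialogue target by about 11 percentage points.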