Run: 2026-05-27-01KSMEP51QBB2AK3HANP310CKW

Run Details

Duration:
10.1s
Cost:
$0.000514
Provider:
Parasail

Evaluation summary

Score Evaluator
0.00% Matches word count
0.31% Dialogue to Total Word Ratio
0.1533% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly paced back and forth, her emerald eyes sparkling with excitement and a touch of nervousness. "Alright, Bill, we've got everything planned. Perimeter check, photo op, gift grab, and ghost!" Bill looked up from his daguerreotype, his bushy mustache twitching. "You sure about this, Moll? There's bound to be a pack of little ankle-biters running around and squealing." "I'm spot on for this one, Bill. The bigwig's son is turning a whole six! Imagine the haul - costly gifts stacked high. And with all the cradle-ticklers not noticing, we can sweep in, snatch, and go!" Bill scratched his chin. "They might have a nanny guard or such. Won't hurt to make sure." "No need. I've been casing the joint. The only folks are the rug-rats and families. No strapping railbirds on guard or any bog-trotters lolling about." Bill adjusted his monocle. "If you're certain... The bonanza would be prime. But if things unfurl awry, the bigwig's folks ain't to be trifled with. Left a mark on the cell at the cop shop." "Bill, you worry too much! It'll be easy as taking a tickle from a babe. I've got the treads and rigs for us to bluff and scheme past those brats and their mommies. No roadblocks!" Bill nodded reluctantly. "Right enough, Moll. Now what about loot-lining and cuddle-cloaking? Can't have them spotting us and raising a ruckus." Molly broke into a grin. "Bill, you'd be brilliant playing팅 with those totters for a tad! Give 'em a peep here and there while I get our mittens on the mounds of wrapped loot. Then we'll be gone before their tin horns realize, leaving the sort of chaos that makes folks dizzy!" "For my next act, I'll be running rings around a rakeful of hobnail whippersnappers," Bill said dryly. "Well, Moll - looks like another sortie together, eh? Set your clocks-ubiquitous and let's be off to weave our web of holiday! Molly clapped Bill on the back as they stepped out for their mission. "Joking aside, Bill, just this once, thanks for the backup. 
Wouldn't want to miss you and your unique talents!" Bill hunched his shoulders sheepishly. "Wouldn't miss it fer the world, me heartie!"
created_at: "2026-05-27T10:11:37.155Z" 5/27/2026, 10:11:37 AM
model: "thedrummer/skyfall-36b-v2"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 174
moderation_latency: (null)
generation_time: 10008
tokens_prompt: 77
tokens_completion: 550
native_tokens_prompt: 243
native_tokens_completion: 565
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 240
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.00051365
router: (null)
provider_responses:
0:
endpoint_id: "1eb01ded-ae11-49e6-8aa6-3067584070bd"
id: "chatcmpl-b7fef4c64d007c4e"
is_byok: false
latency: 174
model_permaslug: "thedrummer/skyfall-36b-v2"
provider_name: "Parasail"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer: (null)
request_id: "req-1779876697-3DXdtUxKMXJbtHd0AE8p"
session_id: (null)
api_type: "completions"
id: "gen-1779876697-Bhe1UUOQGtSVgRrHoTVU"
upstream_id: "chatcmpl-b7fef4c64d007c4e"
total_cost: 0.00051365
cache_discount: 0.000072
upstream_inference_cost: 0
provider_name: "Parasail"
response_cache_source_id: (null)

Evaluation details

Result Evaluator Details Meta Data
0.00% Matches word count n/a
neededClean: false
words: 362
0.31% Dialogue to Total Word Ratio
Ratio: 80.76%, Deviation: 10.76%
neededClean: false
wordsTotal: 369
wordsDialogue: 298
0.1533%
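
The ratio and deviation reported above can be reproduced from the listed counts. The following is a minimal sketch, assuming that the deviation is the absolute difference between the measured dialogue ratio and the 70% target from the prompt, and that the word-count shortfall is measured against the 500-word target. The function names are hypothetical and are not the evaluator's own code. How the evaluators convert these deviations into the 0.00% and 0.31% scores is not shown on this page, so that step is left out.

```python
# Sketch only: reproduces the reported ratio and deviation from the listed counts.
# Function names are hypothetical; the evaluator's real scoring rule is not shown here.

TARGET_WORDS = 500           # from the prompt: "500 word scene"
TARGET_DIALOGUE_RATIO = 0.70  # from the prompt: "70% words of dialogue"


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that fall inside dialogue."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


def relative_shortfall(words: int, target: int) -> float:
    """Relative distance between the measured word count and the target."""
    return abs(target - words) / target


# Counts copied from the evaluation details.
ratio = dialogue_ratio(words_dialogue=298, words_total=369)
deviation = abs(ratio - TARGET_DIALOGUE_RATIO)
shortfall = relative_shortfall(words=362, target=TARGET_WORDS)

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
print(f"Word count shortfall: {shortfall:.1%}")
```

With the counts above, the first line printed matches the reported "Ratio: 80.76%, Deviation: 10.76%". The two evaluators report different totals (362 and 369 words), so the sketch keeps the counts separate rather than assuming a shared tokenizer.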