NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-27-01KSMENETNA997F9XQQQAM65EK
Run Details
Model:
Skyfall 36B V2
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
9.3s
Cost:
$0.000417
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced back and forth in their cramped apartment, eyeing the local map sprawled out on the kitchen table. Bill lounged on the couch, pen behind his ear and feet propped up, devouring a leftover slice of birthday cake they'd picked up from outside the wealthy neighborhood. "Is the wall climb still a go?" Molly asked, frowning. "I think so," Bill said, between mouthfuls. "It's the sunset crawl from Main down to Bridger and Pine, probably past the neighbors line of sight 90% of the time and into that fancy birch grove out front. It's way better than face time with highway anthem security." "It's not a crime if he beheldes us!" Molly said. "Alright, alright... But scores better than this anyway?" It was settled. Come dusk, they'd scale a wall in a quiet section of the street, avoid some secondary cameras, glide in to that Welcome Party, snag all the fancy stuff, and be out the tree line — ideally without getting spotted. "It's a babe in a wicker basket," Bill said, smearing the last bit of cake with a sullen, eager look. "No presentation box, and no royal guards either! We'll be in and out in seven minutes, tops; slicing for you to recover before midnight." "But reconsider my distance yet," Molly said. "It's utterly ghastly! Consign everybody an invoice for that augury outfit, reclaiming all those kilts. There's no way I can recover from seven hundred degrees at that hour. Seven minutes after sunsets isn't gonna do it for me!" "Molly, seriously, coming up with this round?" Bill narrowed his eyes at her, then flopped backward on the couch again. "We ain't ghosts, man! We ain't gonna almost crap ourselves, what fun is it," she snapped, then shook her head. "Trust," Bill said, before slipping deeper into the couch, "scooping that birthday piggy bank is the talk." She checked their watch. One hour to deployment.
created_at
"2026-05-27T10:11:14.401Z"
5/27/2026, 10:11:14 AM
model
"thedrummer/skyfall-36b-v2"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
187
moderation_latency
(null)
generation_time
9171
tokens_prompt
77
tokens_completion
479
native_tokens_prompt
243
native_tokens_completion
444
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
240
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00041685
router
(null)
provider_responses
0
endpoint_id
"1eb01ded-ae11-49e6-8aa6-3067584070bd"
id
"chatcmpl-b08ac68ea12baf43"
is_byok
false
latency
187
model_permaslug
"thedrummer/skyfall-36b-v2"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876674-TTbtlhQdkUqxY8rAnwig"
session_id
(null)
api_type
"completions"
id
"gen-1779876674-5uzhll8dknZI2iu8Xaw1"
upstream_id
"chatcmpl-b08ac68ea12baf43"
total_cost
0.00041685
cache_discount
0.000072
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
317
0.00%
Dialogue to Total Word Ratio
Ratio: 53.63%, Deviation: 23.63%
neededClean
false
wordsTotal
317
wordsDialogue
170
0.0000%