NC Bench
Run: 2026-05-27-01KSMEP1XDMNMEVG2FN898Y24R
Run Details
Model:
Skyfall 36B V2
Test:
Language Writing
Scenario:
Character dialogue (Italian) in a story
Duration:
5.6s
Cost:
$0.000382
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account:
<character name="Isabella">
Isabella is a 30-year old college student on a study abroad program. Born in Italy, she is fluent in Italian and does not yet know any English. She enjoys landscape photography and is always seen with her camera.
</character>
<character name="Hunter">
Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Italian, not even basic phrases.
</character>
<location name="Alborn, MN">
Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival.
</location>
User/Human
Write the following scene: Isabella is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Italian, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave.
Return the dialogue between Isabella and Hunter as a voice sheet:
Isabella: "..."
Hunter: "..."
Model response
Messages
Meta
AI
```markdown
**Voice Sheet: Dialogue Between Isabella and Hunter**

**Isabella:** "...Aiutami, ti prego! (Help me, please!)" Her voice was trembling, filled with anxiety and fear. She repeatedly pulls away from her camera, as if it held some dark secret.

**Hunter:** "...." Hunter's brow furrowed, not comprehending a single word. He reached for the chair across from Isabella and settled into it. "É momento di andare. (It's time to go.)" Hunter says in a firm, authoritative voice, fingers twitching nervously on the camera showing an alarming event. "I have to head over to thesteller Inn. A new lead came in, something big. You coming?" Hunter offered Isabella a ride, but she shook her head. Her eyes shone with fear and sadness. He sighed, seeing no point in loitering if she wouldn't go with him. Time was of the essence and this outbreak certainly wasn't going to investigate itself. So with a final nod, Detective Hunter fetched his car keys. "Trent tanta, mi dispiace." That's all he could manage. He hoped that she understood that he was sorry he couldn't help, but he had an important case to solve. And walk away he did, flashing his badge at the cafe cashier and paying for Isabella's drink with his own money. He wasn't a stranger to taking a loss these days.
```
created_at
"2026-05-27T10:11:33.945Z"
5/27/2026, 10:11:33 AM
model
"thedrummer/skyfall-36b-v2"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
185
moderation_latency
(null)
generation_time
5496
tokens_prompt
321
tokens_completion
338
native_tokens_prompt
277
native_tokens_completion
299
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
32
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00038195
router
(null)
provider_responses
0
endpoint_id
"1eb01ded-ae11-49e6-8aa6-3067584070bd"
id
"chatcmpl-a099731a9045c1f4"
is_byok
false
latency
185
model_permaslug
"thedrummer/skyfall-36b-v2"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876693-uDhMusmFfoKEQ7zsX1ym"
session_id
(null)
api_type
"completions"
id
"gen-1779876693-qnzVTA8dE8s9TcGh7yac"
upstream_id
"chatcmpl-a099731a9045c1f4"
total_cost
0.00038195
cache_discount
0.0000096
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
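The Meta fields above are enough to derive a rough generation throughput and to check the rounded cost shown under Run Details. The following is a minimal sketch, not part of NC Bench or any provider API: the variable and function names are illustrative, and it assumes generation_time is reported in milliseconds, which the page does not state explicitly (it is consistent with the 5.6s duration).

```python
# Values copied from the Meta fields of this run.
native_tokens_completion = 299
generation_time_ms = 5496  # assumption: the unit is milliseconds
total_cost = 0.00038195


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Return completion throughput in tokens per second."""
    return tokens / (elapsed_ms / 1000)


throughput = tokens_per_second(native_tokens_completion, generation_time_ms)

# Format the cost with six decimal places, as in the Run Details header.
displayed_cost = f"${total_cost:.6f}"

print(f"{throughput:.1f} tokens/s, cost {displayed_cost}")
```

Using the native token count rather than tokens_completion is a choice made for this sketch; either count could be substituted depending on which tokenizer is of interest.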
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Isabella"
dialogue
"...Aiutami, ti prego!"
detectedLang
"it"
heavyLang
"it"
scores
it
0.6951219512195121
pl
0.6355685131195336
sl
0.5941558441558442
ro
0.5819397993311036
sk
0.571917808219178
yo
0.5598591549295775
cs
0.5567375886524822
no
0.5519713261648745
pt
0.47257383966244726
ca
0.41588785046728977
es
0.41588785046728977
lt
0.4075829383886255
sq
0.4075829383886255
hr
0.39903846153846156
lv
0.3811881188118812
reliable
true
passes
true
1
name
"Hunter"
dialogue
"...."
detectedLang
""
heavyLang
""
scores
reliable
false
passes
true
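The evaluation details list one entry per speaker line, with the parenthetical English translation absent from Isabella's dialogue. The page does not show how the "Parse dialogue" evaluator is implemented, so the following is only a sketch of one way such a voice sheet could be parsed. The function name, the regular expressions, and the rule of stripping a trailing parenthetical are all assumptions, and the sample is an abridged excerpt of the response above.

```python
import re

# Assumed line shape: an optionally bold "Name:" label followed by a quoted string.
SPEAKER_LINE = re.compile(r'(?:\*\*)?(\w+):(?:\*\*)?\s*"([^"]*)"')
# Assumed cleanup: drop a trailing parenthetical such as "(Help me, please!)".
TRAILING_PAREN = re.compile(r'\s*\([^)]*\)\s*$')


def parse_voice_sheet(text: str) -> list[tuple[str, str]]:
    """Extract (speaker, dialogue) pairs from labelled, quoted lines."""
    entries = []
    for match in SPEAKER_LINE.finditer(text):
        name, dialogue = match.group(1), match.group(2)
        entries.append((name, TRAILING_PAREN.sub("", dialogue)))
    return entries


# Abridged excerpt of the model response.
sample = '''**Voice Sheet: Dialogue Between Isabella and Hunter**
**Isabella:** "...Aiutami, ti prego! (Help me, please!)" Her voice was trembling.
**Hunter:** "...." Hunter's brow furrowed.
"I have to head over." Hunter offered Isabella a ride.'''

parsed = parse_voice_sheet(sample)
for name, dialogue in parsed:
    print(name, repr(dialogue))
```

Under these assumptions, quoted speech embedded in narration without a speaker label is ignored, which would explain why only two entries appear in the details even though the response contains more quotations.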