NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-27-01KSMF3XF5VRVF3RNZESPCM2PH
Run Details
Model:
Skyfall 36B V2
Test:
Text Replacement
Scenario:
POV shift: 3rd person to 1st person (Elena's perspective)
Duration:
10.9s
Cost:
$0.000766
Provider:
Parasail
Evaluation summary
Score
Evaluator
Details Link
11.11%
Dialogue and other-character pronouns preserved
(details)
8.00%
POV transformation accuracy
(details)
100.00%
No remaining protagonist name in narration
(details)
39.7037%
Overall
Model input
System
You are an expert prose editor. Whenever you're given text, rephrase it using the following instructions: <instructions>Rewrite from third person to first person, from Elena's perspective.</instructions> Imitate and keep the current writing style, and leave mannerisms, word choice and sentence structure intact. Keep the same stylistic writing choices. Use English spelling and grammar. Only return the rephrased text, nothing else.
User/Human
Text to rewrite: <text words="381"> The market square buzzed with the sound of a hundred voices. Elena pushed through the crowd, her satchel clutched against her chest. She knew the merchant would not wait — he never waited for anyone, least of all a girl from the outer ring. "You're late," Gregor said, without looking up from his ledger. He ran a thick finger down a column of numbers and frowned. Elena set the package on the counter. "The bridge was out. I had to take the long way around." "Not my problem." He unwrapped the cloth and held the crystal up to the light. It caught the sun and threw tiny rainbows across the stall's canvas roof. Gregor's frown deepened. "This isn't what I ordered." She had expected this. The old man always found something to complain about, whether the goods were perfect or not. Elena crossed her arms and stood her ground. "It's exactly what you ordered," she said. "Blue quartz, uncut, from the northern mines. I watched them pull it from the rock myself." Gregor turned the crystal slowly, examining every facet. He brought it close to his eye, then held it at arm's length. Elena thought he looked like a jeweler appraising a diamond, though Gregor dealt in stranger things than jewelry. He set the crystal down and sighed. "Fine. But next time, wrap it properly. The edges are exposed." He reached beneath the counter and produced a small leather pouch. The coins inside clinked as he slid it toward her. Elena scooped up the pouch and weighed it in her palm. It felt light. She loosened the drawstring and counted: eight silver crowns. They had agreed on ten. "We agreed on ten," she said flatly. "That was before the bridge fee went up." Gregor smiled — a thin, practiced smile that never reached his eyes. "Take it or leave it." She took it. She always took it. The outer ring didn't offer many choices, and Gregor knew that as well as she did. Elena walked out of the market and into the narrow lanes beyond. The sun hung low over the rooftops, and the shadows stretched long across the cobblestones. She slipped the pouch into her satchel and quickened her pace. If she hurried, she would reach the bridge before dark. </text>
Model response
Messages
Meta
AI
I push through the crowd, my satchel clutched tightly against my chest as I weave through the bustling market square. A hundred voices buzz and murmur around me, but I barely hear them. My mind isfixed on getting to Gregor's stall in time. He won't wait, not for anyone, especially not for a girl from the outer ring. "You're late," Gregor says as I approach, not bothering to glance up from his ledger. A thick finger runs down a column of numbers, and his brow furrows. I set the package on the counter. "The bridge was out. I had to take the long way around." "Should have planned better," he grunts. Gregor unwraps the cloth and holds the crystal up, catching the sunlight. It sparkles and sends little rainbows dancing across the canvas roof of his stall. He turns it over, examining each facet. Double-checking, like I knew he would. "What's the issue?" I ask. I know there's going to be one, even though I made sure to follow his instructions to a T. "This isn't what I ordered," he says flatly. "It is," I insist, not backing down, "Blue quartz, uncut, straight from the northern mines. I watched them pull it from the rock myself." He twists the crystal back and forth, holding it up to his eye before stretching it out at arm's length. He looks like a jeweler appraising a diamond, which he isn't, lest anyone forget. Gregor's in plenty of other things. Sighing, he sets it down. "Fine. But next time, wrap it better. The edges need to be covered." He reaches under the counter, pulls out a small leather pouch, and slides it my way. The coins inside jingle as it lands. I scoop it up and weigh it in my palm. Feels light. Loosening the drawstring, I count: eight silver crowns. We'd agreed on ten. "We agreed on ten," I point out. "That was before the bridge fee went up." Gregor smiles — a thin, practiced thing that never touches his eyes. "Take it or leave it." I take it, like always. The outer ring doesn't offer much in the way of choices, and Gregor knows that as well as I do. I walk out of the market and into the narrow lanes beyond, leaving the cheerful chatter of the square behind. The sun hangs low over the rooftops, and the shadows stretch long across the cobblestones. I slip the pouch into my satchel and pick up my pace. If I hurry, I should reach the bridge before dark.
created_at
"2026-05-27T10:19:08.145Z"
5/27/2026, 10:19:08 AM
model
"thedrummer/skyfall-36b-v2"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
242
moderation_latency
(null)
generation_time
10773
tokens_prompt
685
tokens_completion
599
native_tokens_prompt
591
native_tokens_completion
557
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
16
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00076585
router
(null)
provider_responses
0
endpoint_id
"1eb01ded-ae11-49e6-8aa6-3067584070bd"
id
"chatcmpl-b2627420f56548e5"
is_byok
false
latency
242
model_permaslug
"thedrummer/skyfall-36b-v2"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779877148-hLr71uHlfWWZoxBQkS7h"
session_id
(null)
api_type
"completions"
id
"gen-1779877148-hCNypbUU2dVf38s0xxuA"
upstream_id
"chatcmpl-b2627420f56548e5"
total_cost
0.00076585
cache_discount
0.0000048
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
11.11%
Dialogue and other-character pronouns preserved
2/18 anchor points preserved
matched
2
total
18
details
0
id
"dlg-1"
paragraphFound
true
preserved
true
1
id
"dlg-2"
paragraphFound
true
preserved
true
2
id
"dlg-3"
paragraphFound
true
preserved
false
3
id
"dlg-4"
paragraphFound
true
preserved
false
4
id
"dlg-5"
paragraphFound
true
preserved
false
5
id
"dlg-6"
paragraphFound
true
preserved
false
6
id
"dlg-7"
paragraphFound
true
preserved
false
7
id
"dlg-8"
paragraphFound
true
preserved
false
8
id
"dlg-9"
paragraphFound
true
preserved
false
9
id
"dlg-10"
paragraphFound
true
preserved
false
10
id
"greg-1"
paragraphFound
true
preserved
false
11
id
"greg-2"
paragraphFound
true
preserved
false
12
id
"greg-3"
paragraphFound
true
preserved
false
13
id
"greg-4"
paragraphFound
true
preserved
false
14
id
"set-1"
paragraphFound
true
preserved
false
15
id
"set-2"
paragraphFound
true
preserved
false
16
id
"set-3"
paragraphFound
true
preserved
false
17
id
"set-4"
paragraphFound
true
preserved
false
8.00%
POV transformation accuracy
2/25 pronouns/names correctly transformed
neededClean
false
matched
2
total
25
details
0
id
"p01"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
1
id
"p02"
paragraphFound
true
correct
true
matched
"my"
expected
0
"my"
2
id
"p03"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"my"
3
id
"p04"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
4
id
"p05"
paragraphFound
true
correct
true
matched
"I"
expected
0
"I"
5
id
"p06"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
6
id
"p07"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
7
id
"p08"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"my"
8
id
"p09"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"my"
9
id
"p10"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
10
id
"p11"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
11
id
"p12"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"me"
difficulty
"hard"
12
id
"p13"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
13
id
"p14"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"my"
14
id
"p15"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
15
id
"p16"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
16
id
"p17"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
17
id
"p18"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
18
id
"p19"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
difficulty
"hard"
19
id
"p20"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
20
id
"p21"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
21
id
"p22"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"my"
22
id
"p23"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"my"
23
id
"p24"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
24
id
"p25"
paragraphFound
true
correct
false
originalPresent
false
expected
0
"I"
nameMatched
1
nameTotal
6
pronounMatched
1
pronounTotal
19
100.00%
No remaining protagonist name in narration
No protagonist name remaining
remaining
(empty)
39.7037%