NC Bench
Run: 2026-03-25-01KMJ8TS50EKR1Q338887AHTWS
Run Details
Model: Qwen3 235B A22B Instruct 2507
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 14.9s
Cost: $0.000302
Provider: Parasail
Evaluation summary
Score | Evaluator
Pass | Match blue prose section
Pass | Match green prose section
Pass | Match red prose section
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
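The `<nc-section>` tool described in the system prompt can be read back out of a model response with a small parser. The following is a minimal sketch, not Novelcrafter's actual implementation: the names `Section`, `SECTION_RE`, `ALLOWED_COLORS`, and `parse_sections` are hypothetical, and the sketch assumes the attributes always appear in the order shown in the prompt (`data-title`, then `data-color`).

```python
import re
from typing import List, NamedTuple

# Colors listed in the system prompt's tool description.
ALLOWED_COLORS = {"red", "green", "blue", "yellow", "purple", "orange", "pink", "brown"}


class Section(NamedTuple):
    """One parsed <nc-section> block (hypothetical helper type)."""
    title: str
    color: str
    body: str


# Assumption: data-title comes before data-color, as in the prompt's example usage.
SECTION_RE = re.compile(
    r'<nc-section\s+data-title="([^"]*)"\s+data-color="([^"]*)"\s*>(.*?)</nc-section>',
    re.DOTALL,
)


def parse_sections(text: str) -> List[Section]:
    """Extract every <nc-section> block and reject colors the prompt does not allow."""
    sections = []
    for title, color, body in SECTION_RE.findall(text):
        if color not in ALLOWED_COLORS:
            raise ValueError(f"unsupported data-color: {color!r}")
        sections.append(Section(title, color, body.strip()))
    return sections


# The example usage from the system prompt, verbatim.
example = (
    '<nc-section data-title="Chapter 1: The Arrival" data-color="blue"> '
    "Frederick, a young squire, arrives at the castle to begin his training. "
    "</nc-section>"
)
parsed = parse_sections(example)
```

Here `parsed` holds a single `Section` whose `color` is `"blue"`. A request for three alternate versions, as in this scenario, would be expected to yield three sections, one per requested color.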
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: A Stern Welcome" data-color="red"> As Frederick dismounted, a broad-shouldered knight in dented armor stepped from the shadow of the stable doorway. Sir Roland’s voice was like gravel beneath a boot. "So, another pup sent to bark at dragons, eh?" He eyed Frederick up and down, unimpressed. "I’ve seen roasted turnips with more spine. Drop your gear, clean the stables by moonlight, and we’ll see if you’ve got the grit to last till morning." Frederick swallowed hard, gripping his satchel tighter. This wasn’t the welcome he’d dreamed of—but perhaps it was the one he needed. </nc-section> <nc-section data-title="Version 2: A Father's Legacy" data-color="green"> The knight who emerged from the stables wore a weathered cloak embroidered with the sigil of the Silver Flame. "Frederick," he said, voice soft but firm. "I knew your father. Brave man. Died with a sword in his hand and a dragon’s shadow over him." Frederick froze. No one had spoken of his father in years. Sir Roland placed a gauntleted hand on his shoulder. "You’ve his eyes. And perhaps, if fortune favors us, his heart. Come. The stables need mucking, but after that—we’ll talk of what it means to serve a kingdom on the edge of legend." </nc-section> <nc-section data-title="Version 3: The Spark of Camaraderie" data-color="blue"> Before Frederick could tether Thunder, a hearty laugh rang out. "By the stars, you must be the youngest squire I’ve ever seen!" Sir Roland bounded over, not in full armor but in a tunic splattered with hay. He clapped Frederick on the back hard enough to rattle his teeth. "Come on then, lad! Stables first—duty calls louder than hunger. But don’t worry, once the chores are done, I’ll introduce you to the others. There’s a game of dice after supper, and I’ve got a feeling you’ll be lucky." </nc-section>
created_at: "2026-03-25T10:32:19.364Z" (3/25/2026, 10:32:19 AM)
model: "qwen/qwen3-235b-a22b-07-25"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 289
moderation_latency: (null)
generation_time: 14823
tokens_prompt: 318
tokens_completion: 483
native_tokens_prompt: 306
native_tokens_completion: 452
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.0003018
router: (null)
provider_responses:
  0:
    endpoint_id: "043eb28f-fc28-4b0d-9800-b249fcfcfbf9"
    id: "chatcmpl-9ea2ad1dbe7f5375"
    is_byok: false
    latency: 289
    model_permaslug: "qwen/qwen3-235b-a22b-07-25"
    provider_name: "Parasail"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1774434739-ixrJk18yYObH8rbHn8L7"
upstream_id: "chatcmpl-9ea2ad1dbe7f5375"
total_cost: 0.0003018
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Parasail"
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
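The three evaluator patterns above can be reproduced outside the benchmark to see why this response passes. The sketch below is an illustration under assumptions, not NC Bench's own evaluator code: the patterns are translated from JavaScript literal syntax to Python (the `\/` escape becomes a plain `/`), the helper names `section_pattern` and `evaluate` are hypothetical, and the sample bodies are shortened to the first sentence of each section in the model response.

```python
import re
from typing import Dict

# Colors checked by the evaluators in this run.
COLORS = ("blue", "green", "red")


def section_pattern(color: str) -> "re.Pattern[str]":
    """Build the Python equivalent of the evaluator regex for one color."""
    return re.compile(
        r'<nc-section data-title="[^"]*" data-color="'
        + re.escape(color)
        + r'">[^<]+</nc-section>'
    )


def evaluate(response: str) -> Dict[str, bool]:
    """Return pass/fail per color; search() mirrors an unanchored regex test."""
    return {color: bool(section_pattern(color).search(response)) for color in COLORS}


# Shortened excerpt of the model response (first sentence of each section only).
sample = (
    '<nc-section data-title="Version 1: A Stern Welcome" data-color="red"> '
    "As Frederick dismounted, a broad-shouldered knight in dented armor stepped "
    "from the shadow of the stable doorway. </nc-section> "
    '<nc-section data-title="Version 2: A Father\'s Legacy" data-color="green"> '
    "The knight who emerged from the stables wore a weathered cloak embroidered "
    "with the sigil of the Silver Flame. </nc-section> "
    '<nc-section data-title="Version 3: The Spark of Camaraderie" data-color="blue"> '
    "Before Frederick could tether Thunder, a hearty laugh rang out. </nc-section>"
)
results = evaluate(sample)
```

Note that the patterns are strict: the body must be non-empty and contain no `<` character (`[^<]+`), and the attributes must appear in exactly the order `data-title`, `data-color` with single spaces. A response that nests markup inside a section or swaps the attribute order would fail even if the prose itself were fine.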