NC Bench
Run: 2026-03-09-01KK9PXBPWJ656K6ZVATM3K5GM
Run Details
Model: GPT-5.4 (Reasoning)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 23.6s
Cost: $0.014360
Provider: OpenAI
Evaluation summary
Score | Evaluator
Pass | Match blue prose section
Pass | Match green prose section
Pass | Match red prose section
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
AI
<nc-section data-title="Version 1" data-color="red"> Near the stables, Frederick slid down from Thunder’s back, his boots sinking into the damp straw scattered across the yard. The air smelled of hay, leather, and the sharp warmth of horses after a long ride. Before he could gather his reins, a broad-shouldered knight stepped from the stable doors, his mail glinting softly in the fading light. “You must be Frederick,” the man said, his voice calm and steady. A scar crossed one cheek, but his expression held more kindness than severity. “I am Sir Roland, master of the yard and guardian of every fool squire who thinks a sword is lighter than it truly is.” Frederick straightened at once and bowed his head. “Yes, sir. I have come to begin my training.” Sir Roland took the reins from his hands with practiced ease and gave Thunder an approving glance. “A fine horse, and still standing after carrying you this far. That speaks well of both of you.” Then he turned back toward the stable, motioning Frederick to follow. “Come. The castle will test your courage soon enough. Tonight, you will learn where you sleep, where you eat, and how to keep from being kicked by a warhorse before dawn.” </nc-section> <nc-section data-title="Version 2" data-color="green"> Frederick had scarcely reached the stables when a knight in a forest-green cloak emerged from between the wooden stalls. He moved with the easy confidence of a man long accustomed to armor, though he wore it as lightly as another man might wear a coat. A ring of keys hung from his belt, chiming softly as he approached. “So,” the knight said, resting a gloved hand on the stable gate, “the new squire has finally arrived.” His gray eyes studied Frederick for a moment, sharp but not unkind. “I am Sir Roland. This castle has been expecting you.” Frederick swallowed his nervousness and bowed. “It is an honor, Sir Roland.” Roland’s stern face softened at that. 
“We shall see if you still think so after a week of drills.” He nodded toward the stalls, where horses stamped and snorted in the twilight. “First lesson: in this place, you greet the horses before the lords. They will carry your weight, your weapons, and sometimes your life.” Frederick glanced toward Thunder as a stable boy hurried over. Sir Roland gave him a brief smile. “Come along, lad. There is much to learn, and the castle wastes no time on latecomers.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> The stable yard was alive with evening sounds: hooves striking wood, harness buckles clinking, and the low murmur of servants finishing the day’s work. Frederick had barely handed off his saddlebag when a tall knight stepped out from the shadow of the stable arch. His blue surcoat bore the silver crest of the castle, and the lantern light caught along the edge of his sword belt. “Frederick,” he said, as though he had known the boy would arrive at that very moment. Frederick turned quickly. “Yes, sir?” The knight inclined his head. “Sir Roland. I oversee the squires, the stables, and, when necessary, the trouble that follows both.” There was a dry humor in his tone that eased some of Frederick’s tension. “You’ve had a long ride. Still, a castle judges a man from the moment he enters its gates.” Frederick tried to stand taller despite the ache in his legs. Sir Roland noticed and gave a faint nod of approval. “Good. You listen already. See that you keep that habit.” He gestured toward the open stable doors, where warm golden light spilled over the straw. “Bring your things. I will show you your place here, and tomorrow we shall learn whether you are merely a boy with a horse—or the making of something more.” </nc-section>
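A response in this shape can be split back into its individual sections with a small parser. The following is a minimal sketch, not part of NC Bench or Novelcrafter: the helper name `split_sections` is hypothetical, and it assumes the attributes always appear in the order shown in the system prompt's example (`data-title` first, then `data-color`).

```python
import re

# Hypothetical helper for illustration only.
# Assumes data-title precedes data-color, as in the system prompt example.
SECTION_RE = re.compile(
    r'<nc-section data-title="([^"]*)" data-color="([^"]*)">(.*?)</nc-section>',
    re.DOTALL,  # section bodies may span several lines
)


def split_sections(response: str) -> list[dict[str, str]]:
    """Return one dict per <nc-section> block, in document order."""
    return [
        {"title": title, "color": color, "body": body.strip()}
        for title, color, body in SECTION_RE.findall(response)
    ]


# Shortened stand-in for a model response.
sample = (
    '<nc-section data-title="Version 1" data-color="red"> Near the stables. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> A knight emerged. </nc-section>'
)
sections = split_sections(sample)
for section in sections:
    print(section["color"], "->", section["title"])
```

A regular expression is enough here because the tool format is flat and fixed; a response that nests tags or reorders attributes would need a real HTML parser instead.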
created_at
"2026-03-09T16:29:32.262Z"
3/9/2026, 4:29:32 PM
model
"openai/gpt-5.4-20260305"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3402
moderation_latency
(null)
generation_time
23559
tokens_prompt
318
tokens_completion
1105
native_tokens_prompt
302
native_tokens_completion
907
native_tokens_completion_images
0
native_tokens_reasoning
65
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.01436
router
(null)
provider_responses
0
endpoint_id
"9ff5625c-403f-4d7f-b895-58ac7295062c"
id
"resp_064037935b78c7730169aef56c56d88195b91b3f46e8702468"
is_byok
false
latency
190
model_permaslug
"openai/gpt-5.4-20260305"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773073772-xsmkkO7xd9s0WPaqEZCI"
upstream_id
"resp_064037935b78c7730169aef56c56d88195b91b3f46e8702468"
total_cost
0.01436
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
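The timing and token fields above can be combined into a rough throughput figure. This is a sketch under stated assumptions: `generation_time` is taken to be in milliseconds (consistent with the reported duration of 23.6s), and the metadata does not say whether `native_tokens_completion` includes the 65 reasoning tokens, so the result is only an approximation.

```python
# Values copied from the run metadata above.
generation_time_ms = 23559       # assumed to be milliseconds
native_tokens_completion = 907


def tokens_per_second(tokens: int, elapsed_ms: int) -> float:
    """Average completion throughput over the whole generation."""
    return tokens / (elapsed_ms / 1000)


duration_s = round(generation_time_ms / 1000, 1)
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"duration: {duration_s}s, throughput: {throughput:.1f} tokens/s")
```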
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
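The three evaluators are regular expressions applied to the model response. The sketch below shows one way such checks could be run. It is an illustration, not the benchmark's actual harness: `run_evaluators` is a hypothetical name, and the patterns are the ones listed above with the JavaScript-style `\/` escape written as a plain `/`. Note that `[^<]+` requires a non-empty body with no `<` character, so any nested markup inside a section would make that evaluator fail.

```python
import re

# Patterns copied from the evaluation details; "\/" is written as "/" for Python.
EVALUATORS = {
    "Match blue prose section": r'<nc-section data-title="[^"]*" data-color="blue">[^<]+</nc-section>',
    "Match green prose section": r'<nc-section data-title="[^"]*" data-color="green">[^<]+</nc-section>',
    "Match red prose section": r'<nc-section data-title="[^"]*" data-color="red">[^<]+</nc-section>',
}


def run_evaluators(response: str) -> dict[str, bool]:
    """Hypothetical runner: True where the pattern is found anywhere in the response."""
    return {
        name: re.search(pattern, response) is not None
        for name, pattern in EVALUATORS.items()
    }


# Shortened stand-in for a passing response.
passing = (
    '<nc-section data-title="Version 1" data-color="red">Red text.</nc-section>'
    '<nc-section data-title="Version 2" data-color="green">Green text.</nc-section>'
    '<nc-section data-title="Version 3" data-color="blue">Blue text.</nc-section>'
)
# Nested markup inside the body breaks the [^<]+ requirement.
nested = '<nc-section data-title="Version 3" data-color="blue">Some <b>bold</b> text.</nc-section>'

print(run_evaluators(passing))
print(run_evaluators(nested))
```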