NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0NG0AWXJK8G7VA9B9EVJ4H
Run Details
Model:
GPT-5.5 (Reasoning, Low)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
18.7s
Cost:
$0.024430
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> Thunder slowed to a proud trot as I neared the stables, his iron-shod hooves striking sparks from the cobbles. The air smelled of hay, leather, and the smoke of evening hearths. Before I could dismount, a tall knight in a weathered surcoat stepped from beneath the stable archway, one hand resting lightly upon the pommel of his sword. “You must be Frederick,” he said, his voice deep but not unkind. I swung down from the saddle and bowed as best I could, though my legs still trembled from the long ride. “Yes, sir.” The knight’s stern face softened by the smallest degree. “I am Sir Roland. The castle has been expecting you.” His eyes moved from my dusty boots to the reins clenched in my hands. “A squire’s training begins before dawn, but first he learns to care for his horse. See Thunder fed and brushed, then report to the yard.” I swallowed, nodded, and led my stallion into the warm golden light of the stables, feeling that my true arrival had only just begun. </nc-section> <nc-section data-title="Version 2" data-color="green"> By the time I reached the stables, the last light of the sun had turned the castle stones the color of old gold. Stable boys hurried past with buckets and brushes, but they stopped whispering when a knight in a dark green cloak strode toward me. He was broad-shouldered, with silver at his temples and a scar cutting across one eyebrow. Though he wore no helm, there was something battle-ready in the way he stood. “Frederick of Ashford?” he asked. I straightened in the saddle. “Aye, my lord.” “Sir will do,” he replied. “I am Roland, sworn knight of this keep.” He reached up and took Thunder’s bridle with practiced ease. The stallion, who had tossed his head at every guard on the road, grew still beneath his hand. Sir Roland studied me for a moment. “You have come far. Good. A squire who tires easily is of no use to anyone.” I felt my cheeks warm, unsure whether I had been praised or warned. “Come,” he said, turning toward the stable doors. “Let us see whether you know the difference between courage and foolishness.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> The castle stables stood beneath a slate roof veined with moss, their open doors spilling lamplight into the yard. I guided Thunder toward them, grateful for the promise of rest, when a voice called out from the shadows. “Easy there, lad. No need to ride down the grooms on your first evening.” I pulled sharply on the reins as a knight emerged beside a stack of hay bales. He was not dressed as I had imagined knights to be—no gleaming armor, no shining helm—but in a plain tunic belted at the waist, with a sword at his side and mud on his boots. “Forgive me,” I said, sliding from the saddle. He gave a short laugh. “If every eager squire were punished for haste, the castle would have no squires left.” Then he offered his hand. “Sir Roland.” I stared for a breath before taking it. His grip was firm enough to make my knuckles ache. “Frederick,” I managed. “So I was told.” His gaze flicked to Thunder, then back to me. “A fine horse. Let us hope the rider proves half as steady.” </nc-section>
created_at
"2026-04-24T21:14:23.459Z"
4/24/2026, 9:14:23 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1186
moderation_latency
(null)
generation_time
18527
tokens_prompt
318
tokens_completion
868
native_tokens_prompt
302
native_tokens_completion
764
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.02443
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_099d5fcc4baf67bd0169ebdd2f8be08190a514fb0eca17ba14"
is_byok
false
latency
415
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777065263-h0iyCwlS8VK3znWL89zB"
session_id
(null)
api_type
"completions"
id
"gen-1777065263-uI3jHZVRXCRESRLhaqUA"
upstream_id
"resp_099d5fcc4baf67bd0169ebdd2f8be08190a514fb0eca17ba14"
total_cost
0.02443
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval