NC Bench
Run: 2026-04-24-01KQ0MXPQWCEEY41YN6SR15T72
Run Details
Model: GPT-5.5
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 18.6s
Cost: $0.024460
Provider: OpenAI
Evaluation summary
Score | Evaluator
Pass | Match blue prose section
Pass | Match green prose section
Pass | Match red prose section
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> As Thunder slowed to a restless trot near the castle stables, Frederick drew in a breath that tasted of hay, rain-wet stone, and woodsmoke. Stable boys hurried to and fro beneath the lanterns, their shadows leaping across the courtyard like startled sprites. A tall knight stood waiting beside the stable doors, one hand resting upon the pommel of a plain but well-kept sword. His armor bore the marks of long service rather than ceremony, and a faded blue cloak hung from his shoulders. “You ride boldly for one so newly arrived,” the knight said, his voice deep but not unkind. Frederick swung down from the saddle and bowed as best he could with trembling legs. “Sir Frederick—no, forgive me. Frederick, my lord. I have come to begin my training.” The knight’s stern face softened into the faintest smile. “I am Sir Roland. Around here, titles are earned in mud before they are spoken in halls.” He stepped forward and took Thunder’s reins with practiced ease. “Welcome to the castle, lad. Your first lesson begins now: care for your horse before you care for yourself.” </nc-section> <nc-section data-title="Version 2" data-color="green"> The castle stables lay just beyond the eastern arch, warm with lamplight and the restless sounds of horses settling for the night. Frederick guided Thunder beneath the lintel, trying not to stare at the rows of polished saddles, shields, and spears arranged along the walls. Before he could call for a groom, a broad-shouldered man emerged from the shadows, carrying a brush in one hand and a bucket in the other. He wore no helm, and his dark hair was threaded with silver, but the sword at his hip and the lion crest upon his breast left no doubt of his station. “You must be the new squire,” the man said. Frederick straightened at once. “Yes, sir.” “Sir Roland will do.” The knight’s eyes flicked over Frederick, then to Thunder. “Fine horse. Proud, too. 
That can be a blessing or a curse.” Frederick swallowed. “I hope to prove worthy of him.” Sir Roland gave a quiet chuckle. “Then start by proving worthy of the stable. A knight who cannot muck straw has no business bearing steel.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> Frederick had imagined his arrival would be met by trumpets, perhaps even the captain of the guard. Instead, near the stables, he found a single knight leaning against a fence post, feeding an apple to an old gray mare. The knight looked up as Thunder approached. His face was weathered, his beard close-trimmed, and a thin scar cut from his left brow to his cheek. Yet his eyes were bright and sharp, missing nothing. “So,” he said, “this is the lad they sent us.” Frederick dismounted quickly, nearly catching his boot in the stirrup. Heat rushed to his face, but he forced himself into a bow. “Frederick, son of Alden, reporting for service.” The knight nodded once. “Sir Roland of Westmere. I was told you had spirit.” Frederick glanced at Thunder, who tossed his head as if in agreement. “I will work hard, sir.” “That is easily promised.” Sir Roland tossed the apple core aside and gestured toward the stable doors. “Come then. Let us see how you handle a currycomb, a pitchfork, and the first disappointment of every would-be hero.” </nc-section>
created_at
"2026-04-24T21:04:23.811Z"
4/24/2026, 9:04:23 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
772
moderation_latency
(null)
generation_time
18461
tokens_prompt
318
tokens_completion
892
native_tokens_prompt
302
native_tokens_completion
765
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.02446
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_0cac9a41731401c30169ebdad7e60881968a4ecab7b1de66bc"
is_byok
false
latency
325
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777064663-dDFxcVswf7CSGkRYdfFJ"
session_id
(null)
api_type
"completions"
id
"gen-1777064663-5JvXps8ilZdswL1WMQQ0"
upstream_id
"resp_0cac9a41731401c30169ebdad7e60881968a4ecab7b1de66bc"
total_cost
0.02446
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
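The three evaluators above are regular expressions that differ only in the color they look for. The following is a minimal sketch of how such checks could be reproduced locally, assuming each evaluator passes when its pattern is found anywhere in the model response. The helper names `section_pattern` and `evaluate` and the shortened sample text are hypothetical and not taken from NC Bench itself.

```python
import re


def section_pattern(color: str) -> re.Pattern:
    """Build the per-color pattern shown in the evaluation details.

    Same expression as the listed evaluators, written without the
    JavaScript-style slash delimiters.
    """
    return re.compile(
        r'<nc-section data-title="[^"]*" data-color="'
        + re.escape(color)
        + r'">[^<]+</nc-section>'
    )


def evaluate(response: str, colors=("blue", "green", "red")) -> dict:
    """Return a pass/fail flag per color (assumed semantics: search, not full match)."""
    return {color: bool(section_pattern(color).search(response)) for color in colors}


# Shortened stand-in for the model response, not the full prose from the run.
sample = (
    '<nc-section data-title="Version 1" data-color="red"> A tall knight stood waiting. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> The stables lay beyond the arch. </nc-section> '
    '<nc-section data-title="Version 3" data-color="blue"> Frederick dismounted quickly. </nc-section>'
)

print(evaluate(sample))
```

Because the body is matched with `[^<]+`, a section fails if it is empty or contains any `<` character (for example nested markup), and the attributes must appear in the order `data-title`, then `data-color`, with double quotes.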