NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0PKW9M76C8ZBPCK6PR3GKG
Run Details
Model:
GPT-5.5
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
20.1s
Cost:
$0.027040
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> Near the stables, where the scent of hay and leather mingled with the sharper tang of horses, a broad-shouldered knight stood waiting beside the open doors. “You must be Frederick,” he said, his voice deep and steady. Frederick slid down from Thunder’s saddle and bowed quickly, hoping he looked less travel-worn than he felt. “Yes, sir.” The knight smiled beneath a neatly trimmed beard. His armor was plain but well kept, and a blue cloak hung from one shoulder, fastened with a silver pin shaped like a hawk. “I am Sir Roland,” he said. “Captain of the castle guard and the man unfortunate enough to be charged with your training.” Frederick swallowed. “I will do my best, Sir Roland.” “I should hope so.” The knight’s stern expression softened. “The castle has little use for lazy squires. But it has great need of brave hearts.” He took Thunder’s reins and handed them to a stable boy, then gestured toward the courtyard. “Come, Frederick. Let us see whether your courage survived the road.” </nc-section> <nc-section data-title="Version 2" data-color="green"> Frederick guided Thunder toward the stables, where lanterns already glowed beneath the wooden eaves. Stable hands hurried to and fro, carrying buckets, brushing down mounts, and calling softly to restless horses. A man in mail stood at the center of the bustle as if the whole yard moved around him. He was tall, with silver beginning to streak his dark hair, and a long sword rested at his hip. “Easy there,” he said, stepping forward as Thunder tossed his head. “That beast has spirit.” Frederick dismounted and bowed. “He is called Thunder, my lord.” The knight chuckled. “A fitting name. And you are Frederick, unless another dusty lad has come seeking a place among us.” “I am Frederick.” “Then welcome.” The knight offered a gloved hand. “Sir Roland of Ashford. I keep order here, teach swordplay when patience allows, and shout at boys until they become useful.” Frederick shook his hand, surprised by the strength in the knight’s grip. Sir Roland’s eyes studied him closely. “You have arrived at an interesting time, lad. The castle sleeps lightly these nights.” Frederick glanced toward the shadowed towers. “Why?” The knight released his hand and looked toward the darkening hills. “That,” said Sir Roland, “is a tale best told after supper.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> By the time Frederick reached the stables, twilight had gathered in the corners of the courtyard. The horses stamped and snorted in their stalls, and somewhere beyond the wall a raven gave a harsh cry. Frederick had just begun to loosen Thunder’s saddle when a voice behind him said, “Not that strap first.” He turned quickly. A knight stood in the stable doorway, his face half-lit by a hanging lantern. His cloak was travel-stained, his boots muddy, and a narrow scar ran from his brow to his cheek. “If you pull that one first,” the knight continued, “the saddle slips, the horse kicks, and you begin your first day with a broken rib.” Frederick flushed and moved his hand away. “Forgive me, sir. I did not know.” “That is why you are here.” The knight stepped inside and reached up to pat Thunder’s neck. The stallion, to Frederick’s surprise, calmed at once. “A squire is not expected to know everything. Only to learn quickly.” Frederick bowed. “I am Frederick.” “And I am Sir Roland.” The knight’s gaze was sharp, but not unkind. “Your master, if you prove worthy. Your nightmare, if you prove foolish.” Frederick straightened. “I will try not to be foolish, sir.” Sir Roland gave the smallest hint of a smile. “A wise beginning.” </nc-section>
created_at
"2026-04-24T21:33:58.974Z"
4/24/2026, 9:33:58 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
849
moderation_latency
(null)
generation_time
19830
tokens_prompt
318
tokens_completion
1034
native_tokens_prompt
302
native_tokens_completion
851
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.02704
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_00e0e5b06ff4d99d0169ebe1c7116c81908581d4e29214a465"
is_byok
false
latency
277
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777066438-sfRDxZdITsX84PTWLzZd"
session_id
(null)
api_type
"completions"
id
"gen-1777066438-isnPSocFQko54i0Y4PCk"
upstream_id
"resp_00e0e5b06ff4d99d0169ebe1c7116c81908581d4e29214a465"
total_cost
0.02704
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval