NC Bench
Run: 2026-04-24-01KQ0NR69Q4ZBVK64RF2JAR7SN
Run Details
Model: GPT-5.5
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 18.3s
Cost: $0.023860
Provider: OpenAI
Evaluation summary
Score | Evaluator | Details Link
Pass | Match blue prose section | (details)
Pass | Match green prose section | (details)
Pass | Match red prose section | (details)
no eval | Overall |
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> As Thunder slowed to a proud trot near the castle stables, Frederick drew in a breath of hay, leather, and evening smoke. Stable boys paused in their work to stare at the newcomer, and somewhere beyond the yard a blacksmith’s hammer rang like a distant bell. A tall knight in a weather-worn cloak stepped from beside the stable doors. His beard was flecked with silver, and the sword at his hip bore a pommel shaped like a lion’s head. “You must be Frederick,” the knight said, his voice calm but carrying the weight of command. “I am Sir Roland, sworn blade of this keep.” Frederick dismounted quickly, nearly tangling his boot in the stirrup before recovering with as much dignity as he could gather. He bowed low. “My lord knight,” he said. “I have come to serve and learn.” Sir Roland studied him for a moment, eyes sharp as winter stars. Then the faintest smile touched his face. “Good. Then your first lesson begins now.” He nodded toward Thunder. “A squire who cannot tend his horse has no business dreaming of glory.” </nc-section> <nc-section data-title="Version 2" data-color="green"> The castle stables lay just within the outer wall, warm with lanternlight and the restless shifting of horses. Frederick guided Thunder beneath the archway, his heart beating faster than the stallion’s hooves upon the cobbles. Before he could call for assistance, a broad-shouldered man emerged from the stable shadows. He wore no helm, but his chainmail glimmered beneath a green surcoat marked by a white stag. “Easy there,” the man said, lifting one hand. Thunder snorted, but settled at once, as though he recognized authority when he saw it. Frederick swung down from the saddle and bowed. “Sir, I am Frederick of Ashcombe. I was told to report for training.” “Aye,” said the knight. “And I am Sir Roland. I’ve been expecting you.” There was kindness in his expression, though it was buried beneath years of battles and hard roads. 
“You arrived before nightfall,” Sir Roland continued. “That speaks well of you. Whether you remain welcome here will depend on what you do after sunrise.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> By the time Frederick reached the stables, dusk had deepened into blue shadow. Lanterns swayed from iron hooks, painting the courtyard in gold. Thunder stamped and tossed his mane, impatient after the long road. A voice called from nearby. “That is a fine horse for a young squire.” Frederick turned to see a knight leaning against the stable rail, arms folded across his chest. He was older than Frederick had expected, with keen gray eyes and a scar running from his temple to his jaw. Yet there was no cruelty in his face—only watchfulness. “Thank you, sir,” Frederick said. “His name is Thunder.” “A bold name,” the knight replied. “Let us hope his rider proves worthy of it.” Frederick felt his cheeks warm but stood straight. “I will try, sir.” The knight pushed away from the rail and offered his hand. “Sir Roland. I oversee the squires of this castle.” Frederick clasped it, surprised by the strength of the knight’s grip. “Welcome, Frederick,” Sir Roland said. “The road has brought you here. Discipline will decide how far you go.” </nc-section>
created_at
"2026-04-24T21:18:51.718Z"
4/24/2026, 9:18:51 PM
model
"openai/gpt-5.5-20260423"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
919
moderation_latency
(null)
generation_time
18125
tokens_prompt
318
tokens_completion
895
native_tokens_prompt
302
native_tokens_completion
745
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.02386
router
(null)
provider_responses
0
endpoint_id
"58e5b336-423e-430b-a2ab-8bc353f0c51b"
id
"resp_04d6ad135d4873b00169ebde3bdff88197b766f79e073005c5"
is_byok
false
latency
463
model_permaslug
"openai/gpt-5.5-20260423"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777065531-P1HQelnFNRuahi331rG0"
session_id
(null)
api_type
"completions"
id
"gen-1777065531-KrRURpJz2o7OPZMT0el1"
upstream_id
"resp_04d6ad135d4873b00169ebde3bdff88197b766f79e073005c5"
total_cost
0.02386
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
response_cache_source_id
(null)
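As a rough reading of the metadata above, completion throughput can be estimated from native_tokens_completion and generation_time. This is only a sketch: it assumes generation_time is reported in milliseconds, which the page does not state, and the variable names are illustrative.

```python
# Values copied from the run metadata above.
native_tokens_completion = 745
generation_time_ms = 18125  # assumption: the unit is milliseconds

# Tokens generated per second of generation time.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"{tokens_per_second:.2f} tokens/s")
```

Under that assumption the run works out to roughly 41 completion tokens per second.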
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
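The evaluator patterns listed above can be tried locally against a response string. The following is a minimal sketch, not the benchmark's actual harness: the EVALUATORS mapping and the run_evaluators helper are hypothetical names, and the patterns are copied from the table with the JavaScript-style / delimiters removed.

```python
import re

# Patterns copied from the evaluation details, without the surrounding / delimiters.
# EVALUATORS and run_evaluators are hypothetical names used only for this sketch.
EVALUATORS = {
    "Match blue prose section": r'<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>',
    "Match green prose section": r'<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>',
    "Match red prose section": r'<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>',
}


def run_evaluators(response: str) -> dict[str, bool]:
    """Return True for each evaluator whose pattern occurs anywhere in the response."""
    return {
        name: re.search(pattern, response) is not None
        for name, pattern in EVALUATORS.items()
    }


# A shortened stand-in for the model response, one section per colour.
sample = (
    '<nc-section data-title="Version 1" data-color="red"> Red prose. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> Green prose. </nc-section> '
    '<nc-section data-title="Version 3" data-color="blue"> Blue prose. </nc-section>'
)

results = run_evaluators(sample)
for name, passed in results.items():
    print("Pass" if passed else "Fail", name)
```

Because the section body is matched with [^<]+, each pattern requires a non-empty body with no "<" character, so a section containing nested markup would not match, and attribute order and quoting must be exactly as shown.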