NC Bench
Run: 2026-03-18-01KM090NV9A6J39AQ7AX4PPCK9
Run Details
Model: GPT-5.4 Nano (Reasoning)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 4.2s
Cost: $0.001148
Provider: OpenAI
Evaluation summary
Score | Evaluator | Details Link
Pass | Match blue prose section | (details)
Pass | Match green prose section | (details)
Pass | Match red prose section | (details)
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: Sir Roland’s Test" data-color="red"> The hoofbeats slowed as I passed the training paddock, steering Thunder toward the stables. Straw and leather scents mingled with the smoke of distant cookfires, and somewhere a smith’s hammer rang like a steady heartbeat. A tall figure stepped from the shadow of the hayloft—plate armor catching the last of the sun, a sword at his hip worn like habit rather than fashion. “So you’re Frederick,” Sir Roland said, voice roughened by too many campaigns. “New to the yard and already riding like you belong here.” He walked closer, boots crunching straw. Without asking permission, he glanced over Thunder as if inspecting a battlefield supply. Then his gaze returned to me. “A squire’s first lesson,” he continued, “is that pride gets you hurt. Do you know the difference between a horse that’s strong and a horse that’s careless?” Before I could answer, he jerked his chin toward the stable door. “Show me how you tend your mount. Care for him like the next man to ride might be you—because it could be.” </nc-section> <nc-section data-title="Version 2: The Knight and the Oath" data-color="green"> I dismounted near the stables, boots sinking slightly into the packed earth. Thunder shifted, ears flicking at the scent of other horses and the scrape of tack being hung. “Frederick!” a voice called from the walkway, clear and commanding. I looked up to see Sir Roland approaching, a local knight whose armor was scuffed in all the right places—proof of battles, not negligence. His cloak snapped once in the evening wind, and the silver clasp at his throat caught the light. He stopped just short of the stable entrance, as though the animals themselves deserved a courteous distance. “You made it,” he said. “Good. The castle’s hungry, the yard’s restless, and the captains don’t care how nervous you feel.” Sir Roland leaned in slightly, lowering his voice. “Listen well, boy. Training begins today, but not with swords. 
First you learn obedience—to your duties, to your elders, to the laws of this keep. Then, when you’ve earned it, we’ll talk about blades and breath and courage.” He gestured toward the nearest stall. “Come. There are straps to check and water to be measured. And if you do it badly, everyone will know.” </nc-section> <nc-section data-title="Version 3: Steel and Straw" data-color="blue"> As I guided Thunder into the stable lane, the air turned warmer, thick with hay and the faint tang of oil. Lantern light pooled on the stone like honey, and silhouettes moved behind half-open doors—grooms working, horses stamping, ropes creaking. Sir Roland found me there, where the ground was sanded smooth by years of footsteps. He wore his armor as if it were a second skin, polished enough to reflect the lanterns, yet worn at the edges where real life had struck. “You’re late by a heartbeat,” he said, not unkindly—more like a remark carved into stone. “But a squire’s life is measured in heartbeats. One wrong step, one missed warning, and you’re tasting dirt with your pride.” He took two slow passes along the stall front, eyes sharp for anything out of place: a crooked bucket, a loose latch, a bridle placed too hastily. “Good horses,” Sir Roland went on, “can smell intention. If you’re careless, they’ll tell you. If you’re gentle, they’ll trust you.” Then he turned, pointing at my hands. “So—start. Knot that strap. Straighten that cinch. And when Thunder’s quiet, you’ll be quiet too. I’ll speak again once the stable smells right.” </nc-section>
created_at: "2026-03-18T10:49:12.821Z" (3/18/2026, 10:49:12 AM)
model: "openai/gpt-5.4-nano-20260317"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2530
moderation_latency: (null)
generation_time: 3900
tokens_prompt: 318
tokens_completion: 1078
native_tokens_prompt: 302
native_tokens_completion: 870
native_tokens_completion_images: 0
native_tokens_reasoning: 40
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.0011479
router: (null)
provider_responses:
  0:
    endpoint_id: "0c835f2e-c18d-4e8c-b245-e1e3bd08b97f"
    id: "resp_0a203f85e0b0bd8b0169ba8328dfe48197915113f775ad8675"
    is_byok: false
    latency: 165
    model_permaslug: "openai/gpt-5.4-nano-20260317"
    provider_name: "OpenAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773830952-cU48j7WR76W5zFtdRgag"
upstream_id: "resp_0a203f85e0b0bd8b0169ba8328dfe48197915113f775ad8675"
total_cost: 0.0011479
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
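The three evaluators listed here are plain regular-expression checks, one per color. The following is a minimal sketch of how such checks could be applied to a model response. It is not NC Bench's actual evaluation code. The names `section_pattern`, `evaluate`, and `sample` are hypothetical, and the patterns are Python translations of the JavaScript-style literals in the Details column.

```python
import re

# Colors checked by the three evaluators in this run.
COLORS = ("blue", "green", "red")


def section_pattern(color: str) -> re.Pattern:
    """Build a Python equivalent of the evaluator regex for one color.

    Assumption: the JavaScript-style literal is translated by dropping the
    surrounding slashes and the escaped slash in the closing tag.
    """
    return re.compile(
        r'<nc-section data-title="[^"]*" data-color="'
        + re.escape(color)
        + r'">[^<]+</nc-section>'
    )


def evaluate(response: str) -> dict[str, bool]:
    """Return, per color, whether the response contains a matching section."""
    return {
        color: section_pattern(color).search(response) is not None
        for color in COLORS
    }


# Hypothetical shortened response, not the model output recorded in this run.
sample = (
    '<nc-section data-title="Version 1" data-color="red">Red text.</nc-section> '
    '<nc-section data-title="Version 2" data-color="green">Green text.</nc-section> '
    '<nc-section data-title="Version 3" data-color="blue">Blue text.</nc-section>'
)
results = evaluate(sample)
print(results)
```

These patterns are strict. The `data-title` attribute must come before `data-color`, the section body must be non-empty, and the body may not contain a `<` character, so nested markup inside a section would cause a check to fail.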