NC Bench
Run: 2026-03-18-01KM07CJYC8PAN7VT01SSR8KD5
Run Details
Model: GPT-5.4 Mini (Reasoning, Low)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 3.1s
Cost: $0.002292
Provider: OpenAI
Evaluation summary
Score | Evaluator
Pass | Match blue prose section
Pass | Match green prose section
Pass | Match red prose section
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> Frederick had barely stepped off the road when a broad-shouldered knight in weathered mail came striding from the stables, a riding crop tucked beneath one arm. “You must be the squire,” Sir Roland said, his voice rough as gravel but not unkind. He studied Frederick with keen gray eyes, then nodded toward the stable doors where the scent of hay and horseflesh drifted on the warm air. “Welcome to the castle. If you can saddle a horse before nightfall, you may yet prove useful.” </nc-section> <nc-section data-title="Version 2" data-color="green"> Near the stables, Frederick was met by Sir Roland, the local knight, who emerged from the shadows of the timbered overhang with a helm under one arm and a stern expression that softened when he saw the boy’s nervous posture. “So, you’re the new squire,” Roland said, glancing toward the line of restless horses. “Good. They say a man is judged first by how he treats a horse, and second by how he handles a sword.” He held out a gloved hand. “Come along. I’ll show you where you’ll sleep—and where the trouble tends to begin.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> As Frederick approached the castle’s lower yard, a knight in a faded blue tabard was waiting by the stables, one hand resting on the wooden fence while the other adjusted the strap beneath his gauntlet. “Frederick, I presume,” said Sir Roland with a measured smile. “You’ve arrived at a busy hour.” Behind him, a mare stamped impatiently, and the stall doors rattled softly in the breeze. Roland gestured toward the stalls. “If you are to serve here, you should meet the beasts before you meet the men. Horses are far less forgiving, but far more honest.” </nc-section>
created_at: "2026-03-18T10:20:45.91Z" (3/18/2026, 10:20:45 AM)
model: "openai/gpt-5.4-mini-20260317"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1497
moderation_latency: (null)
generation_time: 2925
tokens_prompt: 318
tokens_completion: 468
native_tokens_prompt: 302
native_tokens_completion: 459
native_tokens_completion_images: 0
native_tokens_reasoning: 30
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.002292
router: (null)
provider_responses:
  0:
    endpoint_id: "9ee065b2-3d1c-43bc-bdd7-28af3b148282"
    id: "resp_07878b37cb7f3c0b0169ba7c7df938819382cd4cfee1d84a19"
    is_byok: false
    latency: 183
    model_permaslug: "openai/gpt-5.4-mini-20260317"
    provider_name: "OpenAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773829245-qCIy6JP6RkCL6iv8Xk06"
upstream_id: "resp_07878b37cb7f3c0b0169ba7c7df938819382cd4cfee1d84a19"
total_cost: 0.002292
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
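The three evaluators in the evaluation details are plain regular expressions, so their behaviour can be reproduced outside the benchmark. The sketch below is not the NC Bench harness: it assumes each pattern is applied as an unanchored search over the full response text, and the `evaluate` helper and the `sample` string are hypothetical names introduced for illustration. The patterns themselves are copied from the details column, with only the surrounding `/` delimiters removed.

```python
import re

# Evaluator patterns copied from the evaluation details. Only the
# JavaScript-style "/" delimiters are dropped; the "\/" escape is kept,
# which Python's re module reads as a literal "/".
EVALUATORS = {
    "Match blue prose section": r'<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>',
    "Match green prose section": r'<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>',
    "Match red prose section": r'<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>',
}


def evaluate(response: str) -> dict[str, bool]:
    """Report, per evaluator, whether its pattern occurs anywhere in the response.

    Assumption: the benchmark uses search (unanchored) semantics.
    """
    return {
        name: re.search(pattern, response) is not None
        for name, pattern in EVALUATORS.items()
    }


# Hypothetical, shortened stand-in for a model response (not the real output).
sample = (
    '<nc-section data-title="Version 1" data-color="red"> Sir Roland nods. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> Sir Roland waves. </nc-section> '
    '<nc-section data-title="Version 3" data-color="blue"> Sir Roland smiles. </nc-section>'
)

results = evaluate(sample)
for name, matched in results.items():
    print(name, "|", matched)
```

Under these assumptions the patterns are stricter than they may first appear. `[^<]+` rejects an empty section body and any body containing a `<` character (for example nested markup), and the attributes must appear in the order `data-title` then `data-color`, each in double quotes.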