NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM08WJQB2CS38DBVX7WR367R
Run Details
Model:
GPT-5.4 Mini (Reasoning)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
4.1s
Cost:
$0.002720
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> Frederick had scarcely reached the stable yard when a broad-shouldered knight stepped out from the shadow of the barn. Sir Roland wore polished mail beneath a travel-stained surcoat, and his gray beard stirred in the evening breeze. “You must be Frederick,” he said, offering a firm hand. “The stables are the first lesson here. A squire who cannot care for his mount will never earn the trust of a knight.” He glanced toward the restless horses, then back to the young man with a stern but not unkind smile. “Come. I’ll show you where you belong.” </nc-section> <nc-section data-title="Version 2" data-color="green"> Near the stables, Frederick was met by a man whose presence seemed to straighten the very air around him. Sir Roland, the local knight, stood with one hand resting on the pommel of his sword and the other raised in greeting. “So this is the new squire,” he said, his voice deep and measured. “I heard you arrived before supper.” He looked Frederick over, from dusty boots to travel-worn cloak. “You’ll begin with the horses. Quiet hands, clean stalls, and respect for every beast you touch. If you can manage that, lad, the rest of your training may yet be worth the effort.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> The scent of hay and leather filled the air as Frederick approached the stables, where a knight in steel-trimmed blue surcoat was waiting. Sir Roland’s expression was serious, though his eyes carried a hint of approval. “Frederick, I presume,” he said, inclining his head. “You’ve come a long road to stand here.” He gestured toward the rows of stalls, where horses stamped and snorted in the dim light. “A squire learns many things, but first he learns patience. Help my stablemaster tonight, and if you prove yourself useful, I may take an interest in your training myself.” </nc-section>
created_at
"2026-03-18T10:46:58.551Z"
3/18/2026, 10:46:58 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2801
moderation_latency
(null)
generation_time
3829
tokens_prompt
318
tokens_completion
624
native_tokens_prompt
302
native_tokens_completion
554
native_tokens_completion_images
0
native_tokens_reasoning
108
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0027195
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_095323c52bbe8e870169ba82a2a03c8197b6eb9e682620ad87"
is_byok
false
latency
277
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830818-ii5vrc3wrhoyK9b8LNvb"
upstream_id
"resp_095323c52bbe8e870169ba82a2a03c8197b6eb9e682620ad87"
total_cost
0.0027195
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval