NC Bench
Run: 2026-03-18-01KM09BNV4AKRF9CF6Y3V5D402
Run Details
Model: GPT-5.4 Mini (Reasoning, Low)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 2.8s
Cost: $0.002346
Provider: OpenAI
Evaluation summary
Match blue prose section: Pass
Match green prose section: Pass
Match red prose section: Pass
Overall: no eval
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon".
The following tools are available to you:
- <nc-section> Use this tool to create a new section of prose.
  Arguments:
  - "data-title": The title of the section.
  - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown).
  Example usage:
  <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions:
"""The local knight, Sir Roland, greets Frederick near the stables."""
Create 3 alternate prose sections that continue the story.
- Version 1 (red)
- Version 2 (green)
- Version 3 (blue)
Model response
AI
<nc-section data-title="Version 1" data-color="red"> Frederick had scarcely set foot in the stable yard when a broad-shouldered knight in polished mail stepped from the shadow of the hayloft. “You must be Frederick,” he said, his voice deep but not unkind. “I am Sir Roland. If you are to serve here, you’ll learn the castle’s ways quickly—and the horses’ temperaments faster.” He glanced toward Thunder, who snorted at the sound of his name. Roland’s mouth twitched in a faint smile. “A spirited mount. Good. Spirited riders make for better squires.” </nc-section>
<nc-section data-title="Version 2" data-color="green"> Near the stables, Frederick was halted by the ring of steel and the calm authority of a man who had spent years in the saddle. Sir Roland stood beneath the timbered awning, one gauntleted hand resting on the hilt of his sword. “Frederick, is it?” he asked. “The steward sent word of your arrival.” When Frederick bowed nervously, Roland gave a short nod. “Mind the stable boys, mind the horses, and mind yourself. A castle teaches many lessons, but none so quickly as the smell of saddle leather and hay.” The knight’s gaze softened as he looked Frederick over. “Come. I’ll show you where a squire begins.” </nc-section>
<nc-section data-title="Version 3" data-color="blue"> As Frederick approached the stables, he found Sir Roland already there, speaking with the groom-master in a low, measured tone. The knight turned at once, his expression stern until it settled into something more welcoming. “You’re the new squire,” he said. “I’m Sir Roland. These stables will be your first classroom, if the gods have any sense.” He gestured to the rows of horses shifting in their stalls. “Listen to them. They’ll tell you more about patience, fear, and courage than any lecture ever could.” Frederick straightened, suddenly certain that this castle would demand more from him than he had ever imagined. </nc-section>
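The response above can be split back into its individual sections with a small parser. The following is a minimal sketch, not Novelcrafter's actual implementation: the names `SECTION_RE`, `Section`, and `parse_sections` are hypothetical, and the pattern assumes that `data-title` precedes `data-color` and that the section body contains no `<` character, as in the response shown here.

```python
import re
from typing import NamedTuple

# Colors listed in the system prompt for the "data-color" argument.
ALLOWED_COLORS = {"red", "green", "blue", "yellow", "purple", "orange", "pink", "brown"}

# Hypothetical pattern: assumes data-title comes before data-color
# and that the prose body contains no "<" character.
SECTION_RE = re.compile(
    r'<nc-section data-title="([^"]*)" data-color="([^"]*)">([^<]+)</nc-section>'
)


class Section(NamedTuple):
    title: str
    color: str
    body: str


def parse_sections(response: str) -> list[Section]:
    """Extract every <nc-section> block from a model response."""
    sections = []
    for title, color, body in SECTION_RE.findall(response):
        if color not in ALLOWED_COLORS:
            raise ValueError(f"unsupported color: {color!r}")
        sections.append(Section(title, color, body.strip()))
    return sections


# Shortened stand-in for the model response above.
sample = (
    '<nc-section data-title="Version 1" data-color="red"> Red text. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> Green text. </nc-section>'
)
for section in parse_sections(sample):
    print(section.color, "->", section.title)
```

A regular expression is enough for this flat, non-nested tag format; a response that nests tags or reorders attributes would need a real markup parser instead.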
created_at: "2026-03-18T10:55:13.263Z" (3/18/2026, 10:55:13 AM)
model: "openai/gpt-5.4-mini-20260317"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 926
moderation_latency: (null)
generation_time: 2582
tokens_prompt: 318
tokens_completion: 514
native_tokens_prompt: 302
native_tokens_completion: 471
native_tokens_completion_images: 0
native_tokens_reasoning: 10
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.002346
router: (null)
provider_responses:
  0:
    endpoint_id: "9ee065b2-3d1c-43bc-bdd7-28af3b148282"
    id: "resp_06a34efb58a8e86d0169ba849154d48196ba149cbf41a037da"
    is_byok: false
    latency: 222
    model_permaslug: "openai/gpt-5.4-mini-20260317"
    provider_name: "OpenAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773831313-abEKkXrY5PNJ5dk1h2yo"
upstream_id: "resp_06a34efb58a8e86d0169ba849154d48196ba149cbf41a037da"
total_cost: 0.002346
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
Evaluation details

Result: Pass
Evaluator: Match blue prose section
Details: /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
Meta Data: n/a

Result: Pass
Evaluator: Match green prose section
Details: /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
Meta Data: n/a

Result: Pass
Evaluator: Match red prose section
Details: /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
Meta Data: n/a
no eval
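The three evaluators listed above are plain regular-expression checks, one per color. As a rough sketch of how such a check might be run, assuming a simple "pattern found anywhere in the response" rule (the helper names `section_pattern` and `evaluate` are hypothetical and not part of NC Bench):

```python
import re


def section_pattern(color: str) -> re.Pattern:
    """Build the per-color pattern shown in the evaluation details."""
    return re.compile(
        rf'<nc-section data-title="[^"]*" data-color="{re.escape(color)}">[^<]+</nc-section>'
    )


def evaluate(response: str, colors=("blue", "green", "red")) -> dict[str, bool]:
    """Return a pass/fail flag per evaluator (assumed rule: pattern found anywhere)."""
    return {
        f"Match {color} prose section": section_pattern(color).search(response) is not None
        for color in colors
    }


# Shortened stand-in for a model response; the blue section is missing on purpose.
response = (
    '<nc-section data-title="Version 1" data-color="red"> Red text. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> Green text. </nc-section>'
)
for name, passed in evaluate(response).items():
    print("Pass" if passed else "Fail", "-", name)
```

Because the body is matched with `[^<]+`, a section fails if it is empty or contains any `<` character, and the fixed attribute order means `data-color` placed before `data-title` would not match either.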