NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM0A3ADTEPJENC60DKW5PC05
Run Details
Model:
GPT-5.4 Mini
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
3.1s
Cost:
$0.002233
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> Frederick had scarcely dismounted before a broad-shouldered knight in polished mail came striding from the shadow of the stables. "You must be the new squire," he said, his voice carrying the easy authority of long command. "I am Sir Roland. If you mean to serve at this castle, you'll learn its horses before you learn its halls." He offered Frederick a firm hand to shake, then nodded toward the stalls where restless hooves thudded against straw. "Come along, lad. We'll see whether you're frightened of beasts, or only of me." </nc-section> <nc-section data-title="Version 2" data-color="green"> Near the stables, Frederick was met by a knight in a weathered surcoat, who raised a hand in greeting as if he and the boy were already old acquaintances. "Sir Roland," he said with a grin, "and you must be Frederick." The knight glanced over the row of stable doors, listening to the snorts and stamping within. "A castle teaches many lessons, but the horses tell the truth sooner than men do. If you can calm a nervous mare, you'll manage the rest of your duties well enough." His tone was warm, but there was steel beneath it, as though every kindness came wrapped in a lesson. </nc-section> <nc-section data-title="Version 3" data-color="blue"> As Frederick approached the stable yard, a knight with a scar across his chin stepped out to meet him, resting one hand on the pommel of his sword. "Sir Roland," he introduced himself, giving a short, respectful bow. "You've arrived at a good time. The horses are being settled, and a squire should know how a castle breathes before he ever sets foot in the keep." He looked Frederick over with sharp, measuring eyes, then softened slightly. "Walk with me. I’ll show you where the tack is kept—and if you’re lucky, I may even tell you which horse kicks and which one only pretends to." </nc-section>
created_at
"2026-03-18T11:08:08.013Z"
3/18/2026, 11:08:08 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
787
moderation_latency
(null)
generation_time
2951
tokens_prompt
318
tokens_completion
486
native_tokens_prompt
302
native_tokens_completion
446
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0022335
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_0344b09783aca9ca0169ba8798112c81949a3d237124ef9fd9"
is_byok
false
latency
176
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773832088-kJhCU5lYXPimRE8NR80G"
upstream_id
"resp_0344b09783aca9ca0169ba8798112c81949a3d237124ef9fd9"
total_cost
0.0022335
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval