NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM0934WT5453AZ5EJVAZJEDZ
Run Details
Model:
GPT-5.4 Mini (Reasoning, Low)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
2.3s
Cost:
$0.002018
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> Frederick had barely dismounted when a broad-shouldered knight emerged from the shadow of the stables, his armor dull with travel and his expression stern but not unkind. “You must be the new squire,” he said, voice deep as a bell. “I am Sir Roland. If you mean to serve here, you’ll learn the rules of this place quickly.” He glanced toward the restless horses, then back to Frederick. “Come. I’ll show you where the work begins.” </nc-section> <nc-section data-title="Version 2" data-color="green"> Near the stables, Frederick found a tall knight in a weathered surcoat leaning against a post, one hand resting on the hilt of his sword. The man straightened as Frederick approached, and his stern face softened into the faintest hint of a smile. “Sir Roland,” he introduced himself, offering a nod of welcome. “You look tired, boy. That’s no crime on your first day. Come, let’s see if you can tell a saddle from a bridle before sunset.” </nc-section> <nc-section data-title="Version 3" data-color="blue"> As Frederick led his horse toward the stables, a knight in polished mail stepped out from the open doors, as though he had been expecting him all along. “Frederick, I presume,” said the man with a grave but courteous bow. “I am Sir Roland, keeper of these yards and, by unfortunate duty, guardian of every clumsy squire who arrives here.” His eyes flicked to Thunder, then to Frederick’s uncertain hands. “Do not worry. The castle can seem harsh at first. I will see that you find your footing.” </nc-section>
created_at
"2026-03-18T10:50:33.768Z"
3/18/2026, 10:50:33 AM
model
"openai/gpt-5.4-mini-20260317"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
841
moderation_latency
(null)
generation_time
2181
tokens_prompt
318
tokens_completion
417
native_tokens_prompt
302
native_tokens_completion
398
native_tokens_completion_images
0
native_tokens_reasoning
11
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0020175
router
(null)
provider_responses
0
endpoint_id
"9ee065b2-3d1c-43bc-bdd7-28af3b148282"
id
"resp_095cb3a24df6e3380169ba8379d3d08190acabaa754d14724c"
is_byok
false
latency
189
model_permaslug
"openai/gpt-5.4-mini-20260317"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773831033-YRAtK63DLgrgVzlC4g7n"
upstream_id
"resp_095cb3a24df6e3380169ba8379d3d08190acabaa754d14724c"
total_cost
0.0020175
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval