NC Bench
Overview
About
Tests
Models
Model
GPT-4.1 Nano
Test
Tool usage within Novelcrafter
Scenario
Create alternate prose sections
Run ID
2025-04-14-01JRTSR95VW2YRF3MVQ5YBKR7V
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: The Red Dawn" data-color="red"> Sir Roland, a tall, stern knight with a scar across his cheek, approached Frederick as he dismounted. His armor gleamed ominously in the fading light, and his eyes surveyed the young squire with a mix of curiosity and authority. "Welcome to the castle, lad," he said gruffly, resting a hand on his sword hilt. "There's much to learn, and little time to waste." The lanterns flickered to life as they moved toward the stables, the air thick with the scent of horses and steel. </nc-section>
id
"gen-1744656148-C0xMTSMspxcjltnspTke"
upstream_id
"chatcmpl-BMJ3EoOYojG6akhX8DfDmKn98aXoh"
total_cost
0.0000837
cache_discount
(null)
provider_name
"OpenAI"
created_at
"2025-04-14T18:42:29.359946+00:00"
Monday, April 14th, 2025 at 6:42:29 PM GMT+00:00
model
"openai/gpt-4.1-nano-2025-04-14"
app_id
182717
streamed
true
cancelled
false
latency
352
moderation_latency
113
generation_time
379
tokens_prompt
301
tokens_completion
134
native_tokens_prompt
301
native_tokens_completion
134
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000837
Evaluation details
Result
Evaluator
Details
Meta Data
Fail
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Fail
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
33.3333%