NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-31-01KSYXJ90V2Q4SP3DRP53BX8HV
Run Details
Model:
Claude Opus 4.8 (Reasoning, High)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
11.8s
Cost:
$0.018975
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="A Stern Welcome" data-color="red"> As I dismounted near the stables, a towering figure emerged from the shadows of the doorway. Sir Roland, his armor still bearing the dust of the training yard, fixed me with a hard, appraising stare. "So, you're the new squire," he grunted, crossing his arms over his broad chest. "Let me make one thing clear, boy. I have no patience for the soft sons of nobles who think this is a game. Here, you will bleed, you will sweat, and you will learn—or you will leave. Which will it be?" </nc-section> <nc-section data-title="An Unexpected Kindness" data-color="green"> I had barely led Thunder toward the stables when a warm voice called out to me. "Welcome, lad! You must be Frederick." Sir Roland strode forward, a genuine smile breaking across his weathered face. Though his frame was that of a seasoned warrior, his eyes held a gentle wisdom. He clapped a heavy hand on my shoulder and helped me with the saddle. "Don't look so nervous. We were all young squires once. Come, let me show you where Thunder can rest. A knight always tends to his horse before himself—remember that." </nc-section> <nc-section data-title="A Riddle in the Twilight" data-color="blue"> As the last light faded, I noticed a man leaning casually against the stable wall, half-hidden in shadow. He did not announce himself, but watched me with curious, glittering eyes. "Tell me, young one," Sir Roland said at last, his voice soft and measured, "do you know why the dragon banners no longer fly above these gates?" I shook my head, confused by such a strange greeting. He pushed off from the wall and stepped into the dying light, his expression unreadable. "No. Of course you don't. But you will. That, I'm afraid, is precisely why you were summoned here." </nc-section>
created_at
"2026-05-31T11:44:03.111Z"
5/31/2026, 11:44:03 AM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2203
moderation_latency
(null)
generation_time
11715
tokens_prompt
318
tokens_completion
502
native_tokens_prompt
435
native_tokens_completion
672
native_tokens_completion_images
(null)
native_tokens_reasoning
40
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.018975
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_01LPLTLjRWFEkHcZKqZej523"
is_byok
false
latency
1008
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780227843-BGnsWUpRpSakkQtW7GCY"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780227843-0VHPgrbsSA9v66LOpxMD"
upstream_id
"msg_01LPLTLjRWFEkHcZKqZej523"
total_cost
0.018975
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval