NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-12-01KKHZ90M0Z1XMZ8606EQ0K20R
Run Details
Model:
Grok 4.20 (Beta, Reasoning)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
9.7s
Cost:
$0.013793
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: The Knight's Welcome" data-color="red"> As I guided Thunder through the castle gates and toward the stables, the last rays of sunset painted the stone buildings in hues of amber and rose. I dismounted, my legs stiff from the long ride, and began to loosen the stallion's saddle. Heavy footsteps approached from behind. A broad-shouldered knight in polished plate armor stepped into the lantern light, his crimson cloak billowing slightly. "You must be Frederick, the new squire," he said, his voice deep yet kind. "I am Sir Roland, keeper of the stables and trainer of fledgling warriors. You've chosen a fine mount—few arrive with such spirit in both horse and rider." </nc-section> <nc-section data-title="Version 2: Shadows of Expectation" data-color="green"> The castle gates creaked shut behind me as Thunder carried me the final stretch to the stables. The evening air carried the scent of hay and oiled leather. I slid from the saddle, patting the horse's sweat-dampened neck, when a figure emerged from the stable doorway holding a flickering torch. Sir Roland's face was weathered and serious, his green surcoat embroidered with the kingdom's dragon emblem. "Squire Frederick," he greeted me gruffly, though not unkindly. "You've kept good time. Many squires arrive late and already defeated. Come, let us see if that fire in your eyes matches the strength needed to serve the Last Dragon's legacy." </nc-section> <nc-section data-title="Version 3: The Knowing Glance" data-color="blue"> Thunder snorted and slowed as we reached the shadowed eaves of the stables, the banners above now dark against the twilight sky. My boots hit the packed earth, and I reached for the bridle when a calm voice called out from the gloom. Sir Roland strode forward, his blue cloak seeming to absorb the remaining light, a silver pin shaped like a coiled dragon at his throat. "Frederick," he said, studying me with eyes that seemed older than the castle stones themselves. "The local knight, at your service. The wind told me a squire of uncommon blood would arrive today. Your horse senses it too, I think. Few beasts carry themselves with such pride near these walls." </nc-section>
created_at
"2026-03-12T21:29:38.184Z"
3/12/2026, 9:29:38 PM
model
"x-ai/grok-4.20-beta-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
7455
moderation_latency
(null)
generation_time
9570
tokens_prompt
318
tokens_completion
562
native_tokens_prompt
411
native_tokens_completion
2277
native_tokens_completion_images
0
native_tokens_reasoning
1800
native_tokens_cached
384
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0137928
router
(null)
provider_responses
0
endpoint_id
"33f49f24-a0de-4dcf-872d-ca514e83fbae"
id
"8bbb5f43-2495-615c-80c9-105b8c4cd032"
is_byok
false
latency
270
model_permaslug
"x-ai/grok-4.20-beta-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773350978-lcBHVzmJDvqJeCA4VdzV"
upstream_id
"8bbb5f43-2495-615c-80c9-105b8c4cd032"
total_cost
0.0137928
cache_discount
0.0006912
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval