NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-12-01KKHYYZ6T9KEW84RA9K3EQABK
Run Details
Model:
Grok 4.20 (Beta, Reasoning)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
13.7s
Cost:
$0.013469
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: The Knight's Welcome" data-color="red"> As Frederick guided Thunder through the castle gates and toward the stables, the sun dipped below the western towers, painting the stones in hues of amber and crimson. The local knight, Sir Roland, greets Frederick near the stables with a gauntleted hand raised in salute. "Well met, young squire," the knight declared, his voice carrying the weight of countless battles. "I am Sir Roland, keeper of these halls. Your arrival has been anticipated—there are legends stirring in the mountains, and a squire with fire in his blood may yet prove useful against the shadow of the Last Dragon." </nc-section> <nc-section data-title="Version 2: Shadows in the Stables" data-color="green"> Thunder's hooves clattered against the cobblestones as Frederick reined him in beside the weathered stable doors, the banners overhead snapping like warnings in the evening breeze. The local knight, Sir Roland, greets Frederick near the stables, stepping forward from the hay-scented gloom with a lantern in hand. His armor bore the scars of dragonfire from wars long past, and his eyes gleamed with a mixture of approval and caution. "Steady now, lad," he said, inspecting both rider and steed. "The path you've chosen winds closer to ancient secrets than you know. Come, we'll speak of your training where the walls can't hear." </nc-section> <nc-section data-title="Version 3: The First Test" data-color="blue"> The castle looms larger with every stride of his stallion, until Frederick finally slows Thunder to a trot and turns toward the stables nestled against the curtain wall. The local knight, Sir Roland, greets Frederick near the stables just as the last rays of sunlight vanish. Clad in a cloak edged with silver runes, Sir Roland clasps Frederick's shoulder with surprising strength. "You've ridden well, squire, but the real trial begins here," he rumbled, a faint smile breaking through his stern beard. "The Last Dragon's breath still warms these lands in hidden ways. Prove yourself worthy, and I shall forge you into something greater than you dream." </nc-section>
created_at
"2026-03-12T21:24:09.055Z"
3/12/2026, 9:24:09 PM
model
"x-ai/grok-4.20-beta-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
10744
moderation_latency
(null)
generation_time
13578
tokens_prompt
318
tokens_completion
544
native_tokens_prompt
411
native_tokens_completion
2223
native_tokens_completion_images
0
native_tokens_reasoning
1772
native_tokens_cached
384
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0134688
router
(null)
provider_responses
0
endpoint_id
"33f49f24-a0de-4dcf-872d-ca514e83fbae"
id
"12341bcb-efed-df37-2d1b-72cfeec85428"
is_byok
false
latency
271
model_permaslug
"x-ai/grok-4.20-beta-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773350649-Sg2dTSbnwfvUCL4Kif6K"
upstream_id
"12341bcb-efed-df37-2d1b-72cfeec85428"
total_cost
0.0134688
cache_discount
0.0006912
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval