NC Bench
Run: 2026-05-01-01KQHR5V5GM7TR444RRE1NSWM3
Run Details
Model: Grok 4.20 (Reasoning)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 34.0s
Cost: $0.009679
Provider: xAI
Evaluation summary
Score | Evaluator
Pass | Match blue prose section
Pass | Match green prose section
Pass | Match red prose section
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon".

The following tools are available to you:
- <nc-section> Use this tool to create a new section of prose.
  Arguments:
  - "data-title": The title of the section.
  - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown).
  Example usage:
  <nc-section data-title="Chapter 1: The Arrival" data-color="blue">
  Frederick, a young squire, arrives at the castle to begin his training.
  </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions:
"""The local knight, Sir Roland, greets Frederick near the stables."""
Create 3 alternate prose sections that continue the story.
- Version 1 (red)
- Version 2 (green)
- Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: The Knight's Welcome" data-color="red"> I dismounted from Thunder near the stables, my legs stiff from the long ride. A formidable knight approached me, his armor clinking with each step. Sir Roland, the local knight, greeted me with a stern expression. "Frederick, I presume," he said, eyeing me up and down. "Your reputation precedes you, but reputation means nothing here. Prove yourself in training, or return home in shame." The weight of his words settled on my shoulders like the castle's ancient stones, and I wondered if I had what it took to survive the trials ahead. </nc-section>

<nc-section data-title="Version 2: The Knight's Welcome" data-color="green"> I dismounted from Thunder near the stables, my legs stiff from the long ride. The welcoming smile of a seasoned warrior greeted me as he strode forward from the shadows. Sir Roland, the local knight, clapped me on the back heartily. "Ah, young Frederick! We've been anticipating your arrival. Your horse is magnificent – let's see to him first, then get you some food and rest. Tomorrow, your true journey begins." His easy laugh eased the knots in my stomach, reminding me that even in this vast fortress, there might be allies and warmth to be found. </nc-section>

<nc-section data-title="Version 3: The Knight's Welcome" data-color="blue"> I dismounted from Thunder near the stables, my legs stiff from the long ride. An enigmatic figure waited in the dim light of the lanterns, his eyes reflecting secrets older than the walls themselves. Sir Roland, the local knight, greeted me with an intense gaze that seemed to pierce my soul. "Frederick," he murmured, glancing toward the darkening horizon where legends whispered of fire and scale. "The last dragon's shadow stirs once more. What you learn here will determine more than your knighthood – it may decide the fate of us all." </nc-section>
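The response uses the <nc-section> tool format described in the system prompt. As a rough illustration of how such output could be consumed, the sketch below pulls each section's title, color, and body out of a response string. It is not Novelcrafter's or NC Bench's actual parser: the names `Section`, `ALLOWED_COLORS`, `SECTION_RE`, and `extract_sections` are hypothetical, and the attribute order (data-title before data-color) is assumed from the example in the system prompt.

```python
import re
from typing import NamedTuple


class Section(NamedTuple):
    """One parsed <nc-section> block (hypothetical helper type)."""
    title: str
    color: str
    body: str


# Colors listed for "data-color" in the system prompt.
ALLOWED_COLORS = {"red", "green", "blue", "yellow", "purple", "orange", "pink", "brown"}

# Assumption: attributes appear as data-title then data-color, as in the
# system prompt's example. re.DOTALL lets a section body span several lines.
SECTION_RE = re.compile(
    r'<nc-section data-title="([^"]*)" data-color="([^"]*)">(.*?)</nc-section>',
    re.DOTALL,
)


def extract_sections(text: str) -> list[Section]:
    """Return every well-formed section whose color is in the allowed list."""
    sections = []
    for title, color, body in SECTION_RE.findall(text):
        if color in ALLOWED_COLORS:
            sections.append(Section(title, color, body.strip()))
    return sections


# Shortened stand-in text (hypothetical, not the run's actual output).
demo = (
    '<nc-section data-title="Version 1" data-color="red"> Sir Roland nods. </nc-section> '
    '<nc-section data-title="Version 2" data-color="teal"> Sir Roland smiles. </nc-section>'
)
parsed = extract_sections(demo)
```

Here `parsed` holds only the red section, because "teal" is not among the colors the system prompt allows.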
created_at: "2026-05-01T12:28:21.557Z" (5/1/2026, 12:28:21 PM)
model: "x-ai/grok-4.20-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 30961
moderation_latency: (null)
generation_time: 33922
tokens_prompt: 318
tokens_completion: 1448
native_tokens_prompt: 411
native_tokens_completion: 3693
native_tokens_completion_images: (null)
native_tokens_reasoning: 3281
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00967905
router: (null)
provider_responses:
  0:
    endpoint_id: "3ecee37f-b217-4093-87fb-aaf0afe307af"
    id: "8c3283d0-e3b1-9a20-8495-db9af7fdb623"
    is_byok: false
    latency: 92
    model_permaslug: "x-ai/grok-4.20-20260309"
    provider_name: "xAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777638501-jKJzGmFkdIC1rApi8zCN"
session_id: (null)
api_type: "completions"
id: "gen-1777638501-fsq3VhDtijcM50j8RZu5"
upstream_id: "8c3283d0-e3b1-9a20-8495-db9af7fdb623"
total_cost: 0.00967905
cache_discount: 0.0000672
upstream_inference_cost: 0
provider_name: "xAI"
response_cache_source_id: (null)
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
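The three evaluators are plain regular-expression checks, so they can be approximated outside the benchmark. The sketch below is a hedged reconstruction, not NC Bench's own code: the patterns are copied from the Details column with the surrounding /.../ delimiters and the `\/` escape dropped, and it assumes each pattern is applied as an unanchored search over the whole response. The names `EVALUATORS` and `run_evaluators` are hypothetical.

```python
import re

# Patterns from the evaluation details, rewritten as Python raw strings.
EVALUATORS = {
    "Match blue prose section":
        r'<nc-section data-title="[^"]*" data-color="blue">[^<]+</nc-section>',
    "Match green prose section":
        r'<nc-section data-title="[^"]*" data-color="green">[^<]+</nc-section>',
    "Match red prose section":
        r'<nc-section data-title="[^"]*" data-color="red">[^<]+</nc-section>',
}


def run_evaluators(response: str) -> dict[str, bool]:
    """Map each evaluator name to True if its pattern occurs in the response.

    Assumption: an unanchored search; the report does not say how the
    benchmark applies the patterns.
    """
    return {
        name: re.search(pattern, response) is not None
        for name, pattern in EVALUATORS.items()
    }


# Shortened stand-in response (hypothetical text, not the run's output).
# The blue section swaps the attribute order, which these patterns reject.
sample = (
    '<nc-section data-title="Version 1" data-color="red">Sir Roland nods.</nc-section> '
    '<nc-section data-title="Version 2" data-color="green">Sir Roland smiles.</nc-section> '
    '<nc-section data-color="blue" data-title="Version 3">Sir Roland waits.</nc-section>'
)
results = run_evaluators(sample)
```

The patterns are strict in two ways: data-title must come before data-color, and `[^<]+` requires a non-empty body containing no `<` character, so a section with nested markup or an empty body would fail. In the sample, the red and green checks pass and the blue check fails for the first reason.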