NC Bench
Run: 2026-03-12-01KKHZD4WDXHTX7QZD4GS3GR9Z
Run Details
Model: Grok 4.20 (Beta)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 2.1s
Cost: $0.002461
Provider: xAI
Evaluation summary
Score | Evaluator | Details Link
Pass | Match blue prose section | (details)
Pass | Match green prose section | (details)
Pass | Match red prose section | (details)
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
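The tool format described in the system prompt can be illustrated with a minimal sketch. The helper name `render_nc_section` is hypothetical, and escaping the title is an assumption made here to keep the attribute well-formed; the source does not say how Novelcrafter itself parses or validates these tags.

```python
from html import escape

# Colors listed in the system prompt for the "data-color" argument.
ALLOWED_COLORS = {"red", "green", "blue", "yellow", "purple", "orange", "pink", "brown"}


def render_nc_section(title: str, color: str, prose: str) -> str:
    """Hypothetical helper: wrap prose in an <nc-section> tag."""
    if color not in ALLOWED_COLORS:
        raise ValueError(f"unsupported color: {color!r}")
    # Assumption: escape quotes in the title so the attribute stays well-formed.
    safe_title = escape(title, quote=True)
    return (
        f'<nc-section data-title="{safe_title}" data-color="{color}"> '
        f"{prose.strip()} </nc-section>"
    )


example = render_nc_section(
    "Chapter 1: The Arrival",
    "blue",
    "Frederick, a young squire, arrives at the castle to begin his training.",
)
print(example)
```

Called with the title, color, and prose from the prompt's own example, the helper reproduces that example tag.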
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> As Frederick led Thunder toward the stables, a tall figure emerged from the shadows of the stone archway. Sir Roland, the local knight whose reputation for unyielding discipline preceded him, stood with his arms crossed over his gleaming breastplate. "Hail, young squire," he called out, his voice deep and resonant like distant thunder. "You must be Frederick. The stable master told me of your arrival. Come, let me see to your horse while we speak of what lies ahead in your training." </nc-section> <nc-section data-title="Version 2" data-color="green"> The stables loomed ahead, their wooden beams weathered by countless seasons. Frederick had barely dismounted when he heard heavy footsteps approaching. Sir Roland appeared, his green cloak billowing slightly in the evening breeze, the emblem of the kingdom proudly displayed on his tabard. With a firm but welcoming grip on Frederick's shoulder, the knight smiled. "Well met, lad. I am Sir Roland. The roads have been kind to you, I trust? Leave your steed here and walk with me. There's much to discuss before the first light of your training dawns tomorrow." </nc-section> <nc-section data-title="Version 3" data-color="blue"> Dusk had settled over the castle grounds as Frederick guided his weary mount into the stable yard. From the doorway of the tack room stepped Sir Roland, his blue eyes sharp and assessing beneath a mane of silver-streaked hair. The knight's armor caught the last rays of sunlight, reflecting them like liquid sapphire. "Frederick, is it?" he boomed, extending a gauntleted hand in greeting. "I've been expecting you. The king himself mentioned your coming. Tend to your horse quickly, squire, and then we'll talk of the path that lies before you." </nc-section>
created_at: "2026-03-12T21:31:53.618Z" (3/12/2026, 9:31:53 PM)
model: "x-ai/grok-4.20-beta-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 280
moderation_latency: (null)
generation_time: 1977
tokens_prompt: 318
tokens_completion: 458
native_tokens_prompt: 409
native_tokens_completion: 389
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 384
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.0024608
router: (null)
provider_responses:
  0:
    endpoint_id: "33f49f24-a0de-4dcf-872d-ca514e83fbae"
    id: "f5e5282a-8b2a-a35b-c352-e5466e394b7d"
    is_byok: false
    latency: 242
    model_permaslug: "x-ai/grok-4.20-beta-20260309"
    provider_name: "xAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773351113-If7r9Y1YzyoxxfoGL1NV"
upstream_id: "f5e5282a-8b2a-a35b-c352-e5466e394b7d"
total_cost: 0.0024608
cache_discount: 0.0006912
upstream_inference_cost: 0
provider_name: "xAI"
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
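The three evaluators are regular-expression matches, one per color. As a rough sketch of how such a check might be run, the Python below translates the patterns from JavaScript literal syntax (dropping the `/` delimiters and the escaped slash). The function names and the `sample` string are hypothetical stand-ins; this is not the benchmark's actual harness, and `sample` is not the recorded model response.

```python
import re

# Colors checked by the three evaluators listed in the evaluation details.
COLORS = ("blue", "green", "red")


def section_pattern(color: str) -> re.Pattern:
    """Build the evaluator pattern for one color (translated from the JS literal)."""
    return re.compile(
        r'<nc-section data-title="[^"]*" data-color="' + color + r'">[^<]+</nc-section>'
    )


def evaluate(response: str) -> dict:
    """Return a pass/fail flag per color, True when the pattern is found."""
    return {color: bool(section_pattern(color).search(response)) for color in COLORS}


# Hypothetical stand-in response with placeholder prose.
sample = (
    '<nc-section data-title="Version 1" data-color="red"> Prose A. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> Prose B. </nc-section> '
    '<nc-section data-title="Version 3" data-color="blue"> Prose C. </nc-section>'
)

print(evaluate(sample))
```

Because the body must match `[^<]+`, a section that is empty or contains any nested tag fails the check, and the attributes must appear in the exact order `data-title` then `data-color`.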