NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-12-01KKHZSTBFT4YN5ZBA4V9QBH2N
Run Details
Model:
Grok 4.20 (Beta)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
2.3s
Cost:
$0.002365
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> As Frederick guided Thunder toward the stables, a broad-shouldered man in gleaming plate armor stepped out from the shadows of the doorway. Sir Roland, the local knight whose name was known even in the smallest villages, regarded the young squire with a measured gaze. "You must be the new arrival," he said, his voice deep and resonant like distant thunder. "Dismount, lad. The stables are no place for hesitation. Your training begins the moment your boots touch this ground." </nc-section> <nc-section data-title="Version 2" data-color="green"> The moment Frederick slowed his horse near the weathered stable doors, a figure emerged carrying a saddle over one arm. Sir Roland's weathered face broke into a rare smile as he recognized the crest on the boy's cloak. "Ho there, young Frederick!" the knight called out, setting the saddle aside. "I was told to expect you before nightfall. Come, let's see to your mount first. A knight is only as reliable as the care he shows his steed." His tone was warm, almost fatherly, as he approached with a confident stride. </nc-section> <nc-section data-title="Version 3" data-color="blue"> Frederick had barely dismounted when heavy footsteps approached from behind the stable wall. Sir Roland appeared, his armor catching the last rays of the setting sun, making him look like a figure stepped out of legend. The knight's eyes narrowed slightly as he studied the dusty traveler. "Frederick of the eastern marches, I presume," he stated rather than asked, crossing his arms over his broad chest. "The castle has been expecting you. But tell me this before we proceed: why does a squire arrive alone, without escort or banner?" </nc-section>
created_at
"2026-03-12T21:38:48.82Z"
3/12/2026, 9:38:48 PM
model
"x-ai/grok-4.20-beta-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
316
moderation_latency
(null)
generation_time
1906
tokens_prompt
318
tokens_completion
443
native_tokens_prompt
409
native_tokens_completion
373
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
384
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0023648
router
(null)
provider_responses
0
endpoint_id
"33f49f24-a0de-4dcf-872d-ca514e83fbae"
id
"476f139f-ca93-ec3f-8c73-78438fd80ccf"
is_byok
false
latency
243
model_permaslug
"x-ai/grok-4.20-beta-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773351528-kNORs2C6kXHfVuklFgl8"
upstream_id
"476f139f-ca93-ec3f-8c73-78438fd80ccf"
total_cost
0.0023648
cache_discount
0.0006912
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval