NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHT2FVCFGV65S755V7J7733
Run Details
Model:
Grok 4.20
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
7.9s
Cost:
$0.001449
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> As Frederick dismounted near the stables, the unmistakable figure of Sir Roland emerged from the shadows of the timbered stalls. The grizzled knight, clad in a worn tunic emblazoned with the kingdom's crest, offered a firm nod of greeting. "Young squire Frederick, I presume," Sir Roland said, his voice like gravel under boots. "I've been expecting you. Tend to your horse quickly—the training yard waits for no one, and neither do the dragons that haunt our borders." </nc-section> <nc-section data-title="Version 2" data-color="green"> Thunder snorted and pawed at the straw-strewn ground as Frederick led him toward the stables. Before he could even loosen the saddle, a broad-shouldered man with a scarred face and a mane of silver hair stepped forward—Sir Roland, the local knight whose legends preceded him. "Hail, Frederick!" the knight boomed with a hearty clap on the squire's shoulder. "Welcome to Castle Eldrath. The stables are yours to use, but mark my words: true knighthood isn't forged in iron alone. Come, let's see what fire burns in you before the moon rises." </nc-section> <nc-section data-title="Version 3" data-color="blue"> The stables loomed ahead, their stone arches draped in ivy, when a commanding voice cut through the evening air. "Hold there, lad." Sir Roland strode out from behind a stack of hay bales, his armor glinting faintly in the fading light. The knight's eyes, sharp as a hawk's, appraised Frederick from head to toe. "You must be the new squire. I am Sir Roland, guardian of these lands. See to your mount, then join me by the forge. Whispers of the Last Dragon stir in the mountains, and every blade needs sharpening—including the mettle of boys like you." </nc-section>
created_at
"2026-05-01T13:01:28.818Z"
5/1/2026, 1:01:28 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
423
moderation_latency
(null)
generation_time
7805
tokens_prompt
318
tokens_completion
453
native_tokens_prompt
409
native_tokens_completion
402
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00144905
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"79bab134-9bad-9241-813a-98751f77051a"
is_byok
false
latency
86
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777640488-VKfkWogWyGTEb1IxnzSg"
session_id
(null)
api_type
"completions"
id
"gen-1777640488-JFaGqAJBuG9chr1aG6DU"
upstream_id
"79bab134-9bad-9241-813a-98751f77051a"
total_cost
0.00144905
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval