NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-12-01KKHZCTBJ8KMFJ79QWFTD52F2
Run Details
Model:
Grok 4.20 (Beta)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
2.2s
Cost:
$0.002527
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> As Frederick rode through the castle gates and dismounted near the ancient stables, a tall, broad-shouldered figure emerged from the shadows. Sir Roland, the local knight renowned for his valor in the Border Wars, stepped forward with a firm stride. His armor gleamed under the fading sunlight, and a kindly yet stern smile crossed his weathered face. "You must be the new squire," he said, his voice deep and resonant. "Welcome to Castle Eldrath, lad. You'll find no softer training here than the anvil of true duty." </nc-section> <nc-section data-title="Version 2" data-color="green"> Frederick guided Thunder toward the stables, the horse's hooves clattering against the cobblestones. Before he could properly tether the beast, a commanding voice called out from behind a nearby stall. Sir Roland appeared, wiping his hands on a worn leather cloth, his chainmail partially covered by a simple tunic. The knight's eyes sparkled with the fire of experience as he regarded the young arrival. "Frederick, is it? I've been expecting you. Come, let's see to your mount first. A knight is only as strong as the bond he shares with his steed." </nc-section> <nc-section data-title="Version 3" data-color="blue"> The evening mist began to rise as Frederick reached the stables and swung down from his saddle. From the arched doorway stepped Sir Roland, the castle's master-at-arms, his presence as imposing as the legends that preceded him. The knight's cloak billowed slightly in the breeze, and he placed a gauntleted hand on Frederick's shoulder. "Easy now, young squire," he said with a gruff chuckle. "The road has been long, but your true journey begins here. Tell me, do you come seeking glory, or have the old tales of dragons finally stirred your blood?" </nc-section>
created_at
"2026-03-12T21:31:42.841Z"
3/12/2026, 9:31:42 PM
model
"x-ai/grok-4.20-beta-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
256
moderation_latency
(null)
generation_time
1899
tokens_prompt
318
tokens_completion
465
native_tokens_prompt
409
native_tokens_completion
400
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
384
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0025268
router
(null)
provider_responses
0
endpoint_id
"33f49f24-a0de-4dcf-872d-ca514e83fbae"
id
"70cb6241-a71b-9d7e-0d52-6164e1ebcb8e"
is_byok
false
latency
246
model_permaslug
"x-ai/grok-4.20-beta-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773351102-5wI64BV3DRYEntDEdv9N"
upstream_id
"70cb6241-a71b-9d7e-0d52-6164e1ebcb8e"
total_cost
0.0025268
cache_discount
0.0006912
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval