NC Bench
Run: 2026-05-01-01KQHRH2WTRPYDM5N2YB1GA2EA
Run Details
Model: Grok 4.20
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 5.1s
Cost: $0.001387
Provider: xAI
Evaluation summary
Score | Evaluator
Pass | Match blue prose section
Pass | Match green prose section
Pass | Match red prose section
no eval | Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon".

The following tools are available to you:
- <nc-section> Use this tool to create a new section of prose.
  Arguments:
  - "data-title": The title of the section.
  - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown).
  Example usage:
  <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions:
"""The local knight, Sir Roland, greets Frederick near the stables."""

Create 3 alternate prose sections that continue the story.
- Version 1 (red)
- Version 2 (green)
- Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> As Frederick dismounted Thunder near the castle stables, a burly figure emerged from the shadows of the wooden beams. Sir Roland, the local knight whose reputation for unyielding discipline preceded him, strode forward with a clank of armor. "Hail, young squire," he boomed, his voice echoing off the stone walls. "I am Sir Roland. You've arrived just in time—the training grounds await, and they brook no delay from green lads like you." </nc-section> <nc-section data-title="Version 2" data-color="green"> Thunder's hooves crunched on the gravel as Frederick guided him toward the stables. Before he could fully tether the stallion, a knight with a stern gaze and a scar across his cheek approached from the paddock. It was Sir Roland, the castle's grizzled master-at-arms. "Well met, Frederick," he said, clasping the squire's shoulder with a callused hand. "I've been expecting you. Leave the horse to the grooms; your first lesson in swordplay begins at dawn, and the blade cares not for weary travelers." </nc-section> <nc-section data-title="Version 3" data-color="blue"> Near the dimly lit stables, where the scent of hay and leather hung thick in the evening air, Frederick led his faithful steed to a hitching post. From the adjacent armory stepped Sir Roland, the renowned local knight, his polished plate armor gleaming under lantern light. With a nod of approval, he greeted the newcomer: "Frederick, squire of the realm—your arrival honors us. I am Sir Roland. Stable your mount swiftly, for the path of knighthood is paved with sweat and steel, and I shall be your guide through its trials." </nc-section>
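A consumer of this response would need to split it back into its titled, colored sections. The following is a minimal sketch of that step in Python, assuming the sections are never nested and that attributes appear in the order shown in the system prompt; `Section` and `parse_sections` are hypothetical names for illustration, not part of Novelcrafter or NC Bench.

```python
import re
from typing import NamedTuple


class Section(NamedTuple):
    title: str
    color: str
    body: str


# Assumes data-title comes before data-color, as in the prompt's example usage.
SECTION_RE = re.compile(
    r'<nc-section data-title="([^"]*)" data-color="([^"]*)">(.*?)</nc-section>',
    re.DOTALL,  # let a section body span several lines
)


def parse_sections(response: str) -> list[Section]:
    """Return every nc-section found in the response, in document order."""
    return [
        Section(title, color, body.strip())
        for title, color, body in SECTION_RE.findall(response)
    ]


# Shortened, made-up sample in the same shape as the response above.
sample = (
    '<nc-section data-title="Version 1" data-color="red"> '
    "Sir Roland strode forward. </nc-section> "
    '<nc-section data-title="Version 2" data-color="green"> '
    '"Well met, Frederick," he said. </nc-section>'
)

sections = parse_sections(sample)
for section in sections:
    print(section.color, section.title, len(section.body))
```

A regular expression is enough here only because the tag format is fixed and flat; a response that nests tags or reorders attributes would need a real markup parser.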
created_at: "2026-05-01T12:34:29.922Z" (5/1/2026, 12:34:29 PM)
model: "x-ai/grok-4.20-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 339
moderation_latency: (null)
generation_time: 5012
tokens_prompt: 318
tokens_completion: 428
native_tokens_prompt: 409
native_tokens_completion: 377
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00138655
router: (null)
provider_responses:
  0:
    endpoint_id: "3ecee37f-b217-4093-87fb-aaf0afe307af"
    id: "e7138703-0954-5c07-dec0-c10092e4bcf0"
    is_byok: false
    latency: 68
    model_permaslug: "x-ai/grok-4.20-20260309"
    provider_name: "xAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777638869-U0QsJ90tOkevfF5E4li0"
session_id: (null)
api_type: "completions"
id: "gen-1777638869-rWA2aH1fZFEIcyah3Fh5"
upstream_id: "e7138703-0954-5c07-dec0-c10092e4bcf0"
total_cost: 0.00138655
cache_discount: 0.0000672
upstream_inference_cost: 0
provider_name: "xAI"
response_cache_source_id: (null)
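A few derived figures can be read off the Meta fields listed above. The sketch below is illustrative arithmetic only: it assumes `generation_time` is reported in milliseconds (consistent with the 5.1s duration in Run Details, but not stated on the page) and that `native_tokens_cached` counts a subset of `native_tokens_prompt`.

```python
# Values copied from the Meta listing above.
generation_time_ms = 5012        # assumption: milliseconds
native_tokens_prompt = 409
native_tokens_completion = 377
native_tokens_cached = 64        # assumption: subset of the prompt tokens
total_cost_usd = 0.00138655

# Completion throughput over the generation window.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

# Share of the prompt that was served from the provider's cache.
cached_share = native_tokens_cached / native_tokens_prompt

# Cost if the same run were repeated 1,000 times at the same price.
cost_per_1k_runs = total_cost_usd * 1000

print(f"{tokens_per_second:.1f} completion tokens/s")
print(f"{cached_share:.1%} of prompt tokens cached")
print(f"${cost_per_1k_runs:.2f} per 1,000 runs")
```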
Evaluation details
Result | Evaluator | Details | Meta Data
Pass | Match blue prose section | /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/ | n/a
Pass | Match green prose section | /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/ | n/a
Pass | Match red prose section | /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/ | n/a
no eval
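The three evaluators above are plain regular expressions, so their behavior is easy to reproduce. The sketch below ports the patterns to Python's `re` module and treats an evaluator as passing when the pattern matches anywhere in the response. How NC Bench actually applies them (search versus full match, any flags) is not shown on this page, so that part is an assumption, and `evaluate` is a hypothetical helper name.

```python
import re

# Patterns copied from the Evaluation details table, without the /.../ delimiters.
EVALUATORS = {
    "Match blue prose section":
        r'<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>',
    "Match green prose section":
        r'<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>',
    "Match red prose section":
        r'<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>',
}


def evaluate(response: str) -> dict[str, bool]:
    """Map each evaluator name to True if its pattern is found in the response."""
    return {
        name: re.search(pattern, response) is not None
        for name, pattern in EVALUATORS.items()
    }


# Made-up responses for illustration, not taken from the run.
good = (
    '<nc-section data-title="Version 1" data-color="red"> Red text. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> Green text. </nc-section> '
    '<nc-section data-title="Version 3" data-color="blue"> Blue text. </nc-section>'
)
# The blue body contains a nested tag, which [^<]+ cannot cross.
bad = good.replace("Blue text.", "Blue <b>text</b>.")

good_result = evaluate(good)
bad_result = evaluate(bad)
print(good_result)
print(bad_result)
```

Two properties of the patterns are worth noting: the body must be non-empty and may not contain any `<` character, so a section with nested markup fails; and the attribute order is fixed, so `data-color` placed before `data-title` would also fail.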