NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY8WFCWMG28QSNWZQ23DSNX
Run Details
Model:
Grok 4
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
18.5s
Cost:
$0.011394
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1 (red)" data-color="red"> As I dismounted Thunder near the stables, the earthy scent of hay and horse sweat filled the air. A towering figure approached, clad in gleaming armor etched with dragon motifs. "Welcome, young Frederick," boomed Sir Roland, the local knight, his voice like rolling thunder. His eyes, sharp as a falcon's, appraised me from beneath bushy brows. "I've heard tales of your lineage. Let us see if you live up to it." He extended a gauntleted hand, firm and unyielding, pulling me into the world of chivalry and peril that awaited within the castle walls. </nc-section> <nc-section data-title="Version 2 (green)" data-color="green"> Reining in Thunder at the edge of the stables, I felt the cool evening mist settle on my skin. From the shadows emerged Sir Roland, the esteemed local knight, his cloak embroidered with emerald leaves symbolizing the ancient forests that bordered the kingdom. "Ah, Frederick, the squire from the distant hills," he greeted with a warm, resonant chuckle, clapping me on the shoulder like an old friend. "The castle has been abuzz with your arrival. Come, stable your beast and join me for a tankard—training begins at dawn, but stories are best shared under the stars." </nc-section> <nc-section data-title="Version 3 (blue)" data-color="blue"> I guided Thunder into the stables' dim glow, where lanterns flickered like distant stars. Waiting there was Sir Roland, the vigilant local knight, his blue surcoat flowing like a river in the breeze. "Hail, Frederick," he said in a measured tone, his gaze steady and wise, reflecting years of battles won and lost. "The winds of fate have brought you here. Dismount and heed my words: the path of a squire is fraught with trials, but valor shall be your guide." With a nod, he gestured toward the castle's heart, where shadows whispered of dragons long forgotten. </nc-section>
created_at
"2026-02-20T19:37:02.629Z"
2/20/2026, 7:37:02 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
7724
moderation_latency
(null)
generation_time
18449
tokens_prompt
318
tokens_completion
486
native_tokens_prompt
969
native_tokens_completion
711
native_tokens_completion_images
0
native_tokens_reasoning
286
native_tokens_cached
968
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.011394
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"752d3beb-b585-16c1-e0ca-45b71ed5a72c"
is_byok
false
latency
186
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771616222-W3vTyjsxkSy433UbeuxQ"
upstream_id
"752d3beb-b585-16c1-e0ca-45b71ed5a72c"
total_cost
0.011394
cache_discount
0.002178
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval