NC Bench
Run: 2026-05-01-01KQHR2FQ6X1HY830XBGAWHWDS
Run Details
Model: Grok 4.20 (Reasoning)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 24.1s
Cost: $0.005877
Provider: xAI
Evaluation summary
Score    Evaluator
Pass     Match blue prose section
Pass     Match green prose section
Pass     Match red prose section
no eval  Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
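The tool described in the system prompt can be read back out of a response with a small parser. The following is a minimal sketch, not the Novelcrafter implementation: the names `parse_sections`, `SECTION_RE`, and `ALLOWED_COLORS` are hypothetical, and the pattern assumes the attributes appear in the order used in the system prompt's example (`data-title` first, then `data-color`).

```python
import re

# Colors the system prompt lists as valid values for data-color.
ALLOWED_COLORS = {"red", "green", "blue", "yellow", "purple", "orange", "pink", "brown"}

# Hypothetical pattern: assumes data-title precedes data-color,
# as in the example usage from the system prompt.
SECTION_RE = re.compile(
    r'<nc-section data-title="([^"]*)" data-color="([^"]*)">(.*?)</nc-section>',
    re.DOTALL,
)


def parse_sections(response: str) -> list[dict]:
    """Return one dict per <nc-section> block found in the response."""
    sections = []
    for title, color, body in SECTION_RE.findall(response):
        sections.append(
            {
                "title": title,
                "color": color,
                # Flag colors outside the list given in the system prompt.
                "valid_color": color in ALLOWED_COLORS,
                "body": body.strip(),
            }
        )
    return sections


# The example usage quoted in the system prompt.
example = (
    '<nc-section data-title="Chapter 1: The Arrival" data-color="blue"> '
    "Frederick, a young squire, arrives at the castle to begin his training. "
    "</nc-section>"
)
parsed = parse_sections(example)
```

Applied to a response for this scenario, the parsed list could then be checked for one section each in red, green, and blue.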
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: The Knight's Greeting" data-color="red"> As I slowed Thunder to a steady trot and guided him into the torchlit stable yard, the last rays of sunset painted the stone walls blood-red. A tall, armored figure emerged from the shadows between the stalls, his crimson cloak stirring in the evening breeze. "Hail, squire," the man called, his voice resonant and unyielding. "I am Sir Roland, knight of this hold. You ride with fire in your blood, Frederick—Thunder is a beast worthy of legend. Come, dismount. We'll see your horse properly cared for before the real work of your training begins." </nc-section> <nc-section data-title="Version 2: The Knight's Greeting" data-color="green"> The stable lanterns flickered to life as I dismounted and led Thunder through the wide oak doors, the scent of fresh straw and oiled leather welcoming me. A broad knight in forest-green tabard stepped forward, his weathered face splitting into a genuine smile beneath a neatly trimmed beard. "Well met, Frederick!" Sir Roland boomed warmly, clapping a heavy hand on my shoulder. "Word of your arrival reached me before your horse's hooves even touched the courtyard. That stallion of yours carries the spirit of the old bloodlines. Let's get him rubbed down and fed—then you and I shall speak of the path that lies ahead for a squire with such promise." </nc-section> <nc-section data-title="Version 3: The Knight's Greeting" data-color="blue"> Twilight had fully claimed the castle by the time I guided Thunder into the shadowed stables, the banners above snapping like distant thunder. A knight in deep sapphire armor waited near the water trough, his piercing gaze seeming to cut through the gloom and measure my very soul. "Frederick," he said quietly, the name rolling from his tongue like a prophecy. "I am Sir Roland, sworn guardian of these walls. Few arrive at dusk as you have, with the wind of distant hills still clinging to their cloak. 
The last dragon's shadow stirs in old tales once more—perhaps your coming is no coincidence. Tend to your mount, then join me by the forge. Your training begins tonight." </nc-section>
created_at: "2026-05-01T12:26:31.532Z" (5/1/2026, 12:26:31 PM)
model: "x-ai/grok-4.20-20260309"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 18189
moderation_latency: (null)
generation_time: 23972
tokens_prompt: 318
tokens_completion: 1033
native_tokens_prompt: 411
native_tokens_completion: 2172
native_tokens_completion_images: (null)
native_tokens_reasoning: 1708
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00587655
router: (null)
provider_responses
0
endpoint_id: "3ecee37f-b217-4093-87fb-aaf0afe307af"
id: "477be3ea-e84c-977d-8331-41c89aa234eb"
is_byok: false
latency: 105
model_permaslug: "x-ai/grok-4.20-20260309"
provider_name: "xAI"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777638391-krLgeqZ5fvwbWqNZ65ul"
session_id: (null)
api_type: "completions"
id: "gen-1777638391-v77OK4w7Kc0BRjSwe69x"
upstream_id: "477be3ea-e84c-977d-8331-41c89aa234eb"
total_cost: 0.00587655
cache_discount: 0.0000672
upstream_inference_cost: 0
provider_name: "xAI"
response_cache_source_id: (null)
Evaluation details
Result  Evaluator                  Details                                                                     Meta Data
Pass    Match blue prose section   /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/    n/a
Pass    Match green prose section  /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/   n/a
Pass    Match red prose section    /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/     n/a
no eval
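The three evaluator patterns above can be reproduced outside the benchmark. The sketch below copies them verbatim (minus the `/.../` delimiters) and applies them as unanchored searches. That search behavior, the `evaluate` helper, and the "Fail" label are assumptions, since the report only shows "Pass" and "no eval" and does not say how the patterns are run.

```python
import re

# Patterns copied from the evaluation details, without the /.../ delimiters.
EVALUATORS = {
    "Match blue prose section": r'<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>',
    "Match green prose section": r'<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>',
    "Match red prose section": r'<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>',
}


def evaluate(response: str) -> dict[str, str]:
    """Run each pattern as an unanchored search (assumed) over the response."""
    return {
        # "Fail" is a hypothetical label; the report only shows "Pass".
        name: "Pass" if re.search(pattern, response) else "Fail"
        for name, pattern in EVALUATORS.items()
    }


# Shortened stand-in for a model response with one section per color.
sample = (
    '<nc-section data-title="Version 1" data-color="red"> Red text. </nc-section> '
    '<nc-section data-title="Version 2" data-color="green"> Green text. </nc-section> '
    '<nc-section data-title="Version 3" data-color="blue"> Blue text. </nc-section>'
)
results = evaluate(sample)
```

Because the body is matched with `[^<]+`, a section that contains any `<` character (for example inline markup) or that is empty would not match, and the attribute order `data-title` then `data-color` is required.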