NC Bench
Run: 2026-05-01-01KQHSNQ6TR9P7VSMHRRGYME42
Run Details
Model: Grok 4.20 (Reasoning)
Test: Tool usage within Novelcrafter
Scenario: Create alternate prose sections
Duration: 36.4s
Cost: $0.007412
Provider: xAI
Evaluation summary
Score    Evaluator
Pass     Match blue prose section
Pass     Match green prose section
Pass     Match red prose section
no eval  Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
AI
<nc-section data-title="Version 1" data-color="red"> I guided Thunder through the castle gates and veered toward the long row of wooden stables just as the last rays of sunset bled across the sky. The smell of hay and oiled leather drifted on the evening breeze. Before I could swing down from the saddle, a tall figure stepped from the shadows between the stalls, armor catching the fading light like polished bronze. The local knight, Sir Roland, greeted me near the stables with a broad grin that split his weathered face. "So this is the eager squire everyone's been whispering about," he rumbled, offering a gauntleted hand. "Frederick, isn't it? Come, lad. We'll see to your horse first. A knight's mount is his second soul, after all." </nc-section> <nc-section data-title="Version 2" data-color="green"> The gates groaned shut behind me as I rode the final stretch to the stables, Thunder's hooves clopping softly against packed earth. Lanterns had already been lit along the eaves, painting the twilight in strokes of gold. A knight in a deep-green cloak stood beside a water trough, brushing down a dappled warhorse of his own. He looked up at my approach, eyes sharp beneath a scarred brow. The local knight, Sir Roland, greeted me near the stables with a short, appraising nod. "You're later than the ravens promised," he said, voice low and steady as tempered steel. "But you're here. That's the first test passed. Hand me those reins, Frederick. No squire of mine begins his training with a poorly tended mount." </nc-section> <nc-section data-title="Version 3" data-color="blue"> Dusk had fully claimed the courtyard by the time I reached the stables, the banners I had seen from the fields now hanging still and dark above me. I slid from Thunder's back, boots sinking into fresh straw, when a quiet cough drew my gaze. A knight stood in the wide doorway, his blue surcoat embroidered with a coiled silver dragon that seemed almost to shimmer in the lantern light. 
The local knight, Sir Roland, greeted me near the stables, studying me with an intensity that made the hairs on my neck rise. "Frederick," he said, speaking my name as though it carried weight beyond its letters. "The wind told me a squire was coming. It did not say the last dragon would be watching through his eyes. Come inside, lad. We have much to discuss before your training even begins." </nc-section>
created_at
"2026-05-01T12:54:30.367Z"
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
30091
moderation_latency
(null)
generation_time
36287
tokens_prompt
318
tokens_completion
1200
native_tokens_prompt
411
native_tokens_completion
2786
native_tokens_completion_images
(null)
native_tokens_reasoning
2262
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00741155
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"061eda66-1ac6-92b3-aae8-b07a801bd3bf"
is_byok
false
latency
96
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777640070-XBvt6QeVoK3G6GadfeZQ"
session_id
(null)
api_type
"completions"
id
"gen-1777640070-IRxkQco0sFkhO6OZ4uVL"
upstream_id
"061eda66-1ac6-92b3-aae8-b07a801bd3bf"
total_cost
0.00741155
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result   Evaluator                   Details                                                                    Meta Data
Pass     Match blue prose section    /<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/     n/a
Pass     Match green prose section   /<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/    n/a
Pass     Match red prose section     /<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/      n/a
no eval
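The three evaluators are plain regular-expression checks, one per color. The sketch below shows how such checks could be applied to a model response. It is not the benchmark's actual implementation: the names `sectionPattern` and `evaluateSections` and the result shape are hypothetical, and only the pattern itself is taken from this page. Each pattern requires a non-empty section body that contains no `<` character, so an empty section or one with nested markup would not match.

```typescript
// Hypothetical helper names; only the regex shape comes from the evaluator details.
const COLORS = ["red", "green", "blue"] as const;
type Color = (typeof COLORS)[number];

// Build the per-color pattern. The "/" needs no escaping when the
// pattern is passed to the RegExp constructor as a string.
function sectionPattern(color: Color): RegExp {
  return new RegExp(
    `<nc-section data-title="[^"]*" data-color="${color}">[^<]+</nc-section>`
  );
}

// Report, for each color, whether the response contains a matching section.
function evaluateSections(response: string): Record<Color, boolean> {
  const result = {} as Record<Color, boolean>;
  for (const color of COLORS) {
    result[color] = sectionPattern(color).test(response);
  }
  return result;
}

// Made-up sample input: a valid red section and an empty green section.
const sample =
  '<nc-section data-title="Version 1" data-color="red">Sir Roland greets Frederick.</nc-section>' +
  '<nc-section data-title="Version 2" data-color="green"></nc-section>';

console.log(evaluateSections(sample));
```

In this sample only the red check would pass: the green section has an empty body, which `[^<]+` rejects, and there is no blue section at all. The checks do not look at the prose itself, so a pass says only that the tool call was well-formed for that color.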