NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHQES2NEDVNNF2VVCVK0GBM
Run Details
Model:
Grok 4.20 (Reasoning)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
30.9s
Cost:
$0.007037
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1" data-color="red"> Frederick guided Thunder through the towering castle gates and toward the stables, the horse's hooves clacking against worn cobblestones. The scent of hay and leather filled the air as he dismounted. From the shadows of the stable entrance stepped a tall, armored figure. "Well met, young Frederick," the knight called out warmly. "I am Sir Roland, keeper of these stables and trainer of squires. Your arrival has been anticipated. Let us see to your mount before we speak of the trials that await you within these walls." </nc-section> <nc-section data-title="Version 2" data-color="green"> As the banners fluttered above and the sun dipped below the battlements, Frederick steered Thunder toward the dimly lit stables. The stallion snorted, sensing the end of their long journey. Before he could loosen the saddle, a sturdy knight with a weather-beaten face emerged, torch in hand. "Hold, squire," the man said, his voice steady as ancient oak. "I am Sir Roland, the local knight charged with greeting all newcomers. You've ridden hard—your eyes show it. Come, we'll tend to Thunder together and speak of the dragon's shadow that looms over your training." </nc-section> <nc-section data-title="Version 3" data-color="blue"> Frederick slowed his majestic stallion to a walk as they passed beneath the gatehouse, the fading sunset painting the stables in hues of amber and rose. He patted Thunder's neck, whispering words of praise for the faithful steed. Near the water trough, a broad-shouldered knight in polished plate turned to face him. "Hail, Frederick the squire," boomed Sir Roland, the local knight whose reputation for valor echoed across the realm. "Your timing is fortunate; the castle stirs with portents of the Last Dragon. Stable your horse and join me—I would assess your mettle before the moon rises." </nc-section>
created_at
"2026-05-01T12:15:45.755Z"
5/1/2026, 12:15:45 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
25418
moderation_latency
(null)
generation_time
30648
tokens_prompt
318
tokens_completion
1088
native_tokens_prompt
411
native_tokens_completion
2636
native_tokens_completion_images
(null)
native_tokens_reasoning
2230
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00703655
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"6fe4582f-b0f5-9673-9012-bac38629c60b"
is_byok
false
latency
397
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777637745-jug44xhmOz4sUiNjAdoD"
session_id
(null)
api_type
"completions"
id
"gen-1777637745-8M8dPGNp4wwouwHHGyNV"
upstream_id
"6fe4582f-b0f5-9673-9012-bac38629c60b"
total_cost
0.00703655
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval