NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY9VSQF460H2FAKQRR2R0VT
Run Details
Model:
Grok 4 Fast
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
3.6s
Cost:
$0.000363
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: A Warm Welcome" data-color="red"> As I reined in Thunder near the stables, a tall figure emerged from the shadows of the hayloft. Sir Roland, the local knight renowned for his valor in the border skirmishes, strode forward with a broad smile. His armor gleamed faintly in the twilight, and his voice boomed like a war drum. "Young Frederick! You've made good time from the villages. Welcome to Castle Ebonspire. Dismount, lad, and let us see to your steed before we discuss your squire's oath." </nc-section> <nc-section data-title="Version 2: An Ominous Greeting" data-color="green"> Thunder's hooves clattered to a halt by the stable doors, steam rising from his flanks. From the dim interior stepped Sir Roland, his face etched with the scars of old battles, eyes sharp as a hawk's. He regarded me with a nod that was more appraisal than warmth. "Frederick, is it? The master-at-arms sent word of your coming. Hand the reins to the stable boy and follow me inside—the shadows lengthen, and there's much to prepare before the morrow's trials." </nc-section> <nc-section data-title="Version 3: A Humorous Encounter" data-color="blue"> I guided Thunder to the stables just as the sun dipped below the horizon, only to nearly tumble from the saddle when a booming laugh echoed from the stalls. Sir Roland, the knight with a reputation for tales taller than the castle towers, clapped me on the back with a force that rattled my teeth. "By the gods, Frederick, you ride like a storm cloud! Come, stable your beast before he eats my favorite mare's mane. Then tell me—did the village gossips warn you about the dragon's ghost haunting these halls?" </nc-section>
created_at
"2026-02-20T19:54:09.016Z"
2/20/2026, 7:54:09 PM
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1592
moderation_latency
(null)
generation_time
3218
tokens_prompt
318
tokens_completion
431
native_tokens_prompt
440
native_tokens_completion
682
native_tokens_completion_images
0
native_tokens_reasoning
294
native_tokens_cached
439
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00036315
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"4e078608-d449-903b-ed28-25f093b70749"
is_byok
false
latency
77
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771617249-GOB74eFS5SylLdU76AeE"
upstream_id
"4e078608-d449-903b-ed28-25f093b70749"
total_cost
0.00036315
cache_discount
0.00006585
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval