NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-08-01KGYXMY5BNTK0JXAP2K222EK2
Run Details
Model:
Gemini 3 Flash (Preview)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
6.3s
Cost:
$0.001463
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
Fail
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="The Stern Welcome" data-color="red"> As I dismounted, the heavy scent of hay and horsehide filled the air. A man clad in worn leather and polished steel stepped from the shadows of the stable eaves. Sir Roland didn't smile; his face was a map of scars earned in the dragon wars. "You're late, boy," he barked, his voice like grinding stones. "Thunder looks winded, and a squire who exhausts his mount before the sun sets is a squire who walks to his death. Strip the gear and rub him down. Your training begins when the horse is tended, and not a moment sooner." </nc-section> <nc-section data-title="A Mentor's Grin" data-color="green"> I was still patting Thunder’s neck when a boisterous laugh echoed off the stone walls. Sir Roland leaned against a stable post, tossing an apple in the air before catching it with a practiced snap. "So, the pride of the valley has finally arrived!" the knight shouted, stepping forward to clasp my shoulder with a gauntleted hand. "You have your father’s eyes, Frederick, but let's hope you have more sense in a saddle. Come, leave the beast to the stablehands for tonight. We have much to discuss before the dragons wake." </nc-section> <nc-section data-title="The Silent Observer" data-color="blue"> The rhythm of the castle was a dull roar of activity, but the stables remained a pocket of cool quiet. Sir Roland was already there, meticulously checking the shoes of a massive warhorse. He didn't look up as I approached, yet he acknowledged my presence with a sharp nod. "The road was clear, I trust?" he asked quietly, finally turning to reveal a gaze that seemed to weigh my very soul. "The Last Dragon is restless, Frederick. The time for childhood is over. Bind your horse and follow me; the King is waiting for news from the border." </nc-section
created_at
"2026-02-08T15:24:20.566391+00:00"
2/8/2026, 3:24:20 PM
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
747
moderation_latency
(null)
generation_time
4264
tokens_prompt
282
tokens_completion
415
native_tokens_prompt
304
native_tokens_completion
437
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.001463
router
(null)
provider_responses
0
status
200
is_byok
false
latency
747
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name
"Google AI Studio"
model_permaslug
"google/gemini-3-flash-preview-20251217"
api_type
"completions"
id
"gen-1770564255-t0lpT9Ld69eW6DRuT8SP"
upstream_id
(null)
total_cost
0.001463
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
Fail
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval