NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-19-01KS0TQBT9D326MENX22DKBVEV
Run Details
Model:
Gemini 3.5 Flash (Reasoning)
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
9.1s
Cost:
$0.016089
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: A Grim Welcome" data-color="red"> As Thunder’s hooves clattered onto the cobbled courtyard near the stables, a towering figure stepped out from the shadows of the wooden overhang. Sir Roland was clad in scarred, unpolished plate armor, looking more like a battered iron statue than a man. He didn't smile. As I dismounted, my legs shaking slightly from the long, grueling ride, he stepped into my path. "You're late, boy," Roland grunted, his voice like stones grinding together in a mill. He snatched Thunder’s reins from my hands with a gauntleted fist, his eyes scanning me from head to toe with blatant disappointment. "If you survived the road, you might survive your first week. Rub down your beast, carry your own gear, and then report directly to the armory. Your training begins tonight, squire. There is no room for softness in these walls." </nc-section> <nc-section data-title="Version 2: The Warmth of the Hearth" data-color="green"> I pulled Thunder to a halt by the fragrant, hay-scented warmth of the stables. A man with graying hair and a kind, heavily weathered face was brushing down a sleek chestnut mare nearby. At the sound of our approach, he looked up, tossed a grooming rag over his broad shoulder, and smiled warmly. "Ah, young Frederick! We've been keeping an eye on the road for you," Sir Roland said, stepping forward with an outstretched hand clad in a simple, well-worn leather glove. He gave Thunder’s neck an appreciative, heavy pat. "A fine, sturdy beast you have here. Come, let’s get him fed and watered together. You look like you've ridden half the kingdom today, and a weary squire is no use to anyone. Get some food in your belly and rest tonight, for tomorrow we begin the real work." </nc-section> <nc-section data-title="Version 3: An Eye for Potential" data-color="blue"> The long shadow of the great stone stables swallowed the last rays of twilight as I dismounted. Standing by the stone archway was a man of sharp features and immaculate, rigid posture. His navy-blue cloak, fastened with a silver brooch shaped like a dragon's wing, rustled in the cool evening breeze. Sir Roland watched my descent with calculating, ice-blue eyes, taking note of how I held the reins and the alignment of my spine. "A bit unsteady on the left stirrup, Squire Frederick," he remarked, his voice smooth, precise, and devoid of malice. He bowed his head just a fraction, a gesture of formal, traditional knightly protocol. "But you have the frame of a true rider, and your mount is well-cared for. That bodes well. The path before you is fraught with ancient secrets and grueling discipline. Welcome to the High Keep. Let us see if your spirit matches the reputation of your house." </nc-section>
created_at
"2026-05-19T19:17:11.118Z"
5/19/2026, 7:17:11 PM
model
"google/gemini-3.5-flash-20260519"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1725
moderation_latency
(null)
generation_time
9028
tokens_prompt
318
tokens_completion
1164
native_tokens_prompt
304
native_tokens_completion
1737
native_tokens_completion_images
0
native_tokens_reasoning
1085
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
service_tier
"standard"
usage
0.016089
router
(null)
provider_responses
0
endpoint_id
"bd50061c-e1d8-469e-96b1-d322e77378f8"
id
"N7cMarrZEr3Vz7IPjY_kiQo"
is_byok
false
latency
1724
model_permaslug
"google/gemini-3.5-flash-20260519"
provider_name
"Google AI Studio"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779218231-Z5wanPXzxCcmGppflXo0"
session_id
(null)
api_type
"completions"
id
"gen-1779218231-CAZrAWChq1m4mxrfbwfk"
upstream_id
"N7cMarrZEr3Vz7IPjY_kiQo"
total_cost
0.016089
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval