NC Bench
Run: 2026-05-10-01KR8XGK327EEK3DVNA9FGM0MY
Run Details
Model:
Gemma 4 31B
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
32.2s
Cost:
$0.000233
Provider:
DeepInfra
Evaluation summary
| Score | Evaluator | Details Link |
| --- | --- | --- |
| Pass | Match blue prose section | (details) |
| Pass | Match green prose section | (details) |
| Pass | Match red prose section | (details) |
| no eval | Overall | |
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: The Stern Welcome" data-color="red"> As I brought Thunder to a halt near the stables, a man with a face like etched granite stepped forward. He wore a suit of scarred plate armor that had seen a dozen campaigns, and his gaze was piercing. This was Sir Roland. He didn't smile; instead, he looked me up and down with a critical eye, as if searching for a reason to send me packing. "You're late, boy," he barked, his voice sounding like grinding stones. "In this castle, punctuality is the difference between a victory and a funeral. Get your gear off that beast before I decide you're unfit for the stables, let alone the sword." </nc-section> <nc-section data-title="Version 2: The Jovial Mentor" data-color="green"> I had barely dismounted when a booming laugh echoed through the courtyard. A broad-shouldered knight with a magnificent, curling beard and a bright crimson cloak swept toward me. Sir Roland beamed, clapping a heavy hand onto my shoulder that nearly knocked the wind out of me. "Ha! Look at the fire in this one's eyes!" he roared, his voice full of genuine warmth. "Welcome to the fold, Frederick! You've arrived just in time for the evening feast. Come, let us get your horse settled, and I shall tell you the tales of the Great Wyrm while we drink some ale!" </nc-section> <nc-section data-title="Version 3: The Weary Veteran" data-color="blue"> A lean man leaned against the stable fence, polishing a gauntlet with a piece of oiled leather. He looked tired, with deep lines around his eyes and a silvering beard, but there was a quiet dignity to his posture. He looked up as I approached, offering a small, knowing nod. "Easy there, Thunder," he murmured to the horse before looking at me. "I am Sir Roland. You must be the new squire." He sighed, a sound of long-term exhaustion, yet his eyes held a flicker of kindness. "It's a hard life we lead, Frederick, and the days are long. 
But if you have the heart for it, I'll make sure you survive the training." </nc-section>
created_at
"2026-05-10T12:24:15.303Z"
5/10/2026, 12:24:15 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
180
moderation_latency
(null)
generation_time
19771
tokens_prompt
318
tokens_completion
515
native_tokens_prompt
329
native_tokens_completion
500
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00023277
router
(null)
provider_responses
0
endpoint_id
"29b94af5-d8f8-419a-9cf1-ae1f2831142b"
is_byok
false
latency
98
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Venice"
status
429
1
endpoint_id
"69279254-6a96-4e9d-abc6-ea76e6bf8e97"
is_byok
false
latency
385
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Novita"
status
429
2
endpoint_id
"576fca5b-f09c-4bda-b560-9989c39ac3ab"
is_byok
false
latency
5112
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Together"
status
429
3
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-REGbT9pdO3eG8pQ3TQB1InaI"
is_byok
false
latency
66
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778415855-Xqjh7FbzfxEGSJ4yhGZj"
session_id
(null)
api_type
"completions"
id
"gen-1778415855-kVZedLAF944ecJiO1Hdd"
upstream_id
"chatcmpl-REGbT9pdO3eG8pQ3TQB1InaI"
total_cost
0.00023277
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
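
The provider_responses entries above record four attempts: Venice, Novita, and Together each returned status 429, and DeepInfra returned 200. The following is a minimal sketch of how such a list could be summarized. It assumes the status values are HTTP status codes (429 meaning "Too Many Requests"), and the `summarize_attempts` helper and the dictionary layout are hypothetical, not part of NC Bench or of the provider's API.

```python
from typing import Optional, TypedDict


class Attempt(TypedDict):
    provider_name: str
    status: int
    latency: int  # unit is not stated in the run metadata


# Values copied from the provider_responses entries of this run.
ATTEMPTS: list[Attempt] = [
    {"provider_name": "Venice", "status": 429, "latency": 98},
    {"provider_name": "Novita", "status": 429, "latency": 385},
    {"provider_name": "Together", "status": 429, "latency": 5112},
    {"provider_name": "DeepInfra", "status": 200, "latency": 66},
]


def summarize_attempts(attempts: list[Attempt]) -> dict:
    """Hypothetical helper: report which provider served the request.

    Assumes `status` is an HTTP status code, so 200 is a success and
    429 is a rate-limit rejection.
    """
    served: Optional[Attempt] = next(
        (a for a in attempts if a["status"] == 200), None
    )
    rate_limited = [a["provider_name"] for a in attempts if a["status"] == 429]
    return {
        "served_by": served["provider_name"] if served else None,
        "rate_limited": rate_limited,
        "attempt_count": len(attempts),
    }


if __name__ == "__main__":
    print(summarize_attempts(ATTEMPTS))
```

Read this way, the result agrees with the top-level provider_name field ("DeepInfra") reported for the run.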
Evaluation details
| Result | Evaluator | Details | Meta Data |
| --- | --- | --- | --- |
| Pass | Match blue prose section | `/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/` | n/a |
| Pass | Match green prose section | `/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/` | n/a |
| Pass | Match red prose section | `/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/` | n/a |
| no eval | | | |
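
The three evaluator patterns listed above can be tried against a response with a few lines of Python. This is a sketch under assumptions, not the benchmark's actual evaluator: the `build_pattern` and `evaluate` helpers are hypothetical names, the patterns are transcribed from the JavaScript-style literals into Python's `re` syntax, and the sample text is an abbreviated stand-in for the model response.

```python
import re

COLORS = ("red", "green", "blue")


def build_pattern(color: str) -> re.Pattern:
    """Python equivalent of the evaluator regex for one color.

    The section body must be non-empty and contain no '<' character,
    and data-title must come before data-color.
    """
    return re.compile(
        r'<nc-section data-title="[^"]*" data-color="'
        + re.escape(color)
        + r'">[^<]+</nc-section>'
    )


def evaluate(response: str) -> dict:
    """Return a pass/fail flag per color (hypothetical helper)."""
    return {
        color: build_pattern(color).search(response) is not None
        for color in COLORS
    }


# Abbreviated stand-in for the model response shown earlier.
sample = (
    '<nc-section data-title="Version 1: The Stern Welcome" data-color="red"> '
    "Sir Roland looked me up and down. </nc-section> "
    '<nc-section data-title="Version 2: The Jovial Mentor" data-color="green"> '
    "Sir Roland beamed. </nc-section> "
    '<nc-section data-title="Version 3: The Weary Veteran" data-color="blue"> '
    "Sir Roland offered a small nod. </nc-section>"
)

if __name__ == "__main__":
    print(evaluate(sample))
```

Because the body is matched with `[^<]+`, a section that contains any nested markup, or that lists its attributes in a different order, would not match these patterns even if it were otherwise well formed.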