NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8W504RJ9EC01RPNBMEXDEZ
Run Details
Model:
Gemma 4 31B
Test:
Tool usage within Novelcrafter
Scenario:
Create alternate prose sections
Duration:
23.1s
Cost:
$0.000192
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
Pass
Match blue prose section
(details)
Pass
Match green prose section
(details)
Pass
Match red prose section
(details)
no eval
Overall
Model input
System
You are an expert fantasy novel author. You are working on a novel called "The Chronicles of the Last Dragon". The following tools are available to you: - <nc-section> Use this tool to create a new section of prose. Arguments: - "data-title": The title of the section. - "data-color": Any color of (red, green, blue, yellow, purple, orange, pink, brown). Example usage: <nc-section data-title="Chapter 1: The Arrival" data-color="blue"> Frederick, a young squire, arrives at the castle to begin his training. </nc-section>
User/Human
The story so far: <storySoFar> Frederick, a young squire, arrives at the castle to begin his training. </storySoFar>
AI
My horse, a majestic stallion named Thunder, galloped across the open field. The wind whipped through my hair as I rode towards the castle gates. The sun was setting, casting a warm glow over the stone walls. I could see the banners of the kingdom fluttering in the breeze.
User/Human
Continue the story with the following instructions: """The local knight, Sir Roland, greets Frederick near the stables.""" Create 3 alternate prose sections that continue the story. - Version 1 (red) - Version 2 (green) - Version 3 (blue)
Model response
Messages
Meta
AI
<nc-section data-title="Version 1: The Stern Welcome" data-color="red"> As Frederick brought Thunder to a halting stop, a towering figure stepped out from the shadows of the stables. Sir Roland, clad in weathered steel that spoke of a hundred battles, looked the boy up and down with a critical eye. "You're late, lad," the knight barked, his voice like grinding stone. "In this castle, punctuality is the difference between a victory and a burial. Get your gear unloaded; your training begins before the dew dries." </nc-section> <nc-section data-title="Version 2: The Warm Reception" data-color="green"> Near the stables, a man with a hearty laugh and a golden beard waited for Frederick's arrival. Sir Roland stepped forward, clapping a heavy hand on the squire's shoulder with genuine warmth. "Welcome to the fold, Frederick! I've heard great things about your spirit," he boomed, his eyes twinkling with kindness. "Come, let us see to your horse first. A knight is only as good as the beast that carries him, and Thunder looks like a champion." </nc-section> <nc-section data-title="Version 3: The Mysterious Encounter" data-color="blue"> The air grew still as Frederick dismounted, noticing a solitary figure leaning against the stable wall. Sir Roland did not speak at first, merely watching the boy with an intensity that felt as though he were reading Frederick's very soul. When he finally spoke, his voice was a low, melodic hum. "The stars foretold a new arrival," Roland murmured, a cryptic smile playing on his lips. "I wonder if you possess the spark required for what is to come. Follow me, and let us see what you are made of." </nc-section>
created_at
"2026-05-10T12:00:20.128Z"
5/10/2026, 12:00:20 PM
model
"google/gemma-4-31b-it-20260402"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
839
moderation_latency
(null)
generation_time
22998
tokens_prompt
318
tokens_completion
423
native_tokens_prompt
329
native_tokens_completion
393
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00019211
router
(null)
provider_responses
0
endpoint_id
"9d4644e9-a5e5-41f4-b810-d75f484d2c24"
is_byok
false
latency
24
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"Parasail"
status
429
1
endpoint_id
"4149ab03-3b69-45a0-b94c-6c253b8641a1"
id
"chatcmpl-RvCl2ZHM1pgcsvWP5U7R9KwY"
is_byok
false
latency
59
model_permaslug
"google/gemma-4-31b-it-20260402"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414420-oE6EsfkfosVg87Md48Oz"
session_id
(null)
api_type
"completions"
id
"gen-1778414420-Sj9dV7vP5mV1clmaAVF9"
upstream_id
"chatcmpl-RvCl2ZHM1pgcsvWP5U7R9KwY"
total_cost
0.00019211
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
Pass
Match blue prose section
/<nc-section data-title="[^"]*" data-color="blue">[^<]+<\/nc-section>/
n/a
Pass
Match green prose section
/<nc-section data-title="[^"]*" data-color="green">[^<]+<\/nc-section>/
n/a
Pass
Match red prose section
/<nc-section data-title="[^"]*" data-color="red">[^<]+<\/nc-section>/
n/a
no eval