NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR90RPF05VYB5EK5P8YJ3F79
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
6.2s
Cost:
$0.000248
Provider:
SiliconFlow
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted at the blueprint, which was actually just a hand-drawn map on a grease-stained napkin. Bill, you are looking at this all wrong. It is not a fortress. It is a bounce house. It is a tactical nightmare, Molly! Bill hissed, adjusting his black balaclava, which was slightly too small for his head. The perimeter is crawling with toddlers. They have no sense of fear and zero respect for personal space. They are four-year-olds, Bill. Not the Mossad. They are unpredictable! Bill gestured wildly at the napkin. Did you see the guest list? Timmy is having a dinosaur theme. Do you know what happens when a child in a triceratops costume sees a man in black spandex? Total anarchy. Focus. We enter through the patio sliding door. We bypass the juice box station, head straight for the gift pile near the snack table, and extract the loot before the cake cutting ceremony begins. The cake cutting? Bill gasped. That is the high-risk window! That is when the sugar rushes hit. We will be wading through a sea of hyperactive minions. Nobody is calling them minions, Bill. It is a superhero theme. Exactly! Chaos! Bill wiped sweat from his forehead. And what about the distraction? You said you had the distraction. I have a bag of high-grade, organic, gluten-free dinosaur nuggets. Bill stared at her. Nuggets? You want to heist a residential playground with nuggets? They are the ultimate distraction, Bill. One tray of these and every child in a five-mile radius will be sprinting toward the buffet, leaving the presents unguarded. It is foolproof. It is poultry-based espionage, Molly. It is beneath us. It is efficient. Now, what about the exit strategy? We grab the loot, we slip through the rose bushes, and we meet at the getaway van parked behind the cul-de-sac. The getaway van? You mean your Honda Civic? It is a discreet, neutral-colored vehicle! Bill snapped. It has a bumper sticker that says I Love My Golden Retriever, Molly reminded him. It adds a layer of civilian camouflage! Bill argued. Fine. But if we get cornered by a kid in a Spiderman suit, I am leaving you behind. They are not that intimidating, Bill. Tell that to the one with the juice box, Molly said, grabbing her tactical bag. Let’s go. The birthday boy is turning five in twenty minutes. We are professionals, Molly. We are shadows in the night. We are ghosts. You are a man in a tight mask holding a bag of nuggets, Bill. I am a specialist, Bill corrected, puffing out his chest. Now, let us move out. To the bouncy castle!
created_at
"2026-05-10T13:20:59.878Z"
5/10/2026, 1:20:59 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2314
moderation_latency
(null)
generation_time
6000
tokens_prompt
77
tokens_completion
669
native_tokens_prompt
79
native_tokens_completion
596
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00024788
router
(null)
provider_responses
0
endpoint_id
"eb382d64-dce1-42d5-b23d-6fefb91948a3"
id
"chatcmpl-8f83a96f6d8fa53a"
is_byok
false
latency
2314
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"SiliconFlow"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778419259-vwC9CCJ9LFLNUc5R74th"
session_id
(null)
api_type
"completions"
id
"gen-1778419259-v1bxKZffOKND8Z3YSu1X"
upstream_id
"chatcmpl-8f83a96f6d8fa53a"
total_cost
0.00024788
cache_discount
(null)
upstream_inference_cost
0
provider_name
"SiliconFlow"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
440
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededClean
false
wordsTotal
452
wordsDialogue
0
0.0000%