NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8ZAE8B13DS1CGEKP3SWGPA
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 36s
Cost:
$0.000206
Provider:
DekaLLM
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the grease-stained table, pinning the corners down with two half-eaten jelly donuts. It wasn’t a bank vault or a jewelry exchange, but the layout of the Henderson backyard was equally treacherous. "Listen closely, Bill. This is high-stakes larceny. We are talking about a concentrated cluster of high-value consumer goods," Molly whispered, tapping a crayon-drawn circle near the patio. Bill squinted at the drawing, his brow furrowed in genuine confusion. "Molly, it’s a six-year-old’s birthday party. I’ve seen more security at a petting zoo." "That is exactly what they want you to think!" Molly hissed, pointing to a cluster of tiny footprints near the sandbox. "Look at the perimeter. The bouncy castle provides excellent visual cover, but the terrain is unstable. One wrong step and you’re bouncing your way straight into a nap time zone." "I'm not crawling through a ball pit for a handful of Lego sets and some off-brand action figures," Bill argued, crossing his arms. "Where’s the thrill? Where’s the adrenaline?" "The adrenaline is in the precision, you amateur!" Molly snapped. "We enter through the service gate during the cake cutting. That is the window of maximum distraction. While the sugar crash begins to settle the crowd, we strike." "And the getaway?" Bill asked, leaning in. "Are we using the van? Or are we actually going to try that ridiculous getaway tricycle idea you had?" Molly glared at him. "The tricycle was a tactical choice for navigating narrow garden paths. Besides, the van is too conspicuous near a cluster of minivans." "I just feel like we’re losing our dignity," Bill muttered, eyeing the list of targets. "Is 'Sparkle-Pony Deluxe' really worth a felony charge?" "It’s not about the pony, Bill. It’s about the principle of the heist," Molly replied, her eyes gleaming with a manic intensity. "We take the presents, we leave no trace, and we exit before the piñata is even cracked open. It’s a ghost operation." Bill sighed, rubbing his temples. "What if a kid follows us? I am not being chased through a suburban cul-de-sac by a toddler in a dinosaur costume. I have a reputation to uphold." "Then you stay low and move with purpose," Molly commanded. "If a child approaches, you deploy the distraction. Do you have the juice boxes?" "I have the organic apple juice boxes, just like you requested," Bill said, patting his tactical vest. "Good. Now, let's go over the extraction route one more time. If we hit the hedge at a forty-five-degree angle, we can bypass the clown entirely." "The clown is the real threat, isn't he?" Bill asked, his voice dropping an octave. "He’s a wildcard, Bill. A total wildcard," Molly agreed solemnly.
created_at
"2026-05-10T12:55:44.148Z"
5/10/2026, 12:55:44 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4429
moderation_latency
(null)
generation_time
95552
tokens_prompt
77
tokens_completion
713
native_tokens_prompt
68
native_tokens_completion
613
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00020637
router
(null)
provider_responses
0
endpoint_id
"d99bcbe6-bec3-4991-965e-45fea79fe63c"
id
"chatcmpl-b6d3325d95b0aabf"
is_byok
false
latency
4429
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DekaLLM"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778417744-kzTK82jc0F6SrBuLJTZH"
session_id
(null)
api_type
"completions"
id
"gen-1778417744-hrSQhrwWQfxISzs7aLWl"
upstream_id
"chatcmpl-b6d3325d95b0aabf"
total_cost
0.00020637
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DekaLLM"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
450
0.00%
Dialogue to Total Word Ratio
Ratio: 73.00%, Deviation: 43.00%
neededClean
false
wordsTotal
463
wordsDialogue
338
0.0000%