NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8WXJM2RBBKKMWVZM854DQV
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
24.3s
Cost:
$0.000209
Provider:
DekaLLM
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0001%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly stared at the blueprints, which were actually just a colorful coloring book page of a castle. She tapped a crayon-smudged finger on the drawing. Target is the bouncy castle, Molly whispered. Bill squinted at the paper, nursing a lukewarm juice box. Are you sure? It looks like a dragon is eating a cupcake in that corner. That is the cake table, Bill. Keep up, Molly snapped. Once we infiltrate the sandbox, we move toward the pile of primary-colored wrapping paper. The pile of presents? Bill asked, his eyes widening. You mean the haul? The haul, Molly confirmed. We strike during the inevitable sugar crash. When the toddlers go limp and the parents are distracted by the chaos of spilled juice, we swoop in. How do we get past the perimeter? Bill asked. Is there a laser grid? It is a backyard, Bill. The perimeter is a picket fence and a very territorial golden retriever named Barnaby, Molly said. Do we need disguises? Bill reached for his tactical turtleneck. We are wearing clown noses, Molly said. It is the only way to blend in with a group of six-year-olds. A clown nose? Bill sighed. That is a bit cliché, don't you think? It is camouflage! Molly hissed. Nobody suspects the man in the oversized polka-dot bowtie. What about the getaway vehicle? Bill asked. I have rented a motorized tricycle, Molly replied. A tricycle? Bill groaned. We are professional thieves, Molly. We cannot flee a crime scene on a plastic three-wheeler. It has a turbo button, Bill! Molly insisted. Is it even street legal? It is a driveway, not a highway, Molly countered. Now, tell me you have the tools. Bill patted his pockets proudly. I have a plastic hammer and a very sturdy spatula. A spatula? Molly stared at him in disbelief. For what? For prying open the lid of the piñata, Bill explained. That is where the high-value targets are kept. The hard candies. The chocolate coins. Molly rubbed her temples. We are here for the electronics and the high-end building blocks, Bill. Not the candy. The candy is the tactical advantage, Bill argued. If we control the sugar, we control the room. You are a liability, Molly muttered. I am a visionary, Bill corrected. Do you have the blueprints for the gift pile organized by size? Molly asked, trying to regain control. I have them organized by color, Bill said. Blue presents are in sector one. Red is sector two. That is going to take us forever, Molly sighed. Not if we use the spatula to create a diversion, Bill suggested. Molly looked at the coloring book, then at the man in the polka-dot bowtie. Fine, she said. But if the dog bites me, I am leaving you with the tricycle. Deal, Bill grinned.
created_at
"2026-05-10T12:13:45.483Z"
5/10/2026, 12:13:45 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
996
moderation_latency
(null)
generation_time
24210
tokens_prompt
77
tokens_completion
717
native_tokens_prompt
68
native_tokens_completion
620
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00020868
router
(null)
provider_responses
0
endpoint_id
"d99bcbe6-bec3-4991-965e-45fea79fe63c"
id
"chatcmpl-99a1732294514695f5e608ce08693e7b"
is_byok
false
latency
996
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DekaLLM"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778415225-aEj7A9nMcoFkrFqvAcHA"
session_id
(null)
api_type
"completions"
id
"gen-1778415225-OixDhqvPKCPnQqFImXx0"
upstream_id
"chatcmpl-99a1732294514695f5e608ce08693e7b"
total_cost
0.00020868
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DekaLLM"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
466
0.00%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
475
wordsDialogue
0
0.0001%