NC Bench
Run: 2026-05-10-01KR8ZF7788DFX7GZQGTXA7J5Q
Run Details
Model: Gemma 4 26B
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 33.8s
Cost: $0.000217
Provider: DekaLLM
Evaluation summary
Score | Evaluator | Details Link
0.00% | Matches word count | (details)
3.73% | Dialogue to Total Word Ratio | (details)
1.8629% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced the length of the cramped van, her combat boots squeaking against the linoleum. She slammed a colorful, glitter-covered map onto the folding table. "The target is the Miller residence, Bill. Saturday, two o'clock sharp. Theme is 'Galactic Explorers,'" Molly said, pointing at a drawing of a cartoon rocket. Bill squinted through his thick glasses, chewing on a stale pretzel. "Galactic Explorers? That sounds structurally unsound for a heist. Are there laser beams?" "No lasers. Just high-frequency toddler screams and aggressive balloon animals," Molly replied. "I don't like it," Bill muttered. "The tactical disadvantage of a bounce house is immense. It’s unpredictable terrain." "Focus! We aren't there for the cake, Bill. We are there for the haul. The primary objective is the pile of gifts by the gift table. I’m talking high-end electronics, limited edition action figures, and potentially a drone or two," Molly insisted. Bill leaned forward, his brow furrowed. "What about the perimeter? Who’s guarding the gift table?" "The birthday boy, Leo, a six-year-old with zero situational awareness and a sticky grip," Molly explained. "He’ll be distracted by a magician or a clown. We hit the table during the 'Pin the Tail on the Rocket' segment. That’s our window." "And the extraction?" Bill asked. "We can't exactly haul twenty cardboard boxes through a yard full of sugar-crazed children without looking suspicious." "We use the tactical strollers," Molly whispered. "Modified with silent-tread wheels and reinforced bottoms. We blend in as weary parents looking for a nap area. We push the strollers through the side gate, load the loot, and vanish before the piñata breaks," she said. Bill shuddered. "The piñata is a variable I can’t account for. What if it explodes prematurely? The chaos could be unmanageable." "Then we move faster! We grab the boxes and we go!" Molly shouted, then immediately lowered her voice. "Quiet! We need stealth." 
"Stealth in a room full of juice boxes? It’s a fool’s errand, Molly," Bill sighed. "What if someone asks us to play? I have no training in 'Duck, Duck, Goose.'" "You don't play. You observe. You act as the lookout while I scoop the loot into the stroller," Molly commanded. "And if a child follows me?" Bill asked, his voice trembling. "A small human, Molly. They have a terrifyingly persistent way of clinging to one’s leg." "Then you give them a temporary distraction. A sticker. A piece of lint. Do not engage in emotional bonding!" Molly warned. Bill rubbed his temples. "Stickers. Right. Tactical adhesive. I can work with that." "Good," Molly said, a predatory grin spreading across her face. "Check your gear. I want those strollers greased and the sticker-dispenser loaded. We strike at two." "God help us," Bill whispered. "The galaxy is a dangerous place."
created_at
"2026-05-10T12:58:20.786Z"
5/10/2026, 12:58:20 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
5033
moderation_latency
(null)
generation_time
33665
tokens_prompt
77
tokens_completion
748
native_tokens_prompt
68
native_tokens_completion
646
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00021726
router
(null)
provider_responses
0
endpoint_id
"d99bcbe6-bec3-4991-965e-45fea79fe63c"
id
"chatcmpl-8691637d316c76efeb57bd674d0b94e0"
is_byok
false
latency
5033
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DekaLLM"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778417900-yz62e5mxHxY9o2kT8M9d"
session_id
(null)
api_type
"completions"
id
"gen-1778417900-476EUD0MB0U1H6ZtkInX"
upstream_id
"chatcmpl-8691637d316c76efeb57bd674d0b94e0"
total_cost
0.00021726
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DekaLLM"
response_cache_source_id
(null)
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 458
3.73% | Dialogue to Total Word Ratio | Ratio: 78.11%, Deviation: 8.11% | neededClean: false, wordsTotal: 466, wordsDialogue: 364
1.8629% | Overall
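The ratio and deviation reported by the Dialogue to Total Word Ratio evaluator can be reproduced from its metadata. The sketch below is illustrative only: the function names, the rule that text between straight double quotes counts as dialogue, and whitespace word splitting are all assumptions, not NC Bench's actual evaluator. The 70% target is taken from the scenario. The last check shows that the Overall value is consistent with an unweighted mean of the two evaluator scores, which is an inference from the displayed numbers rather than a documented formula.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed word definition)."""
    return len(text.split())


def dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def ratio_report(words_dialogue: int, words_total: int, target_pct: float = 70.0):
    """Return (ratio %, absolute deviation from target in percentage points)."""
    ratio = 100.0 * words_dialogue / words_total
    deviation = abs(ratio - target_pct)
    return round(ratio, 2), round(deviation, 2)


# Values copied from the evaluator metadata: wordsDialogue=364, wordsTotal=466.
ratio, deviation = ratio_report(364, 466)

# Tiny made-up sample to show the counting helpers in isolation.
sample = '"Go now," she said. "Run!"'
sample_dialogue = dialogue_words(sample)
sample_total = count_words(sample)

# If Overall is the plain mean of the two scores (assumption), then the
# unrounded ratio score is 2 * 1.8629 - 0.00, which should round to 3.73.
implied_ratio_score = round(2 * 1.8629 - 0.00, 2)
```

The page does not show how the 8.11-point deviation is converted into the 3.73% score, or how 458 words against a 500-word target yields 0.00%, so the sketch does not attempt either mapping. The two evaluators also report different totals (458 versus 466 words), which suggests they count words differently.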