NC Bench
Run: 2026-05-10-01KR91TQ2K1CG6725S2MT8RH89
Run Details
Model: Gemma 4 26B (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 3m 49s
Cost: $0.000210
Provider: DekaLLM
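The Duration and Cost values appear to be display-formatted versions of the raw `generation_time` (229080, presumably milliseconds) and `total_cost` (0.00021033) fields recorded in the response metadata. A minimal Python sketch of that conversion, assuming truncation to whole seconds and six decimal places; the helper names `format_duration` and `format_cost` are hypothetical and not part of NC Bench:

```python
def format_duration(milliseconds: int) -> str:
    """Render a millisecond duration as 'XmYs', dropping fractional seconds (assumed)."""
    total_seconds = milliseconds // 1000
    minutes, seconds = divmod(total_seconds, 60)
    return f"{minutes}m {seconds}s"


def format_cost(usd: float) -> str:
    """Render a USD amount with six decimal places (assumed display precision)."""
    return f"${usd:.6f}"


print(format_duration(229080))   # 3m 49s
print(format_cost(0.00021033))   # $0.000210
```

Note that adding the separately reported `latency` (1235) to `generation_time` would give 3m 50s, so the displayed duration is consistent with `generation_time` alone.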
Evaluation summary
Score      Evaluator
0.00%      Matches word count
0.01%      Dialogue to Total Word Ratio
0.0047%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a red marker against a blueprint of a suburban living room. The basement air smelled of stale coffee and impending felony.
"The Henderson birthday bash is high-stakes, Bill," Molly said, her eyes narrowing. "This isn't a jewelry heist. This is high-volume, low-value chaos."
Bill leaned over the map, squinting at the crayon-drawn perimeter. "I don't like the sound of 'low-value.' I spent three hours sharpening my screwdriver for a pile of stuffed animals?"
"Think of the sheer quantity! If we grab the LEGO sets, the remote-controlled cars, and the various licensed action figures, we're looking at a significant windfall."
"And the wrapping paper? It’s going to take us ages to unwrap everything in the van."
"We don't unwrap them, you idiot! We take the boxes! We leave the paper as a signature. A calling card of mystery."
"A calling card of being a common thief who stole a Barbie Dreamhouse?"
"It's called branding, Bill. Now, let's talk about the primary obstacle. The bouncy castle."
"That's not an obstacle, that's just a trampoline with walls."
"It's a kinetic hazard! If we enter the zone, the floor is unstable. We could lose our footing and crash right into the juice boxes."
"I can handle a little bouncing. I was the king of the playground in '94."
"This isn't a playground, it's a tactical nightmare. And then there’s the clown. Puddles the Magnificent. He’s been spotted in the area."
Bill wiped a bead of sweat from his forehead. "Is he a threat? Does he have a permit?"
"He’s a wild card. One minute he’s making a poodle, the next he’s balloon-sword fighting the toddlers. We move during the 'Happy Birthday' song. That’s our window."
"The singing? That's when everyone's eyes are on the candles."
"Exactly. While they’re distracted by the glucose spike, we slip in through the patio door, bag the loot, and vanish into the hydrangea bushes."
"What about the dog?"
"A golden retriever named Buster. He’s easily bribed with a piece of pepperoni."
"I'll bring the deli meat."
"Good. And Bill?"
"Yeah?"
"If you see a kid in a superhero cape, do not engage. Just keep moving."
"What if he tries to join the team? They're very persuasive."
"Then we use the tactical distraction: the extra bag of gummy worms."
Bill nodded, pulling a pair of dark sunglasses from his pocket. "I'll bring the worms. And the pepperoni. But I'm telling you, Molly, if there's a piñata involved, I'm out."
"The piñata is strictly off-limits, Bill. We aren't here for candy."
"It's about the principle of the thing."
Molly sighed, rubbing her temples. "Just get the bags ready."
Bill stood up, his heavy boots thudding against the concrete floor.
created_at
"2026-05-10T13:39:34.627Z"
5/10/2026, 1:39:34 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1235
moderation_latency
(null)
generation_time
229080
tokens_prompt
77
tokens_completion
7565
native_tokens_prompt
68
native_tokens_completion
625
native_tokens_completion_images
(null)
native_tokens_reasoning
6353
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00021033
router
(null)
provider_responses
0
endpoint_id
"d99bcbe6-bec3-4991-965e-45fea79fe63c"
id
"chatcmpl-8b3e1300f3d0de27"
is_byok
false
latency
1235
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DekaLLM"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778420374-PQmhIKJQCsBpipzkc2Ai"
session_id
(null)
api_type
"completions"
id
"gen-1778420374-Ya0IbF7jobJGvnPeg4mM"
upstream_id
"chatcmpl-8b3e1300f3d0de27"
total_cost
0.00021033
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DekaLLM"
response_cache_source_id
(null)
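The token fields in this metadata can be combined into a rough picture of where the generation budget went. The sketch below assumes that `native_tokens_completion` (625) and `native_tokens_reasoning` (6353) are disjoint counts, which the numbers suggest since the reasoning count exceeds the completion count, and that `generation_time` is in milliseconds. The `RunUsage` class and its method names are hypothetical, not part of any provider API:

```python
from dataclasses import dataclass


@dataclass
class RunUsage:
    """Subset of the metadata fields recorded for this run."""
    native_tokens_completion: int
    native_tokens_reasoning: int
    generation_time_ms: int

    def generated_tokens(self) -> int:
        # Assumption: visible-output and reasoning tokens do not overlap.
        return self.native_tokens_completion + self.native_tokens_reasoning

    def reasoning_share(self) -> float:
        """Fraction of generated tokens spent on reasoning."""
        return self.native_tokens_reasoning / self.generated_tokens()

    def tokens_per_second(self) -> float:
        """Average generation throughput over the whole run."""
        return self.generated_tokens() / (self.generation_time_ms / 1000)


usage = RunUsage(
    native_tokens_completion=625,
    native_tokens_reasoning=6353,
    generation_time_ms=229080,
)
print(f"reasoning share: {usage.reasoning_share():.1%}")        # about 91%
print(f"throughput: {usage.tokens_per_second():.1f} tokens/s")  # about 30 tokens/s
```

Under these assumptions, roughly nine out of ten generated tokens went to reasoning rather than to the visible scene.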
Evaluation details
Result     Evaluator                       Details                             Meta Data
0.00%      Matches word count              n/a                                 neededClean: false; words: 450
0.01%      Dialogue to Total Word Ratio    Ratio: 83.62%, Deviation: 13.62%    neededClean: false; wordsTotal: 458; wordsDialogue: 383
0.0047%    Overall
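The reported ratio and deviation follow directly from the word counts in the Meta Data: 383 dialogue words out of 458 total, compared against the 70% target from the scenario. The sketch below reproduces that arithmetic and adds a naive quote-based counter for illustration. The counter is an assumption: the evaluator's actual tokenization and quote handling are not shown on this page, so `count_dialogue_words` is hypothetical and may not reproduce the 383/458 counts exactly. How the deviation maps to the 0.01% score is not documented here and is not modelled.

```python
import re

TARGET_RATIO = 0.70  # from the scenario: "Write 500 words with 70% dialogue"


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


def count_dialogue_words(text: str) -> tuple[int, int]:
    """Naive counter (assumption): words inside straight double quotes are dialogue."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    # Replace quote marks with spaces so they never glue two words together.
    words_total = len(text.replace('"', " ").split())
    return words_dialogue, words_total


# Reproduce the reported figures from the recorded counts.
ratio, deviation = ratio_and_deviation(383, 458)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")  # Ratio: 83.62%, Deviation: 13.62%

# Tiny usage example for the naive counter.
print(count_dialogue_words('Bill nodded. "I will bring the worms."'))  # (5, 7)
```

The response overshoots the dialogue target by about 13.6 percentage points, and the separate word-count evaluator recorded 450 words against the requested 500.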