NC Bench
Run: 2026-05-10-01KR91J9YYXD34ZJDZAQE47ZAE
Run Details
Model: Gemma 4 26B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.2s
Cost: $0.000220
Provider: DeepInfra
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The blueprints were spread across the grease-stained table of the dim diner, looking less like a tactical map and more like a preschooler’s finger painting. Molly tapped a manicured fingernail against the drawing of a bouncy castle. "The perimeter is soft, Bill. Too soft," Molly whispered, her eyes narrowing behind designer sunglasses. "The target is a seven-year-old’s birthday bash. We aren’t hitting a vault; we’re hitting a sugar-induced frenzy." Bill squinted at the map, chewing on a piece of stale toast. "I don't see the problem. It’s high volume, low security. Most of these marks are distracted by juice boxes and inflatable slides." "That’s exactly the danger," Molly countered, leaning in closer. "Distraction is a double-edged sword. If a kid sees us, they don't call the cops; they scream for their mother, and then we’ve got a mob of weeping toddlers on our tails. We need stealth. We need precision." "I thought we were going as clowns," Bill suggested, a hopeful glint in his eye. "I already bought the oversized shoes. They provide excellent cushioning for quick exits." Molly stared at him in dead silence. "We are not going as clowns, Bill. This is a high-stakes gift extraction, not a circus act. We need tactical camouflage. Dark linens. Silent movement. We slip in during the cake cutting, hit the gift pile, and vanish before the Piñata is even cracked." "What if the Piñata is a diversion?" Bill asked, genuinely concerned. "What if it’s booby-trapped with glitter bombs or those little plastic spiders?" "It’s a donkey filled with raisins, Bill, not a laser grid," Molly snapped. "Focus. The haul is massive. We’re looking at Lego sets, high-end handheld consoles, and possibly some premium remote-controlled drones. The resale value on a limited-edition superhero action figure alone could fund our getaway to Aruba." "I just think the clown angle has merit," Bill muttered, tracing a line toward the snack table. "I can hide the loot in the oversized pockets. 
It’s built-in cargo space." "You will wear black, and you will act like a professional," Molly commanded, pointing to the patio entrance. "We enter through the hedge line. We bypass the petting zoo. We intercept the presents during the peak chaos of the 'Happy Birthday' song. Do you understand?" Bill nodded solemnly, though his eyes drifted to a nearby menu. "Understood. Stealth. Silence. No clowns. Unless they're very subtle clowns." "One more word about clowns and I’m leaving you to face the wrath of the birthday boy alone," Molly warned. She folded the map with a sharp, decisive snap. "Check your gear. We move at fourteen-hundred hours. And for heaven's sake, leave the squeaky nose at home."
created_at: "2026-05-10T13:34:59.046Z" (5/10/2026, 1:34:59 PM)
model: "google/gemma-4-26b-a4b-it-20260403"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 152
moderation_latency: (null)
generation_time: 15101
tokens_prompt: 77
tokens_completion: 703
native_tokens_prompt: 79
native_tokens_completion: 632
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
service_tier: (null)
usage: 0.00022041
router: (null)
provider_responses
0
endpoint_id: "d71c6981-347d-4622-a86b-2c9e5d0f2c6c"
id: "chatcmpl-RYnpKK6lulDub6xIhOg0YcFc"
is_byok: false
latency: 66
model_permaslug: "google/gemma-4-26b-a4b-it-20260403"
provider_name: "DeepInfra"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1778420099-mPzyZIpdgsDnL24pzQhx"
session_id: (null)
api_type: "completions"
id: "gen-1778420099-46C4KEqBgAmgzgYfZcuQ"
upstream_id: "chatcmpl-RYnpKK6lulDub6xIhOg0YcFc"
total_cost: 0.00022041
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepInfra"
response_cache_source_id: (null)
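The timing and token fields above can be combined into a rough throughput figure. The sketch below assumes that `latency` and `generation_time` are reported in milliseconds, which the page does not state; the helper name `tokens_per_second` is hypothetical and used only for illustration.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Completion tokens divided by generation time, converted from ms to seconds."""
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the metadata above.
LATENCY_MS = 152              # top-level latency (assumed to be milliseconds)
GENERATION_TIME_MS = 15101    # generation_time (assumed to be milliseconds)
NATIVE_COMPLETION_TOKENS = 632
NORMALIZED_COMPLETION_TOKENS = 703

native_tps = tokens_per_second(NATIVE_COMPLETION_TOKENS, GENERATION_TIME_MS)
normalized_tps = tokens_per_second(NORMALIZED_COMPLETION_TOKENS, GENERATION_TIME_MS)

# Under the millisecond assumption, latency plus generation time is close to
# the 15.2s duration reported in the run details.
total_ms = LATENCY_MS + GENERATION_TIME_MS

print(f"native: {native_tps:.2f} tok/s, normalized: {normalized_tps:.2f} tok/s")
print(f"latency + generation_time = {total_ms} ms")
```

The native and normalized token counts differ because they come from different tokenizers, so the two throughput figures are not directly comparable across models.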
Evaluation details
Matches word count
Result: 0.00%
Details: n/a
Meta Data: neededClean false, words 443
Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 73.19%, Deviation: 43.19%
Meta Data: neededClean false, wordsTotal 455, wordsDialogue 333
Overall: 0.0000%
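The reported ratio and deviation follow directly from the word counts in the evaluation details. The sketch below reproduces that arithmetic and adds a rough dialogue counter. It rests on several assumptions: dialogue is taken to be text between straight double quotes, words are split on whitespace, and the deviation is the absolute difference from the 30% target in percentage points. The evaluator's actual tokenizer and scoring rule are not shown on this page, and the function names are hypothetical.

```python
import re

# Assumption: dialogue is whatever sits between straight double quotes.
QUOTED = re.compile(r'"([^"]*)"')


def count_words(text: str) -> int:
    """Whitespace-delimited word count (an assumed tokenization)."""
    return len(text.split())


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target_pct: float = 30.0) -> tuple[float, float]:
    """Dialogue share in percent, and its absolute distance from the target."""
    ratio = 100.0 * words_dialogue / words_total
    return round(ratio, 2), round(abs(ratio - target_pct), 2)


def dialogue_stats(text: str, target_pct: float = 30.0) -> dict:
    """Count total and quoted words in a text, then derive ratio and deviation."""
    total = count_words(text)
    dialogue = sum(count_words(span) for span in QUOTED.findall(text))
    ratio = 100.0 * dialogue / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": dialogue,
        "ratio_pct": round(ratio, 2),
        "deviation_pct": round(abs(ratio - target_pct), 2),
    }


# Counts reported by the evaluator: wordsDialogue 333, wordsTotal 455.
reported = ratio_and_deviation(333, 455)
print(reported)

# A tiny hand-made sample, not taken from the model response.
sample = 'Molly sighed. "No clowns, Bill," she said.'
stats = dialogue_stats(sample)
print(stats)
```

With 333 dialogue words out of 455, the response is 73.19% dialogue against a 30% target. The two evaluators also report different totals (443 and 455 words), which suggests they count words differently, so a simple whitespace split may not match either figure exactly.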