NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR9045E321S9RFY7XMJHV13W
Run Details
Model:
Gemma 4 26B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
10.4s
Cost:
$0.000227
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the grease-stained table, tapping a frantic finger on a drawing of a bouncy castle. The dim light of the basement flickered, casting long, suspicious shadows against the stacks of stolen toaster ovens. This is the target, Bill, she whispered, her eyes gleaming with a terrifying level of intensity. The Miller residence. Saturday, two o’clock. The theme is ‘Galactic Explorers.’ Bill leaned in, squinting through his cracked spectacles. Is there a perimeter alarm? Only a sensory minefield of toddlers and sugar-crazed siblings, Molly replied. We enter through the side patio, bypass the clown, and move straight for the gift pile. The clown? Bill asked, his voice rising an octave. Is he armed? He’s armed with squeaky noses and existential dread, Molly said, rolling her eyes. Don't be dramatic. He’ll be distracted by the balloon animal ritual. Bill rubbed his temples, looking deeply unsettled. I don’t like it, Molly. This feels high-risk. What if there’s a bouncy castle casualty? It’s a diversion, she insisted. We use the bounce to mask our movement. The vibration of the jumping will camouflage the sound of us stuffing the LEGO sets into the duffel bags. We aren't just talking about toys, are we? Bill muttered. No, Molly said, her voice dropping to a conspiratorial hiss. We go for the big ticket items. The high-end remote-controlled drones. The limited-edition plushies. The unboxed tablets. Bill sighed, leaning back in his creaky chair. And the cake? Is the cake part of the haul? Don't be a fool, Bill, Molly snapped. We are professional thieves, not hungry lunatics. We leave the buttercream alone. What about the parents? Bill asked, staring at the ceiling. They’re like sleeper agents. One minute they’re handing out juice boxes, the next they’re calling the authorities. They’ll be trapped in a cycle of forced small talk and lukewarm coffee, Molly promised. We’ll be gone before the piñata even cracks. Bill grabbed a pencil, sketching a hasty diagram of a getaway van. I’ll need a disguise. Something inconspicuous. You’re wearing a dinosaur onesie, Molly said flatly. A dinosaur? Bill sputtered. That’s too conspicuous! It’s a tactical advantage, she countered. No one suspects a T-Rex of grand larceny. They just think you’re a very large, very lost party guest. Bill stared at her for a long moment, then slowly began to nod. A T-Rex. It’s brilliant. The sheer absurdity will paralyze them. That’s the spirit, Molly said, slamming her fist onto the table. Now, let’s talk about the extraction route. If the toddler in the blue shirt sees us, we pivot to the garden gate. Understood, Bill said, his eyes narrowing with sudden professional resolve. Operation Birthday Bash is a go.
created_at
"2026-05-10T13:09:47.082Z"
5/10/2026, 1:09:47 PM
model
"google/gemma-4-26b-a4b-it-20260403"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
97
moderation_latency
(null)
generation_time
10219
tokens_prompt
77
tokens_completion
737
native_tokens_prompt
79
native_tokens_completion
651
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00022687
router
(null)
provider_responses
0
endpoint_id
"d71c6981-347d-4622-a86b-2c9e5d0f2c6c"
id
"chatcmpl-RKUYh5yLGk1qeGTiED4Wa0yz"
is_byok
false
latency
48
model_permaslug
"google/gemma-4-26b-a4b-it-20260403"
provider_name
"DeepInfra"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418587-3C1YsmB8tWNWTkGkvLpm"
session_id
(null)
api_type
"completions"
id
"gen-1778418587-4nTl1UCYme2cyEAQwZNw"
upstream_id
"chatcmpl-RKUYh5yLGk1qeGTiED4Wa0yz"
total_cost
0.00022687
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
448
0.00%
Dialogue to Total Word Ratio
Ratio: 0.44%, Deviation: 29.56%
neededClean
false
wordsTotal
454
wordsDialogue
2
0.0000%