NC Bench
Run: 2026-02-20-01KHY7XTKVN56CGA426HKFSK7C
Run Details
Model: Gemma 3 27B
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 29.1s
Cost: $0.000116
Provider: DeepInfra
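The Duration and Cost shown here appear to follow from the raw fields recorded in the run metadata further down. The sketch below checks that arithmetic under one assumption the page does not state: that `latency` and `generation_time` are in milliseconds. The variable names are illustrative, and the throughput figure is a derived value, not something the page reports.

```python
# Values copied from the run metadata on this page.
# ASSUMPTION: latency and generation_time are in milliseconds.
latency_ms = 175
generation_time_ms = 28940
native_tokens_completion = 688
usage_usd = 0.00011608

# Total wall-clock time as the sum of first-token latency and generation time.
total_ms = latency_ms + generation_time_ms
total_s = total_ms / 1000

# Derived throughput over the generation phase only (not reported by the page).
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"Duration: {total_s:.1f}s")
print(f"Cost: ${usage_usd:.6f}")
print(f"Throughput: {tokens_per_second:.1f} native tokens/s")
```

Under that assumption the sum rounds to the 29.1s shown as Duration, and the `usage` value rounds to the displayed cost.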
Evaluation summary
Score | Evaluator
0.00% | Matches word count
0.00% | Dialogue to Total Word Ratio
0.0000% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The chipped Formica of the diner booth felt cold under Molly’s elbows. She traced the sticky ring left by someone’s forgotten coffee, contemplating the sheer audacity of the plan. Outside, a grey drizzle plastered itself against the window, mirroring the gloom in her soul – a gloom only slightly alleviated by the potential for a haul of Lego and gaming consoles. “So,” Bill said, picking at a rogue crumb on his plate. He’d ordered toast, unbuttered. Naturally. The man subsisted on sadness and carbohydrates. “Operation Birthday Bonanza. You still think it’s viable?” Molly didn’t bother looking up. “Viable? Bill, it's *genius*. It's low risk, high reward, and honestly? Morally grey, but in a way that’s almost charming.” Bill snorted. “Charming. Right. Like robbing a six-year-old of their Paw Patrol fire truck is peak charm.” “They won’t even *notice* individually,” Molly argued. “It’s the sheer volume we’re after. Little Timmy won't miss one remote control car amongst fifty. The parents will just think the kids are overwhelmed with generosity.” She spread a glossy, aerial photo of 14 Birchwood Lane across the table. A sprawling suburban house, a bouncy castle inflating in the backyard, a tiny, ludicrously cheerful inflatable dinosaur guarding the front door. “The party starts at two. Principal target window, two thirty to three thirty. That's when peak present-opening fatigue sets in. Parents get distracted, kids wander off to argue about who gets the first slice of cake. Chaos.” Bill pushed the toast around his plate. “And our disguise?” “Clowns. Obviously. Who suspects a clown? They're already inherently untrustworthy.” “I hate clowns,” Bill stated flatly. “Everyone hates clowns, Bill. That’s the point! It’s camouflage. We'll blend into the general air of child-induced hysteria. I’ve already sourced the outfits. Extra-large shoes, honking noses, the works.” “Do they at least have pockets?” “Of course they have pockets! I'm not an amateur, Bill. They’re reinforced. 
Tactical clown pockets. We’re talking industrial-strength Velcro. We can fit a surprising amount in those things." Molly tapped the photo. "The garden backs onto the woods. Quick in, quick out. We’ll have the loot stashed in the van within fifteen minutes.” Bill sighed. “And the overall plan relies on exploiting the joy of children?” “It relies on exploiting adult oversight, Bill. Big difference.” Molly leaned forward, her voice dropping to a conspiratorial whisper. "Think of the possibilities! We're not just stealing presents, we're redistributing happiness! Okay, redistributing *items that cause happiness* to our own pockets… but still!” “You’re really selling this, aren’t you?” Bill said, a faint smile tugging at his lips. “I always sell it, Bill. It's a gift. Now, are you in or are you going to spend the afternoon bemoaning the existential dread of unbuttered toast?"
created_at: "2026-02-20T19:20:18.32Z" (2/20/2026, 7:20:18 PM)
model: "google/gemma-3-27b-it"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 175
moderation_latency: (null)
generation_time: 28940
tokens_prompt: 93
tokens_completion: 793
native_tokens_prompt: 75
native_tokens_completion: 688
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00011608
router: (null)
provider_responses:
  0:
    endpoint_id: "8f22002c-c045-446f-a1b9-9896133536b8"
    id: "chatcmpl-RjdqEgRruOKY25HBqpeKbxfp"
    is_byok: false
    latency: 55
    model_permaslug: "google/gemma-3-27b-it"
    provider_name: "DeepInfra"
    status: 200
api_type: "completions"
id: "gen-1771615218-8pM1PH1JkpfKys3CU5lj"
upstream_id: "chatcmpl-RjdqEgRruOKY25HBqpeKbxfp"
total_cost: 0.00011608
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "DeepInfra"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false; words: 456
0.00% | Dialogue to Total Word Ratio | Ratio: 64.91%, Deviation: 34.91% | neededClean: false; wordsTotal: 456; wordsDialogue: 296
0.0000% | Overall
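The ratio and deviation reported for the dialogue evaluator can be reproduced from the two word counts. This is a minimal sketch, not the benchmark's own evaluator: the function name is hypothetical, the 30% and 500-word targets are taken from the scenario description, and treating the deviation as an absolute difference in percentage points is an assumption. How a deviation maps to the 0.00% score is not shown on the page and is not modelled here.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return the fraction of words that are dialogue (hypothetical helper)."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return words_dialogue / words_total


# Counts copied from the evaluation details on this page.
words_total = 456
words_dialogue = 296

# Targets taken from the scenario: 500 words with 30% dialogue.
target_ratio = 0.30
target_words = 500

ratio = dialogue_ratio(words_dialogue, words_total)
# ASSUMPTION: deviation is the absolute gap between measured and target ratio.
deviation = abs(ratio - target_ratio)
# Word-count gap relative to the requested length (not reported by the page).
shortfall = target_words - words_total

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
print(f"Word count: {words_total} of {target_words} requested ({shortfall} short)")
```

The first printed line matches the figures in the Details column: the response contains more than twice the requested share of dialogue and falls 44 words short of the requested length.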