NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY8A1E6AC2MEWJ0BEKBB84F
Run Details
Model:
Gemma 3 12B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
26.8s
Cost:
$0.000097
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
27.17%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
13.5829%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The community hall smelled overwhelmingly of sugar and disappointment. Disappointment, because the bouncy castle was deflated and forlorn in one corner. Sugar, because a frankly obscene amount of cupcakes sat on a table draped in glittery pink fabric. Molly surveyed the scene with a practiced eye, her nose wrinkling slightly. Bill, meanwhile, was attempting to subtly taste a rogue jelly bean he’d plucked from the floor. "Are you *seriously* eating evidence, Bill?" she hissed, sidling closer to him. "It's reconnaissance, Molly! Gotta know the enemy's flavor profile," he mumbled, popping the jelly bean into his mouth with a satisfied sigh. "Lime. Very childish. Predictable." Molly pinched the bridge of her nose. "We’re stealing presents, Bill, not analyzing confectionery trends. Operation Sugar Rush is a go, remember? Focus." He swallowed with theatrical deliberation. "Right, right. Presents. Mountains of them, apparently. Little Timmy's got a remote-controlled monster truck. Little Susie's getting a pony… a *toy* pony, thank heavens, no actual equines involved." The hall was bustling with parents trying to appear enthusiastic about coloring-in competitions and pin-the-tail-on-the-donkey. Kids, fueled by sugar, were screaming and chasing each other with reckless abandon. It was chaos, controlled chaos, and perfect for covering their tracks. "The target zone is the gift pile," Molly continued, pointing to a teetering tower of brightly wrapped boxes accumulated under a banner that read, 'Happy 5th Birthday, Lily!' "Approximately seventy-eight packages. Range from Barbie dream houses to educational dinosaur kits. High-value targets: anything with batteries." "Seventy-eight! That's… substantial," Bill said, his eyes gleaming. "We’ll need a strategy. A plan. Something… elegant." "Elegant doesn’t come into it, Bill. It’s a children’s party. We’re aiming for efficient. I’ve already identified the two exits – the main doors, obviously secured, and a fire escape at the back. Less guarded. You’ll handle the distraction." "Distraction?" Bill's face fell. "What kind of distraction?" "You," Molly stated flatly. "You're going to pretend to be a ridiculously enthusiastic magician. Card tricks, disappearing rabbits… the whole shebang. Pull their attention away from the gift pile. Keep them buying into the absurdity." "A magician? Molly, I can barely juggle three grapes without dropping two!" "You do not need to juggle. Just be… convincingly awkward. Embarrassingly silly. Think a clown who’s lost his way." "I can do awkward," Bill said, puffing up his chest slightly. "I’m practically a professional at awkward. But sillier than me? That's a tall order." Molly ignored him. “While you’re charming the five-year-olds with your masterful illusions, I'll be loading the presents into the getaway bag – a reinforced duffel, naturally. Meet me at the fire escape in fifteen minutes. And Bill?" "Yes?" “No jelly beans.” She turned, blending into the crowd, leaving Bill to contemplate his impending role as a spectacularly bad magician. He eyed the cupcakes again. Reconnaissance, he told himself. It was all about reconnaissance.
created_at
"2026-02-20T19:26:58.515Z"
2/20/2026, 7:26:58 PM
model
"google/gemma-3-12b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
388
moderation_latency
(null)
generation_time
26624
tokens_prompt
93
tokens_completion
837
native_tokens_prompt
75
native_tokens_completion
725
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00009725
router
(null)
provider_responses
0
endpoint_id
"eb06dc92-5a16-47ec-a776-6ef956457c47"
id
"chatcmpl-R0pOEtwB791zW8Jc91VzvLxC"
is_byok
false
latency
56
model_permaslug
"google/gemma-3-12b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771615618-iq0853qUJCIXKOgEZDXX"
upstream_id
"chatcmpl-R0pOEtwB791zW8Jc91VzvLxC"
total_cost
0.00009725
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
27.17%
Matches word count
n/a
neededClean
false
words
481
0.00%
Dialogue to Total Word Ratio
Ratio: 52.39%, Deviation: 22.39%
neededClean
false
wordsTotal
481
wordsDialogue
252
13.5829%