NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7YXHFB5Q20HKEB0G2HZPH
Run Details
Model:
Gemma 3 12B
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
24.0s
Cost:
$0.000094
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.03%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0152%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The community hall smelled aggressively of sugar and desperation. Molly wrinkled her nose, peering through the smeared window at a gaggle of five-year-olds attempting to organize a conga line. Streamers in primary colours drooped like defeated flags. “Seriously, Bill?” she said, her voice low and bordering on disgusted. “A children’s party? Our career has really hit rock bottom.” Bill, a man whose enthusiasm for outlandish schemes outweighed his common sense by a considerable margin, grinned, adjusting his ridiculous fake handlebar moustache. “Think of the haul, Molly! Think of it! Every little darling’s birthday wishes, piled high! Remote-controlled cars, fluffy unicorns, enough Lego to build a small fortress! It's genius, I tell you!” “Genius is robbing a museum, Bill, not pilfering presents from toddlers. Their parents will be furious. And imagine the headlines: ‘Gift-Grabbing Grinches Strike! Children Weep!’” “Details, details! We’ll be in and out before the pinata explodes. Look, I’ve been observing the security. Minimal. One slightly bored-looking teenager manning the door.” He tapped a crumpled napkin covered in frantic sketches. “Operation Sparkle Thief is go.” Molly sighed, running a hand through her short, choppy hair. “And your plan? Elaborate, please. Because your last 'elaborate plan' involved a trained hamster and a lot of glitter.” “This is different! This is… strategic! First, we create a diversion. You’re the distraction.” “Me? How exactly am I supposed to distract twenty sugar-fueled children?” “You’ll be the… the magician!” Bill declared, puffing out his chest. “I acquired a magic kit. Slightly used, but functional. You pull a rabbit out of a hat, maybe a dove if we can find one. Kids love that stuff. While they’re mesmerized by your dazzling display of prestidigitation, I slip in, scoop up the presents, and we’re gone!” Molly stared at him, aghast. “You purchased a magic kit? Bill, you can barely tie your shoes without assistance.” “Hey! I watched a YouTube tutorial. I'm practically David Copperfield now. Think, Molly, think! The sheer chaos! The wonder in their eyes! Then, boom! Presents! It's a masterpiece of misdirection and larceny!” “It’s a recipe for disaster and potential restraining orders,” Molly countered. “And what about the parents? They're not going to stand around watching a mediocre magician while their children’s gifts vanish.” “They’ll be too busy taking photos and posting them on social media! 'Look at my kid being amazed by a spontaneous magic show!' It's perfect! Just picture it: a whirlwind of colour, laughter, and then…POOF! Presents gone! We'll be swimming in toys!" Molly pinched the bridge of her nose. "Okay, let's say, just *say*, I go along with this spectacularly ill-conceived plan. What's my cue?" Bill beamed, nearly dislodging his moustache. "When the teenager briefly glances towards the buffet table to refill the juice boxes. That's Showtime, Molly! That's when the magic begins!”
created_at
"2026-02-20T19:20:54.076Z"
2/20/2026, 7:20:54 PM
model
"google/gemma-3-12b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
144
moderation_latency
(null)
generation_time
23893
tokens_prompt
93
tokens_completion
810
native_tokens_prompt
75
native_tokens_completion
702
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00009426
router
(null)
provider_responses
0
endpoint_id
"eb06dc92-5a16-47ec-a776-6ef956457c47"
id
"chatcmpl-R3Sf2QZ5US9mi3sMfafG0SCm"
is_byok
false
latency
52
model_permaslug
"google/gemma-3-12b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771615254-mjGeJWgfej9zPqwucC1y"
upstream_id
"chatcmpl-R3Sf2QZ5US9mi3sMfafG0SCm"
total_cost
0.00009426
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.03%
Matches word count
n/a
neededClean
false
words
470
0.00%
Dialogue to Total Word Ratio
Ratio: 75.32%, Deviation: 25.32%
neededClean
false
wordsTotal
470
wordsDialogue
354
0.0152%