NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7TW3TMZH1JWDYR047WZM0
Run Details
Model:
Gemma 3 12B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
15.3s
Cost:
$0.000086
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
2.65%
Dialogue to Total Word Ratio
(details)
1.3261%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of 'Brenda’s Bait & Tackle’ smelled powerfully of fish and regret. Molly was meticulously layering a rainbow wig over her already vibrant purple hair. Bill, slumped on a fishing crate, was attempting to inflate a giant inflatable dinosaur. He was failing spectacularly. “Are you *sure* about this, Bill?” Molly asked, not looking up. "Stealing presents from children? It's a bit… low, even for us." Bill let out a frustrated puff of air that barely registered on the dinosaur. “Low? Molly, we’re talking about a potential haul of *hundreds* of toys! Think of the resale value! Plus, look at Mrs. Gable’s prize-winning poodle, Fluffy. Those tiny sweaters alone could fetch a fortune on Etsy." Molly finally turned, eyebrow raised. “You’re planning to steal dog sweaters, Bill? That's… incredibly specific.” “Hey, details matter! This isn’t some smash-and-grab operation. This is precision, this is artistry! This,” he gestured weakly at the limp dinosaur, “is the key.” “The inflatable dinosaur is the key to stealing presents from a five-year-old’s birthday party? Please elaborate.” “Distraction! I'll wander through, all bewildered and amazed by the dinosaur. Everyone will be looking at *me*! You swoop in, Santa-style, grab the presents, and we're out! Like ninjas… but with better fashion choices.” He pointed to Molly's increasingly flamboyant wig. Molly sighed. “Santa doesn’t usually wear a rainbow wig, Bill. He’s more of a red suit kind of guy.” “Details, Molly! Branding! We’re going for a modern Santa, the Santa of the 21st century. The, uh, ‘Festival Santa’.” "Festival Santa?" "It's got a ring to it, doesn't it? Anyway, I’ve already scoped out the location. Gable’s garden is a fortress of bouncy castles and sugar-fueled chaos. Perfect for a diversion.” “And children. Remember, Bill. Small, sticky children." “They’ll be too busy admiring the dinosaur! Besides, I’m bringing glitter. Everyone likes glitter." "Glitter is the herpes of the crafting world, Bill. It gets everywhere, and it refuses to go away." “Think big, Molly! Think shimmering, celebratory chaos! Now, can you give me a hand with this thing? It feels like I'm wrestling a very large, very stubborn balloon." Molly reluctantly approached the dinosaur, poking it with a finger. “Right. So, we’re going to dress up as a vaguely festive giant reptile and steal toys from children. Do you ever worry about karma, Bill?” Bill puffed out his cheeks. “Karma? Molly, we’re thieves! Karma left us at the door. Now, hold this leg, would you? And try to look vaguely… jolly.”
created_at
"2026-02-20T19:18:41.538Z"
2/20/2026, 7:18:41 PM
model
"google/gemma-3-12b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
191
moderation_latency
(null)
generation_time
15183
tokens_prompt
93
tokens_completion
709
native_tokens_prompt
75
native_tokens_completion
640
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0000862
router
(null)
provider_responses
0
endpoint_id
"eb06dc92-5a16-47ec-a776-6ef956457c47"
id
"chatcmpl-Rd84iNYJfZl0T1XWkOkS9qP2"
is_byok
false
latency
111
model_permaslug
"google/gemma-3-12b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771615121-nfLiENZyrKyVaqH1aprC"
upstream_id
"chatcmpl-Rd84iNYJfZl0T1XWkOkS9qP2"
total_cost
0.0000862
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
419
2.65%
Dialogue to Total Word Ratio
Ratio: 78.52%, Deviation: 8.52%
neededClean
false
wordsTotal
419
wordsDialogue
329
1.3261%