NC Bench
Run: 2026-02-20-01KHYA7QGMGZHVCJAKPBCN90PH
Run Details
Model: Gemma 3 27B
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 18.1s
Cost: $0.000114
Provider: DeepInfra
Evaluation summary
Matches word count: 0.00%
Dialogue to Total Word Ratio: 0.00%
Overall: 0.0000%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The chipped Formica of the diner booth felt cold under Molly’s elbows. She stabbed a french fry with unnecessary force. Across from her, Bill was meticulously folding and refolding a napkin. It was a nervous habit. He got twitchy before a job. “So, the bounce house is… secure?” Molly asked, not looking up. Bill stopped his origami. “Relatively. Brenda, the mom, is obsessed with glitter. Glitter *everywhere*. It’s a tactical distraction, honestly. She’ll be too busy vacuuming to notice a couple of seasoned professionals relieving her of a mountain of plastic junk.” “Seasoned is generous. We stole a garden gnome and a bag of gummy bears last month, Bill. That barely qualifies us for amateur hour.” “Hey! That gnome was porcelain! And the bears were *sour* gummy bears. Higher risk, higher reward.” Molly rolled her eyes. “Right. Okay, so glitter. Bounce house. What about the kids?” “That’s where you come in, Sunshine. Your uncanny ability to blend in with small humans is a gift.” Bill finally abandoned the napkin and leaned forward. “Fake enthusiasm. Ask about superheroes. Compliment questionable art projects. You’re a natural.” “I’m a thirty-year-old woman pretending to be thrilled about finger painting. It’s degrading.” “Degrading but lucrative. Think of the payday. Enough for that vintage espresso machine you’ve been eyeing.” Molly’s eyes flickered. “The La Pavoni?” “The La Pavoni is within reach, Molly. Within reach. The plan is simple. I create a minor diversion – tripped extension cord, rogue balloon animal, something. You melt into the present pile, use the oversized gift bags as concealment, and start loading. I’ll signal when Brenda’s focusing on the cake.” “‘Signal’ being you loudly ask where the bathroom is?” “Precise timing is crucial. And multitasking. I’ll also subtly gauge the quality of the cake. 
Information gathering, Molly, information gathering.” “You’re going to critique a five-year-old’s birthday cake while I’m stuffing a Thomas the Tank Engine train set down my pants?” “It’s not down your pants! It's into the bags! Strategic concealment!” Bill’s voice was rising. “Look, think of it: Power Rangers action figures, Lego castles, enough Barbie accessories to open a boutique. We’ll be set for months.” “What if they have a dog?” Bill paused. “Brenda didn’t mention a dog.” He pulled a crumpled flyer from his pocket — “Barnaby’s Birthday Bash! 3 PM - 5 PM!” — and squinted at the fine print. “Wait a minute… ‘Please, no gifts containing small parts. Barnaby has a curious canine companion.’ A canine companion?!” Molly groaned. “Of course. It's always a canine companion. Just our luck.” “Okay, new plan. We go in early, bribe the dog with bacon. The La Pavoni is at stake, Molly. *Bacon*.”
created_at
"2026-02-20T20:00:39.966Z"
2/20/2026, 8:00:39 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
159
moderation_latency
(null)
generation_time
17919
tokens_prompt
93
tokens_completion
777
native_tokens_prompt
75
native_tokens_completion
675
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000114
router
(null)
provider_responses
0
endpoint_id
"8f22002c-c045-446f-a1b9-9896133536b8"
id
"chatcmpl-R6GW0QJ5qD8A5th33ar7vLRZ"
is_byok
false
latency
65
model_permaslug
"google/gemma-3-27b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771617639-cAZ8P2Bzjf36X9UAhFSd"
upstream_id
"chatcmpl-R6GW0QJ5qD8A5th33ar7vLRZ"
total_cost
0.000114
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
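The timing fields above can be cross-checked against the reported duration with a short sketch. It assumes that `latency` and `generation_time` are in milliseconds and that the reported duration is roughly their sum; neither assumption is stated on this page. The function names are hypothetical.

```python
# Values copied from the Meta fields above; millisecond units are an assumption.
latency_ms = 159
generation_time_ms = 17919
native_tokens_completion = 675


def total_duration_s(latency_ms: int, generation_time_ms: int) -> float:
    """Assumed relation: total duration = latency + generation time."""
    return (latency_ms + generation_time_ms) / 1000


def tokens_per_second(tokens: int, generation_time_ms: int) -> float:
    """Completion throughput over the generation window only."""
    return tokens / (generation_time_ms / 1000)


duration = total_duration_s(latency_ms, generation_time_ms)
throughput = tokens_per_second(native_tokens_completion, generation_time_ms)
print(f"{duration:.1f}s total, {throughput:.1f} tokens/s")
```

Under these assumptions the sum is 18.078 s, which rounds to the 18.1s shown in Run Details.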
Evaluation details
Matches word count: 0.00%
Details: n/a
Meta Data: neededClean: false, words: 443
Dialogue to Total Word Ratio: 0.00%
Details: Ratio: 80.14%, Deviation: 30.14%
Meta Data: neededClean: false, wordsTotal: 443, wordsDialogue: 355
Overall: 0.0000%
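The ratio and deviation reported above follow from the two word counts. Below is a minimal sketch of that arithmetic, plus one possible way to count dialogue words. The quote-matching rule, the 50% target default, and both function names are assumptions for illustration; the benchmark's actual evaluator is not shown on this page and may count differently.

```python
import re


def ratio_and_deviation(words_dialogue: int, words_total: int, target: float = 0.5):
    """Return (dialogue ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


def count_dialogue_words(text: str):
    """Count (dialogue words, total words) by whitespace splitting.

    Assumption: anything between curly or straight double quotes is dialogue.
    """
    spans = re.findall(r'“([^”]*)”|"([^"]*)"', text)
    dialogue = sum(len((curly or straight).split()) for curly, straight in spans)
    return dialogue, len(text.split())


# Counts taken from the evaluation details above.
ratio, deviation = ratio_and_deviation(355, 443)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

With 355 dialogue words out of 443, this gives a ratio of 80.14% and a deviation of 30.14% from the 50% target, matching the figures in the details. How that deviation maps to the 0.00% score is not specified here.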