NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAG73GXC8YWW7J61SAH33E
Run Details
Model:
Gemma 3 27B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
24.3s
Cost:
$0.000109
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
2.54%
Dialogue to Total Word Ratio
(details)
1.2694%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
“Right, so let me get this straight,” Bill said, meticulously polishing his sunglasses with a napkin pilfered from a diner. “We’re robbing…a birthday party?” Molly rolled her eyes, expertly picking the lock on a suspiciously flimsy briefcase. “It’s not *robbing*, Bill, it’s…strategic gift acquisition. And yes. Little Timmy Henderson is turning six. His parents clearly have money to burn, and judging by the guest list, everyone’s bringing excessively overpriced plastic junk.” “Excessively overpriced plastic junk that’s going to a six-year-old,” Bill pointed out. “Feels…low, even for us.” Molly snapped the briefcase open, revealing a frankly alarming array of suction cups, miniature grappling hooks, and a tiny drone painted to resemble a ladybug. “Low? Bill, we once extracted a priceless Faberge egg from a chihuahua. A chihuahua! This is practically humanitarian work. Think of the resale value on a limited edition Robo-Rex!” “Robo-Rex?” “Apparently. Timmy’s obsessed. It’s the centerpiece of the gift table. That’s where we focus.” “Focus. Right. And the plan?” Bill asked, still looking dubious. He put the sunglasses on, despite being indoors. “Simple. You create a diversion. Something…loud. Maybe a rogue clown?” “A rogue clown? Molly, seriously?” “It’s a children’s party, Bill! They *expect* a clown! A slightly unhinged one won't raise any eyebrows. Just juggle flaming torches, accidentally release a flock of pigeons, the usual.” Bill groaned. “Flaming torches. Pigeons. My union rep is going to have a field day with this one.” “Just picture it,” Molly said, ignoring him, and unfurling a detailed floor plan of the Henderson’s backyard. “While everyone’s distracted by your…performance, I slip through the back patio, disable the security – a pathetic motion sensor, honestly – and sweep the present table. In and out, clean and efficient.” “What about the parents?” “They’ll be too busy trying to figure out why a clown is setting the inflatable castle on fire.” “Hopefully. What if Timmy starts crying?” Molly sighed. "Bill, don't you *dare* get sentimental on me now. This is a professional operation. Think of the Robo-Rex. Think of the…the miniature train set I spotted on the registry." “A train set? Okay, you’re talking my language now. But still. A birthday party. Feels a bit…Grinchy.” “It feels lucrative, Bill. That’s what it feels. Now, are you in, or are you gonna sit here and wax poetic about childhood innocence while I raid the gift table solo?” Bill adjusted his sunglasses again. “Fine. Rogue clown it is. But I’m getting hazard pay.” “Just try not to accidentally sit on any of the children.”
created_at
"2026-02-20T20:05:18.077Z"
2/20/2026, 8:05:18 PM
model
"google/gemma-3-27b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1589
moderation_latency
(null)
generation_time
24191
tokens_prompt
93
tokens_completion
749
native_tokens_prompt
75
native_tokens_completion
645
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0001092
router
(null)
provider_responses
0
endpoint_id
"8f22002c-c045-446f-a1b9-9896133536b8"
id
"chatcmpl-RQ3WAAjMupy6Vp6qpVVV3JLs"
is_byok
false
latency
54
model_permaslug
"google/gemma-3-27b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771617918-MBJGTPsTKFqAe2DkpQ4a"
upstream_id
"chatcmpl-RQ3WAAjMupy6Vp6qpVVV3JLs"
total_cost
0.0001092
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
420
2.54%
Dialogue to Total Word Ratio
Ratio: 78.57%, Deviation: 8.57%
neededClean
false
wordsTotal
420
wordsDialogue
330
1.2694%