NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY96DNFCGQCVS7JC3PYJNR6
Run Details
Model:
Gemma 3 12B
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.3s
Cost:
$0.000099
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
40.61%
Dialogue to Total Word Ratio
(details)
70.2636%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The aroma of stale coffee and desperation hung heavy in Molly's cramped apartment. Balloons, remnants from a previous, less lucrative venture, still clung stubbornly to a corner. Bill, a man whose physique suggested a fondness for donuts and a deep distrust of exercise, was sprawled across a lumpy armchair, meticulously studying a flyer. “Right, so," Molly began, pacing like a caged ferret, "Operation Sugar Rush is a go. Little Timmy Thompson’s fifth birthday. Five-year-olds. Presents. Mountains of them.” Bill grunted, not looking up. "Mountains of what, exactly? Mostly plastic junk, I'm guessing. Limited resale value.” “Bill, have you *seen* what parents buy for five-year-olds these days? Robotic dinosaurs! Talking unicorns! Personalized slime kits! It’s a goldmine!” Molly gestured wildly, nearly knocking over a precarious stack of stolen garden gnomes. “Slime kits. Wonderful. Think anyone’s going to be clamoring for personalized lime-green slime shaped like a pug?” “Don’t be a pessimist! People buy ridiculous things. My point is, we’re aiming for volume, not quality. We overload the van, unload the van at a discount warehouse, and boom! Profit!” Bill finally looked up, fixing her with a skeptical stare. "And how, precisely, do we intend to insert ourselves into a children’s party without raising… alarm?” “Charm, Bill! Sheer, unadulterated charm. We pose as entertainers. I'll be ‘Madame Mystique the Magnificent,’ purveyor of wonder and… minor illusions.” Molly struck a dramatic pose, nearly tripping over a loose rug. Bill winced. “Madame Mystique? Seriously? You look like you just rolled out of bed. And 'minor illusions’ involves what, exactly? Making a rabbit disappear that you borrowed from the pet store two blocks over?” “Details, details! You’ll be ‘Barnaby the Clown,’ purveyor of balloons and… general merriment.” Bill groaned. “I am not wearing a clown suit, Molly. Absolutely not. I’ll attract more attention than a flock of pigeons at a picnic table.” “Oh, stop being dramatic. It’s a rented clown suit. Extra-large, to accommodate your… generous physique.” Molly smirked. “Besides, you’re good at the goofy voice.” “I only did the goofy voice once, Molly! At Uncle Harold’s funeral! It was immensely inappropriate!” “It was hilarious! And it’ll work wonders with a bunch of five-year-olds. Think about it: balloon animals, silly faces, and strategically placed distraction while I… subtly liberate the presents.” Molly’s eyes gleamed with mischief. “Subtly? You have the subtlety of a rhinoceros in a tutu.” “Hey! That’s a bit harsh. Just focus on the balloon animals, Bill. And try not to sat on the clown shoes. Last time, we had to return them with… significant damage.” Bill sighed, rubbing his temples. “So, let me get this straight. I’m going to be a clown, surrounded by screaming children, while you’re pulling off elaborate present-snatching maneuvers disguised as a magician. And the loot will likely be radioactive slime and plastic dinosaurs.” Molly grinned, grabbing a handful of glitter and tossing it into the air. "Exactly! Operation Sugar Rush. It’s going to be legendary!”
created_at
"2026-02-20T19:42:28.537Z"
2/20/2026, 7:42:28 PM
model
"google/gemma-3-12b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
384
moderation_latency
(null)
generation_time
15952
tokens_prompt
93
tokens_completion
863
native_tokens_prompt
75
native_tokens_completion
739
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00009907
router
(null)
provider_responses
0
endpoint_id
"eb06dc92-5a16-47ec-a776-6ef956457c47"
id
"chatcmpl-RO6WtaD1Oaux7YOCwjJKVJwV"
is_byok
false
latency
48
model_permaslug
"google/gemma-3-12b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771616548-vRRcRy3u1suUWaDe6V6S"
upstream_id
"chatcmpl-RO6WtaD1Oaux7YOCwjJKVJwV"
total_cost
0.00009907
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
497
40.61%
Dialogue to Total Word Ratio
Ratio: 74.25%, Deviation: 4.25%
neededClean
false
wordsTotal
497
wordsDialogue
369
70.2636%