NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY9327KMJVNVXV71AQ2ZC5A
Run Details
Model:
Gemma 3 12B
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
14.0s
Cost:
$0.000099
Provider:
DeepInfra
Evaluation summary
Score
Evaluator
Details Link
68.10%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
34.0511%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The inflatable bouncy castle shimmered in the afternoon sun, a lurid pink behemoth dominating Mrs. Higgins’ prize-winning petunias. A gaggle of miniature princesses and pirates shrieked with delight, fuelled by sugar and an evident lack of adult supervision. Molly surveyed the scene with a professional eye, chewing on the end of a pen. Beside her, Bill was attempting to camouflage himself behind a particularly bushy rhododendron, looking less like a master criminal and more like a startled garden gnome. “Are you *sure* this is the right address, Bill?” Molly asked, her voice low. “Because it looks… excessive. Even for Mrs. Higgins.” Bill peeked out from behind the rhododendron, twigs clinging to his tweed jacket. “Positive. Number 17 Primrose Lane. The flyer said ‘Extravaganza!’ I interpreted that as an opportunity.” “You interpreted it as a ludicrous risk, more like. Stealing presents from children, Bill? Really?” Molly gestured vaguely at the chaos unfolding. “We usually go for banks. Jewellery stores. Not… glitter cannons and pony rides.” “Look, Molly,” Bill said, pushing his spectacles further up his nose. "We're in a bit of a pinch, remember? Big Bart owes us a tidy sum, and frankly, the security at First National is tighter than a drum. Mrs. Higgins, on the other hand... seems to have entrusted the entirety of her valuables to a six-year-old’s security detail, which appears to consist of a small boy in a Spider-Man costume.” Molly sighed. "Don't remind me. He's got a water pistol. A *water pistol*, Bill. We’re robbing a children’s party.” “Think of the haul, Molly! Lego sets. Scalextric tracks. Possibly a drone! High-value commodities, all ripe for the taking.” Bill’s eyes gleamed with a suspicious excitement. “And the element of surprise! Who suspects a heist at a birthday party?” “Everyone, probably. Especially when they see a grown man trying to blend in with the hydrangeas.” Molly pointed. "And you're attracting attention, Bill. You look like you’re expecting a miracle, not planning a heist.” “Right, right.” Bill flinched and retreated further into the foliage. “Plan of action? You handle the distraction, I snatch the loot. Classic double-team.” “Distraction?” Molly raised an eyebrow. “What distraction? Are you going to offer them a magic trick? Because I’m pretty sure ‘Pull a rabbit out of a hat’ isn’t going to divert attention from a crying toddler who’s lost his balloon animal.” “Okay, fine. You improvise. You’re the creative one. I’ll focus on the… acquisition phase.” Bill fumbled in his pocket and produced a pair of oversized novelty sunglasses. "These might help me blend in? Think I look like a cool uncle?" Molly stared at him. "You look like a very confused pigeon. Just… stay out of sight, Bill. And for heaven’s sake, don’t talk to the Spider-Man." The air throbbed with the frantic beat of a children’s DJ, pumping out an endless loop of bubblegum pop. Molly took a deep breath and moved towards the heart of the party, steeling herself for the inevitable onslaught of sticky fingers and sugar-fueled mayhem. Operation Birthday Blitz was officially underway.
created_at
"2026-02-20T19:40:38.526Z"
2/20/2026, 7:40:38 PM
model
"google/gemma-3-12b-it"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
144
moderation_latency
(null)
generation_time
13880
tokens_prompt
93
tokens_completion
845
native_tokens_prompt
75
native_tokens_completion
736
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00009868
router
(null)
provider_responses
0
endpoint_id
"eb06dc92-5a16-47ec-a776-6ef956457c47"
id
"chatcmpl-R6JZsapVesG0k8XsswByoUHy"
is_byok
false
latency
43
model_permaslug
"google/gemma-3-12b-it"
provider_name
"DeepInfra"
status
200
api_type
"completions"
id
"gen-1771616438-7Tr0Oh5Flv5f13vFFrH0"
upstream_id
"chatcmpl-R6JZsapVesG0k8XsswByoUHy"
total_cost
0.00009868
cache_discount
(null)
upstream_inference_cost
0
provider_name
"DeepInfra"
Evaluation details
Result
Evaluator
Details
Meta Data
68.10%
Matches word count
n/a
neededClean
false
words
514
0.00%
Dialogue to Total Word Ratio
Ratio: 60.89%, Deviation: 30.89%
neededClean
false
wordsTotal
514
wordsDialogue
313
34.0511%